Build a Hadoop program

$30-80 AUD

Cancelled
Posted almost 5 years ago

Paid on delivery
This is a continuing assignment, so I am including the description of Test 1 and its solution file, but I want the solution for Test 2. The data file for this test is also included. This is basically a Hadoop program.

Test 1 - Python

Use the movie ratings data set of 1 million records ([login to view URL], [login to view URL], [login to view URL]). Please write Python/MapReduce code (a mapper and a reducer) to answer the following research question: "What are the most popular movies for different age groups?"

Data set [login to view URL] has information about age groups:

* 1: "Under 18"
* 18: "18-24"
* 25: "25-34"
* 35: "35-44"
* 45: "45-49"
* 50: "50-55"
* 56: "56+"

Your code should output, for each age group, the ID of the movie that has the highest number of ratings, together with that number. If you want, you can also provide the name of the movie; this is optional.

To achieve the first task, you can join [login to view URL] and [login to view URL] and get the most popular movie IDs. For the optional task, you can produce two MapReduce programs (that is, mapper1, reducer1, mapper2, reducer2). The first one joins [login to view URL] and [login to view URL] and gets the most popular movie IDs. The second one joins your result with [login to view URL] and outputs movie titles. If you go this way, you should give me instructions on which mapper/reducer to use first and what data to load into each of them.

Your submission will include three files: the mapper, the reducer and the result output from Hadoop (the part-00000 file). If you decide to do the optional task, you will submit more files plus instructions on how to use them. Either way, you don't need to submit data files.

Hadoop Test 2 - Pig

Test 2 is to complete the same optional task as in Test 1, i.e., provide the movie name for the movie that has the highest number of ratings, and that number, for each age group. The only difference is that now you have to use Pig and Pig Latin.

This task requires "normal" programming logic: load three data sets, join the first and second, then join the resulting set with the third one, group, aggregate, and probably group again to find the maximum. You have to submit two files: the Pig Latin script and the Hadoop/MapReduce output with the results.
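The load, join, group, aggregate and find-maximum steps described above can be sketched in plain Python to check the logic before writing the Pig Latin script. This is only an illustration, not the requested deliverable. It assumes the three data sets use `::`-separated fields with the layouts user ID, gender, age code, ... for users; user ID, movie ID, ... for ratings; and movie ID, title, ... for movies. The posting does not state these layouts, so they are assumptions, and the function name `most_popular_by_age_group` is hypothetical.

```python
from collections import Counter

# Age codes and labels exactly as listed in the task description.
AGE_GROUPS = {
    1: "Under 18", 18: "18-24", 25: "25-34", 35: "35-44",
    45: "45-49", 50: "50-55", 56: "56+",
}


def most_popular_by_age_group(user_lines, rating_lines, movie_lines, sep="::"):
    """Return {age label: (movie_id, title, rating_count)} for the most-rated movie.

    Assumed field layouts (not confirmed by the posting):
      users:   user_id, gender, age_code, ...
      ratings: user_id, movie_id, ...
      movies:  movie_id, title, ...
    """
    # Load users: user_id -> age code.
    age_of = {}
    for line in user_lines:
        fields = line.strip().split(sep)
        age_of[fields[0]] = int(fields[2])

    # Load movies: movie_id -> title.
    title_of = {}
    for line in movie_lines:
        fields = line.strip().split(sep)
        title_of[fields[0]] = fields[1]

    # Join ratings with users, then group and count by (age code, movie_id).
    counts = Counter()
    for line in rating_lines:
        fields = line.strip().split(sep)
        user_id, movie_id = fields[0], fields[1]
        if user_id in age_of:
            counts[(age_of[user_id], movie_id)] += 1

    # Group again by age code and keep the movie with the largest count.
    # On a tie, the pair that was counted first is kept.
    best = {}
    for (age, movie_id), n in counts.items():
        if age not in best or n > best[age][1]:
            best[age] = (movie_id, n)

    # Join the winners with the movie titles.
    return {
        AGE_GROUPS.get(age, str(age)): (movie_id, title_of.get(movie_id, "?"), n)
        for age, (movie_id, n) in sorted(best.items())
    }
```

Each argument is any iterable of text lines, so open file objects can be passed directly. In a Pig Latin script the same stages would roughly correspond to LOAD, JOIN, GROUP with COUNT, and a second GROUP to pick the maximum per age group.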
Project ID: 19858406

About the project

8 proposals
Remote project
Active 5 yrs ago

8 freelancers are bidding on average $81 AUD for this job
Hi, I'm a Hadoop developer with over 5 years of experience and expertise in different tools and technologies, including Sqoop, Flume, Oozie, Hive, Pig and Spark. I have successfully delivered over 100 projects here on Freelancer. I'm currently pursuing my Master's in Data Science at Trinity College Dublin and would like to work on this project. Thanks, Tousif
$80 AUD in 3 days
4.9 (66 reviews)
5.8
Dear Employer, I have extensive experience in MapReduce programming using Hadoop and Java. I can finish the work as per your requirements. Please let me know if you are interested.
$75 AUD in 3 days
4.9 (69 reviews)
5.4
Hello, I am good with the Hadoop ecosystem. I have gone through your problem statement and I can solve your second problem, Hadoop Test 2 - Pig. Let's chat to explore more.
$70 AUD in 4 days
5.0 (5 reviews)
3.2
Hi, I have good experience in Hadoop and MapReduce programming. I have 4+ years of experience in Hive, MapReduce and Pig. Please give me the opportunity to start the work. Thanks, Akram
$88 AUD in 3 days
5.0 (1 review)
1.2
Dear Prospective Hiring Manager, thank you for giving me a chance to bid on your project. I am a serious bidder here, I have already worked on a similar project before, and I can deliver as you have mentioned. "I can do this job and give you an efficient job that will be very acceptable and presentable. I and my team work on web development and mobile apps and I can assure you that you will never be disappointed."
$72 AUD in 7 days
0.0 (3 reviews)
0.0
I have already worked on this MovieLens data set. I am a fast, accurate, reliable and results-oriented Virtual Assistant. I believe in delivering accurate results within the expected turnaround time. I have 8 years of work experience. I am an expert in the Hadoop ecosystem: HDFS, MapReduce, Sqoop, Hive, Pig, Spark, Scala, Kafka, Spark SQL, Spark Streaming, Spark graph, RDD, Dailymotion transcoding, AWS, AWS transcoding, AWS S3 and several web services. I am interested in doing this project too. Looking forward to your reply!
$111 AUD in 6 days
0.0 (0 reviews)
0.0

About the client

Flag of AUSTRALIA
Adelaide, Australia
0.0
0
Payment method verified
Member since Jun 2, 2019

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)