R script to merge & clean-up tabular data from 13 governmental reports #medicines

€8-30 EUR

Closed
Posted over 6 years ago

Paid on delivery
pmprb.R collects tables listing Patented Drug Products from 13 annual reports of the Patented Medicine Prices Review Board. The task is to reorganise this information into a single dataframe.

The main challenge is that the tables for 2004 and 2012 are extracted from PDFs (the only format available), while the others come from HTML tables. As a result, the 13 tables to be merged do not have an identical format when read into R. All tables nevertheless contain similarly structured information, although some years have additional columns.

The merged dataframe should contain only 7 columns:
[login to view URL]
Company
DIN
[login to view URL] (data only available until 2009)
ATC (data only available until 2009)
Status
[login to view URL]

The resulting dataframe must then be merged with [login to view URL] using by.x = "[login to view URL]", by.y = "DrugName". This is the final product: the deliverable must be an R script that returns this merged dataframe.

Please first submit your budget and delivery time for the project. I will then get back to you shortly to let you know whether your bid is accepted. Thank you!
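The harmonise-then-merge step described above could be sketched roughly as follows in base R. This is only an illustration under stated assumptions, not the contents of pmprb.R: three of the seven column names are elided in the posting, so `BrandName`, `GenericName` and `Year` are hypothetical placeholders, the toy tables and the `lookup` table are invented for the example, and the left join (`all.x = TRUE`) is an assumption because the brief does not say which rows to keep.

```r
# Hypothetical target columns: BrandName, GenericName and Year are placeholders
# for the three names that are elided in the posting.
target_cols <- c("BrandName", "Company", "DIN", "GenericName",
                 "ATC", "Status", "Year")

# Add any missing target column as NA, drop extra columns, and coerce
# everything to character so tables from PDF and HTML sources stack cleanly.
harmonise <- function(df, cols = target_cols) {
  for (col in setdiff(cols, names(df))) {
    df[[col]] <- NA_character_
  }
  df <- df[, cols, drop = FALSE]
  df[] <- lapply(df, as.character)
  df
}

# Toy stand-ins for two of the 13 yearly tables (invented values).
tab_2004 <- data.frame(
  BrandName = c("DrugA", "DrugB"), Company = c("Acme", "Beta"),
  DIN = c("02240001", "02240002"), GenericName = c("alpha", "bravo"),
  ATC = c("A01", "B02"), Status = c("Within guidelines", "Under review"),
  Year = "2004", stringsAsFactors = FALSE
)
tab_2012 <- data.frame(
  BrandName = "DrugC", Company = "Gamma", DIN = "02240003",
  Status = "Under review", Strength = "10 mg", Year = "2012",
  stringsAsFactors = FALSE
)

# Stack the harmonised tables into one dataframe.
merged <- do.call(rbind, lapply(list(tab_2004, tab_2012), harmonise))

# Hypothetical lookup table standing in for the elided second dataset.
lookup <- data.frame(DrugName = c("DrugA", "DrugC"), Extra = c("x", "y"),
                     stringsAsFactors = FALSE)

# Left join on the drug name; the join type is an assumption.
final <- merge(merged, lookup, by.x = "BrandName", by.y = "DrugName",
               all.x = TRUE)
```

Columns that a given year lacks (here `GenericName` and `ATC` for the 2012 toy table) come through as `NA`, which matches the note that some fields are only available until 2009.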
Project ID: 15983376

About the project

20 proposals
Remote project
Active 6 yrs ago

20 freelancers are bidding on average €42 EUR for this job
Hi, I am a very experienced statistician and academic writer. I have completed several PhD-level thesis projects involving advanced statistical analysis of data. I have worked with data from several companies and have done projects that required high-level quantitative analysis and data interpretation to study trends and time behaviour and to compare variables in the data. I can do advanced analysis in SPSS, R, WEKA, Tableau and Excel, including machine learning, hypothesis testing, forecasting, t-tests and ANOVA. Looking forward to discussion, Best Regards, Suyash
€75 EUR in 3 days
4.7 (89 reviews)
6.9
I am Prajwal Bhatt, a final-year undergraduate student at IIT Roorkee. I am a creative, flexible and focused individual who is passionate about Data Science, Natural Language Processing and the entire Analytics space. I have acquired knowledge of R programming, Python and C++. Further, I have exposure to Machine Learning algorithms such as Neural Networks, Regression, Decision Trees and Clustering algorithms. I have previously worked as a data science intern at several recognized companies and start-ups, including Schlumberger, Razorpay and KUAI (Israel, remote internship), to name a few.
€30 EUR in 1 day
4.9 (29 reviews)
4.9
Hello, I'm a data scientist with 3 years of experience. I often use R and Python for my work. I can finish your task in 2 days and deliver the table you requested along with an R script. Sincerely
€133 EUR in 2 days
5.0 (1 review)
2.2
Hi. I have been using R for the past 3 years and I have implemented many bioinformatics models with it. Coincidentally, I have been pruning drug-related data. I am sure that I can help you with your project. Kindly let me know more about the project, or if you have any questions.
€34 EUR in 1 day
3.6 (2 reviews)
2.4
A proposal has not yet been provided
€55 EUR in 5 days
3.6 (1 review)
2.6
I have extensive experience in R programming, data cleaning and collating "messy" data from multiple sources.
€34 EUR in 2 days
5.0 (1 review)
1.2
I wrote my master's thesis on the R programming language and data preprocessing. I think I can do what you asked for.
€29 EUR in 2 days
0.0 (0 reviews)
0.0
Hi, please share more details so that I can review and start your project. I can do it easily. Looking forward
€23 EUR in 1 day
0.0 (0 reviews)
0.0
I am familiar with a package that reads tables from PDF and HTML files. I have also done some data extraction with the tabula package before.
€34 EUR in 2 days
0.0 (0 reviews)
0.0
I am ready to work on your project. I have worked on many applications based on the R programming language. I am not an expert in text mining, but I will definitely complete your work within 5 days. I am only promising what I can do. Give me a chance and I will put 100% into your project.
€24 EUR in 5 days
0.0 (0 reviews)
0.0
A proposal has not yet been provided
€34 EUR in 5 days
0.0 (0 reviews)
0.0
A proposal has not yet been provided
€34 EUR in 1 day
0.0 (0 reviews)
0.0
Dear Sir, I have more than 15 years of experience as a bioinformatician. Your project looks, a priori, easy. One of my main tasks is merging and structuring information from different databases (global systems biology analysis), so I can do this without major problems. Best regards, Joan
€23 EUR in 10 days
0.0 (0 reviews)
0.0
Hi, I have 6+ years of experience overall, mainly in Predictive Modelling. When building a statistical or predictive model, we typically spend almost 80% of the time on data slicing and dicing alone. I have worked on many projects that required extensive data mining in SAS, SQL, Excel and R. Given a chance, I can complete the project on time at a lower price. Looking forward to a positive response. Thanks, Pavan
€24 EUR in 2 days
0.0 (0 reviews)
0.0
Hi, I have just written an R script to merge data frames. The data frames must share a common column in order to be merged. I am now working on extracting the information from the PDFs. Looking forward to hearing from you. Oscar
€50 EUR in 5 days
0.0 (0 reviews)
0.0
I have reviewed the data. PDF conversion is somewhat difficult; it is better to put the tables in Excel and edit them manually. The final merge requires additional information, as the drug name needs to be associated with its formulation. This will require domain knowledge: for example, is 0.03% equivalent to 3 mg/mL? The budget is not sufficient given the complexity of the data, but the work could be done in one day.
€39 EUR in 2 days
0.0 (0 reviews)
0.0

About the client

Flag of GERMANY
Frankfurt, Germany
5.0
15
Payment method verified
Member since Mar 2, 2016

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)