Analyze some Data
$30-250 CAD
Paid on delivery
We require a freelancer to data mine the Wikipedia categories entitled "Natural Science", "Physical Science", "Formal Science" and "Applied Science", for:
1) All Pure and Applied Scientific Discipline page_titles (fields of study, for example: Physics, Biology, Mechanics, Evolutionary biology, Astronomy, Thermochemistry, Surgery, Electronic engineering)
2) The first paragraph of the article in question (omitting any Greek, Latin or other etymological information)
3) The category type (NATURAL, FORMAL, OR APPLIED). Note that all disciplines under the Category: Physical science must be considered as NATURAL.
4) The relationship of each page_title with all other page_titles - For example: Physics is a parent discipline of Mechanics. This is crucial to building a structured hierarchy.
Note - We do not want the following:
a) Pseudosciences (i.e., Fringe sciences such as Astrology, Telekinesis, Sci-Fi Sciences and technologies etc)
b) Social sciences (Cognitive science, History, Economics, Sociology, Psychology, Cognitive Science, Philosophy, etc)
c) Any people (scientists, professionals, historical figures etc)
d) Any Opinion pieces and/or other articles that do not conform to Wikipedia's Neutral Point of View (NPOV) - see [url removed, login to view]:NPOV_dispute
Data must be obtained as per the fields described in the attached WIKI_DATA_OUTPUT excel file.
A rudimentary decision tree entitled "Pure and Applied Disciplines Decision Tree", describing several analysis criteria and methods for determining whether or not a page_title is a scientific discipline, has been provided, along with the supporting keyword library excel files ("Pure Libraries" and "Applied Libraries"). We recommend that the freelancer download the XMind software to visualize this tree and get a better idea of how we want this information to be gathered.
We leave it up to the freelancer whether to follow our decision tree, or instead to disregard it in favor of other data mining methods. We of course welcome any and all suggestions for more accurate analysis criteria and methods, should the freelancer find more suitable and performant alternatives.
Project ID: #7732100
About the project
9 freelancers are bidding on average $238 for this job
Hello there, I will like to help you out with this project. I know my offer can by a little over your budget but that is because I have plenty of experience in data mining tools and techniques so I know you are going t More
I am an expert in scraping and crawling and look forward to discuss further about the project requirements
i can do your project please discus your project .i can do your project please discus your project .i can do your project please discus your project .i can do your project please discus your project .i can do your proj More
*** Do You Like Getting the Results You Want? *** Need to Stay on Budget?***... Hi, I think you just found the right candidate here! I will make sure to deliver the best as per your requirements. I am an experienced More
I am an ex MNC employee. I understand the nuances of business and the importance of quality and TAT. I have recently left my job to be a stay home dad and am new to Freelancer. Prior to my sabbatical, I was Manager Adv More
Good day If you are looking for hard working and efficient freelancers, then we are the team for you. We hope you consider our bid and look forward to working with you. Regards Gary / Daniel