
Write some Software

$10-30 USD

Closed
Posted over 7 years ago

Paid on delivery
Hi,

Please find the project details below and let me know if you are interested in completing the project.

Data Acquisition:
Find or collect a data set of interest. There are many sources of data sets on the web. I would prefer the data to be reasonably large (this is a data mining class, after all), but very large data sets can bog down computers. R, for example, can easily handle data sets with tens or even hundreds of thousands of observations (depending on your computer). The lower limit for data size is n=1000, although I am willing to accept exceptions. See the links below for data sets that might interest you.

Data Analysis:
Consider your data carefully. Even if you downloaded it, you should look for information about it. This information should also be included in your proposal:
i. How was it collected?
ii. What are the data quality issues?
iii. Are there biases inherent in who collected the data or how it was collected?
iv. Is any data preparation needed?
v. What are these operations? How might they impact the subsequent conclusions?

Formulate questions that you would like to answer about this data set. You can follow the way the lecture notes list the questions (What is the dependent variable or variables? What are the predictors?).

Implement your analysis using data mining tools. These should have some relation to what we have learned in class. Are you doing a classification or clustering task? Can the data be expressed as a network of some kind? Are there interesting visualizations to do? How will you evaluate the performance of your model, or choose between competing models?

Results Analysis:
Gather the results from all individual steps or projects and run your analysis on them. This should include some fidelity criteria (performance evaluation) of the method.
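One common way to address the performance-evaluation point, and to choose between competing models, is k-fold cross-validation. The sketch below is only an illustration under stated assumptions: the brief does not prescribe this technique, the two models (a majority-class baseline and a 1-nearest-neighbour classifier) are stand-ins, all function names are hypothetical, and the synthetic data (n=200, smaller than the requested n=1000, purely to keep the example fast) is not a real data set.

```python
import random
from collections import Counter


def k_fold_indices(n, k, seed=0):
    """Shuffle row indices 0..n-1 and split them into k near-equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def majority_model(train_X, train_y):
    """Baseline: always predict the most common training label."""
    label = Counter(train_y).most_common(1)[0][0]
    return lambda x: label


def nearest_neighbour_model(train_X, train_y):
    """1-nearest-neighbour classifier using squared Euclidean distance."""
    def predict(x):
        best = min(
            range(len(train_X)),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)),
        )
        return train_y[best]
    return predict


def cross_val_accuracy(fit, X, y, k=5, seed=0):
    """Mean held-out accuracy of the model built by `fit` over k folds."""
    scores = []
    for held_out in k_fold_indices(len(X), k, seed):
        held = set(held_out)
        train = [i for i in range(len(X)) if i not in held]
        model = fit([X[i] for i in train], [y[i] for i in train])
        correct = sum(model(X[i]) == y[i] for i in held_out)
        scores.append(correct / len(held_out))
    return sum(scores) / len(scores)


def make_toy_data(n=200, seed=1):
    """Synthetic two-class data (assumption: a stand-in for a real data set)."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        centre = 0.0 if label == 0 else 3.0
        X.append((rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)))
        y.append(label)
    return X, y


X, y = make_toy_data()
results = {
    "majority baseline": cross_val_accuracy(majority_model, X, y),
    "1-nearest neighbour": cross_val_accuracy(nearest_neighbour_model, X, y),
}
for name, acc in results.items():
    print(f"{name}: mean accuracy {acc:.3f}")
```

Both models are scored on the same folds (same seed), so the comparison is like for like. In a real project the toy data would be replaced by the chosen data set, and accuracy could be swapped for whatever criterion suits the question being asked.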
Report Format:
Your paper should follow the IEEE/ACM standard (a .doc Word template is also given): [login to view URL] [login to view URL]
The paper should not exceed 6 pages in total (including references).
Times New Roman, 10pt font, single spacing.

Subjects:
You may work on any data set in the field you choose:
i. Databases and SQL
ii. Page Ranking and Web Mining
iii. Text Mining and NLP
iv. Image Mining and the Web
v. Any other data set; I'd prefer that you discuss it with me in advance.

Software:
Use whatever software you are comfortable with: STATISTICA Data Miner, KNIME, RapidMiner, Weka, SAS Enterprise Miner, Oracle Data Mining, or IBM SPSS Modeler. Programming languages (C#, C++, Java, Python, R) are of course fine as well.

Method:
You can use the method of your choice: conventional or non-conventional (neural nets, genetic algorithms, fuzzy logic, decision trees, frequent patterns, ...).

The University of California at Irvine has put together a large repository of data sets for machine learning at [login to view URL]
Another repository of data sets for data mining is at the University of Edinburgh.
Statlib is a general repository for all things statistical; it has a nice collection.
Ideas for projects from a previous lecturer of this class
New York City data sets from the Columbia Population Research Center
Data sets from Chance Magazine
Health data sets
[login to view URL]

Regards,
Pankaj
Project ID: 11860127

About the project

2 proposals
Remote project
Active 7 yrs ago

2 freelancers are bidding on average $21 USD for this job
9+ years of experience in machine learning, NLP, data extraction, building search engines, and Python. I can show you a quick demo to prove my expertise.
$25 USD in 1 day
0.0 (0 reviews)
Expert with more than 8 years of experience in descriptive and inferential statistics. Key techniques: regression models, binary logistic models, factor analysis, cluster analysis, neural networks, and parametric and non-parametric tests. Good at data visualization using data mining techniques such as CRT, QUEST, etc. Please refer to my clients' feedback (5-star rated). Kindly reach out to me for further discussion.
$17 USD in 1 day
0.0 (0 reviews)

About the client

NORWOOD, United States
3.1
2
Payment method verified
Member since Apr 14, 2016

Client Verification

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)