Closed

DataMining

This project received 8 bids from talented freelancers with an average bid price of $117 USD.

Get free quotes for a project like this
Employer working
Project Budget
$30 - $250 USD
Total Bids
8
Project Description

There are two sets of Wikipedia articles. The first set is from Wikipedia featured articles of a

certain type. The first set becomes class Featured. The second set of articles are Wikipedia (non-

featured) articles of similar type to featured articles. The second set becomes class Non-Featured.

We are dealing with a binary classification problem. 

To create attributes, extract all possible tokens from the entire dataset after stemming and stop-

word removal. Create 1-gram, 2-gram and 3-grams from these tokens. Use these n-grams as the

attributes for ARFF files. 

Perform attribute selection on each of 1-gram, 2gram, 3-gram an using information gain and gain

ratio. Perform classification using decision tree, and naïve Bayes. 

Make a Wiki report on your finding including various statistical evaluation measures given by

WEKA for each classifier.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online