Data comparison in batch - repost

Closed Posted Mar 7, 2014 Paid on delivery
Closed Paid on delivery

Develop a mechanism & software to identify similar content in a huge base of articles.

Input format from csv flat file. Output should tell which entries are similar with indicator of "similarity strength"

Language for this program is flexible as long as it deliver the result.

If you are interested, please give me a message and let me know how you want to start this. I can give you examples and our detail requirements.

Data Processing PHP Python Ruby Software Architecture

Project ID: #5525748

About the project

18 proposals Remote project Active Apr 13, 2014

18 freelancers are bidding on average $513 for this job

srinichal

I like to discuss further about the project details and also willing to get some sample data as well and interface.

$631 USD in 10 days
(121 Reviews)
7.2
zeke

Dear Customer! I am an expert PHP developer with over 6 years of experience and very interested to work on this project. Available to start immediately and finish as soon as possible. My bid is for fast professional s More

$515 USD in 10 days
(169 Reviews)
7.0
siddhu1986

A proposal has not yet been provided

$421 USD in 10 days
(155 Reviews)
6.6
chirgeo

Hi. Interesting project and I'm interested to be involved. What is the format of the article? Is a .txt file, .html ? or we need to compare 2 .csv file? It's not really clear about this and I would like to kno More

$526 USD in 5 days
(30 Reviews)
6.1
ebson

A proposal has not yet been provided

$250 USD in 2 days
(37 Reviews)
6.0
anuyadav1

A proposal has not yet been provided

$750 USD in 10 days
(23 Reviews)
4.9
suraj99p

This problem can be solved using dynamic programming. If you see auto suggest when we do google search, they also use this algorithm. We have concept of distance between two words which can be used to identify the simi More

$500 USD in 10 days
(10 Reviews)
4.6
ashishicfai

Hello sir , i read carefully your project descriptions. trust me, I will gave the best result for me. no milestone , no advance payment , required am having 4 year exp in this domain, there is not any loss awarding pro More

$360 USD in 9 days
(20 Reviews)
4.6
jaylancer43

Hello - I am an Expert Techno-Functional Analyst in lots of arenas of IT industry including Excel Macros and formulas. I am an Engineering Graduate with an MBA degree. If you see I am among the niche bidder who has More

$333 USD in 5 days
(18 Reviews)
4.4
smartguy666

Hi sir, i am interested in your project. I can do this for sure. I have made many bots/scrapers/automated softwares before. If you are interested I can give you an offer that i can make a demo if you like the demo than More

$444 USD in 10 days
(11 Reviews)
4.3
meljux

Hi, I can use python to create a script with that functionality. Please provide and example of your CSV file to start working immediately on your project. Drop me a line if you are interested in my services.

$750 USD in 5 days
(5 Reviews)
3.8
Time2win

Hello, We have excellent team of programmers and designers to work on your project efficiently and complete job in time. We have read your deepest requirement at our best and will surely give better results. thanks

$824 USD in 20 days
(9 Reviews)
3.8
TechJSolutions

Hi Lwyyen, I'm interested with your project. I will using HTML PHP as front end and Ms SQL Server as back end to compare the article. Waiting for examples and detail requirements. Thanks

$550 USD in 9 days
(3 Reviews)
1.2
ibaydan

A proposal has not yet been provided

$250 USD in 10 days
(1 Review)
1.0
andrewscm

I am interested in writing a program that will give the similarity strength of various CSV files. To create the program, I would use Java since that is the language I am best with and deploying the program to you is ve More

$300 USD in 10 days
(1 Review)
0.2
surenfl

I have developed such similar projects in Python in history. I can very easily and quickly develop such projects.

$250 USD in 10 days
(0 Reviews)
0.0
ponsider

Hi. I work with similar project before (English and Italian news), and can do this work event without prepaid, because i'm new on freelancer. :) I do work, demonstrate result and you pay. Can do this task as windows de More

$277 USD in 3 days
(0 Reviews)
0.0
srinivasreddy88

Hi currently am working as a data analyst in a software company for last two years. I have a good experience in analyzing data using python. I can able to finish the job as soon as possible

$555 USD in 10 days
(0 Reviews)
0.0
runsepprun

Hello, I implemented a similar project a few years ago, but it was a bit more complex. It processed thousends of textual documents with a bunch of distributed computers. Obviously I have enough experience with Infor More

$500 USD in 2 days
(0 Reviews)
0.0