Algorithm to match short text strings (equivalent product names from disparate sources)

In Progress Posted 7 years ago Paid on delivery
In Progress Paid on delivery

We have an existing MS SQL database which is populated with products retrieved from various sources (portals).

The various portals usually have slightly different names for the products.

E.G

Portal 1:

PortalProduct 1: CocaCola

PortalProduct 2: Manchester United

PortalProduct 3: Maplin Electronics

Portal 2:

PortalProduct 1: Coca-Cola

PortalProduct 2: Man Utd

PortalProduct 3: Maplin Elec Ltd.

- We need to correlate these PortalProducts to a central list of Products.

- One Product can have 1-to-many PortalProducts

- The database is populated with around 10,000 PortalProducts

- The database is not populated with any Products. Products need to be added for each new PortalProduct that doesn't match an existing Product.

This must be implemented as a C# console application so that a) we can see the results when the correlate is run and b) so that we can ultimately run it on a task-schedule.

The developer must develop a text matching algorithm to correlate PortalProduct names with Product names (and create a product where there is no match). The emphasis is on the accuracy of the text matching algorithm. Speed is less important than [login to view URL] developer should have a mathematical mind and preferably has experience in developing mathematical text matching algorithms.

Examples of mathematical text matching algorithms that could be used (and potentially refined and combined) to meet our requirements are:

String metric

Locality-sensitive hashing

Needleman–Wunsch algorithm

Smith–Waterman algorithm

Levenshtein distance

Concept Search

Approximate matching with addition of regular expressions ability

Regular expressions for non-fuzzy (exact) matching

Metaphone

Soundex

Agrep

Plagiarism detection

[login to view URL]

Ideally the developer will already understand and have used some of these before and able to pick the best algorithm(s) for this application.

Algorithm C# Programming Mathematics

Project ID: #10964138

About the project

14 proposals Remote project Active 7 years ago

14 freelancers are bidding on average £174 for this job

abhijitbuet

easy task for me :)

£133 GBP in 1 day
(146 Reviews)
6.3
sflogics

Hi, We are small team of 4 developers with 5+ years of .Net experience and have happy old clients. One from UK from about last 4 years in Bedford. We never had excuses; even not asked to clients to pay extra money. More

£200 GBP in 3 days
(50 Reviews)
6.2
solutionEngrn

Hi there, You have a Challenge, We Create the Solution Solution Engineering is a Team of quality and solution oriented engineering professionals across various engineering disciplines that have tackled challenges lik More

£244 GBP in 4 days
(2 Reviews)
5.3
SharjeelSohail

hi. im a software development engineer with excellent algorithm design skills. i can do this work for you

£100 GBP in 5 days
(60 Reviews)
5.2
AbanoubLabib92

I'm C# developer. I worked in Microsoft in AppFactory program for developing Windows 8 and Windows Phone Apps. I got here many Desktop and WPF projects. You can check my profile and if you accepted, just let me know to More

£150 GBP in 15 days
(11 Reviews)
4.7
Alethor

Hi! I would like to help you with this project. I have some questions, so, if you are interested, please let's talk BEFORE closing anything. One everything is set and understood, then we will be able to work toge More

£150 GBP in 3 days
(20 Reviews)
3.9
pakaimsolutions

Hello, I hope you doing very well! I have gone through the description and would like to provide a quality solution using my 5+ years of professional experience in required skills. (.NET/ASP.NET - MVC, EF, SQL, C#). More

£122 GBP in 3 days
(11 Reviews)
4.0
joy151280

Hi! Thanks for posting this requirement. I am experts with design and development works. I have developed several websites in ASP.net vb. I have done research in .net I have also made several research tool in vb.net. I More

£177 GBP in 3 days
(2 Reviews)
3.1
aptriangle

First, when using an algorithm, instead of a list that contains phrase pairs considered equivalent, there's no way to be completely certain that a match is correct. Any match would have a confidence level. Your first More

£150 GBP in 5 days
(2 Reviews)
2.0
sammy2889

Our core competency lies in complete end-end development cum management of a new website development project, and we are seeking opportunities to build Websites/Applications from the ground up for you or your business. More

£200 GBP in 5 days
(0 Reviews)
0.0
fifthjourney

I have used Copyleaks and Fuzzystring in a plagiarism checking software for library for a college. Im sure it will be more than enough for this purpose.

£135 GBP in 5 days
(0 Reviews)
0.0
Mazookr

I wrote something similar for the BBC and MACAT.com. Probably do it in less than 5 days, but good to have some leeway, I have used those exact algos and much more such as ML(wont be needed) I work as a Tech lead More

£277 GBP in 5 days
(0 Reviews)
0.0