Find Jobs
Hire Freelancers

Multithread python crawler for determination of pHash values -- 2

$30-250 USD

Closed
Posted about 9 years ago

$30-250 USD

Paid on delivery
I am looking for a guy who can program a multithread python crawler and the user interface. User interface: ------------------- - User has the option to add several pictures (of which the pHash value is determined) - User has the option to add URLs to be crawled detected Step 1: --------- Each day, the entire website of the typed in URLs is crawled that way, that the paths of each subpage, etc. is determined. The paths shall be shared into 10 databases. (e.g.: A website has 100 subpages - 10 of these URLs are put to the first database, 10 to the second, etc.) Step 2: --------- One crawler (one crawler per server) cares for one database, visiting all stored URLs and getting the pHash values of the pictures displayed on the website. The pHash value shall be stored in a central result database together with the server path of the picture) Step 3: --------- The pHash value of the originally uploaded picture and the found picture on the websites is compared. If the value is above a certain, by the user determinable value, the found picture is listed in the user interface as a potential match. Other requirements: -------------------------- * I need to have the option whether [login to view URL] and meta information shall be respected or not while crawling. * I need to be able to set an hourly limit of the number of crawls per one website in order not to take to much attention and resources of the foreign websites. * German freelancers are preferred.
Project ID: 7467082

About the project

4 proposals
Remote project
Active 9 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
4 freelancers are bidding on average $253 USD for this job
User Avatar
Dear Sir, I'm very much delighted to let you know that i did data scraping with PHP-cURL, Node.js, Selenium from many sites. I just scraped the data from web site and then wrote the data in mysql database or excel or csv or xml file. I worked on many similar projects, I have big experience in data mining projects. I have written hundreds of web scrapers which scrape millions of pages each day. I'm ready to fulfill your requirement. I can finish this task in short time, with the best quality. I can assure 100% accuracy. Please give me the opportunity to do the work. With Kind Regards, Debdulal Roy Proshanta
$277 USD in 7 days
4.9 (62 reviews)
7.0
7.0
User Avatar
I've read your requirements thoroughly and I'm confident that I can finish the crawler in less than 4 days.
$210 USD in 4 days
4.9 (14 reviews)
3.7
3.7
User Avatar
A proposal has not yet been provided
$277 USD in 7 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of GERMANY
Lüneburg, Germany
5.0
113
Member since Jul 21, 2013

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.