Budget: $30
This project is very simple if you know web scraping and can handle a big database.
***
Hello,
I'm looking for a freelancer specialized in web scraping.
I want to get the French websites listed in the DMOZ database.
Here's the URL:
[login to view URL]
You can download their database from this URL:
[login to view URL]
You need to get 253,328 URLs or more (French websites only).
Please deliver the result as a .txt file in this format:
- one URL per line
- for each URL, keep only the domain name (for example, for this URL: [login to view URL] => you need to get: [login to view URL]); for a subdomain, also keep only the domain name (for example, [login to view URL] => [login to view URL])
- remove duplicate URLs
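To make the expected output concrete, here is a minimal Python sketch of the format rules above (domain name only, subdomains reduced to the domain, duplicates removed, one entry per line). It is only an illustration, not a required implementation. The function names, the example.fr style addresses and the short TWO_LABEL_SUFFIXES set are assumptions made for this example; a real run over the full database would more likely rely on the complete Public Suffix List.

```python
from urllib.parse import urlsplit

# Assumption: a tiny hand-picked set of two-label suffixes, for illustration
# only. The full Public Suffix List would be needed for reliable results.
TWO_LABEL_SUFFIXES = {"asso.fr", "gouv.fr", "com.fr", "co.uk"}


def registered_domain(url):
    """Return the domain name of a URL, without scheme, path or subdomains."""
    url = url.strip()
    if "://" not in url:
        url = "//" + url  # lets urlsplit treat a bare host as the host part
    try:
        host = urlsplit(url).hostname or ""
    except ValueError:
        return ""  # malformed URL: skip it
    labels = host.rstrip(".").split(".")
    if len(labels) < 2:
        return ""
    # Keep three labels when the last two form a known two-label suffix.
    keep = 3 if ".".join(labels[-2:]) in TWO_LABEL_SUFFIXES else 2
    return ".".join(labels[-keep:])


def unique_domains(urls):
    """Yield each domain once, in first-seen order."""
    seen = set()
    for url in urls:
        domain = registered_domain(url)
        if domain and domain not in seen:
            seen.add(domain)
            yield domain


def write_domains(urls, path):
    """Write one domain per line to a text file."""
    with open(path, "w", encoding="utf-8") as out:
        for domain in unique_domains(urls):
            out.write(domain + "\n")
```

For instance, write_domains(["http://www.example.fr/a", "http://blog.example.fr/b"], "domains.txt") would write the single line example.fr. Note that this simple rule does not handle IP addresses or suffixes missing from the set.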
Thank you
Best regards,
This project is very urgent: $30, to be delivered within one day please.