Find Jobs
Hire Freelancers

DMOZ Extraction Database

$30-250 USD

In Progress
Posted over 12 years ago

$30-250 USD

Paid on delivery
Budget : $30 This project is very simple if You know wek scrapping and if you can handle big database... *** Hello, I'm looking for a freelancer specialized in web scraping. I want to get the french websites of the DMOZ Database. Here's the URL Address : [login to view URL] You can download their database from this url address : [login to view URL] You need to get 253 328 urls or more (only french websites) I want you to give me a file in this format : (.txt file please) - one url per line - for each url, you need to get only the domain name (for example for this url address : [login to view URL] => you need to get : [login to view URL]) for subdomain, you need to get also the domain name for example [login to view URL] => [login to view URL] - remove duplicated urls Thank you Best regards, This project is very urgent. $30 and in one day please.
Project ID: 1254411

About the project

2 proposals
Remote project
Active 13 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
Awarded to:
User Avatar
Hello, it can be done fast and efficiently. Regards.
$30 USD in 0 day
5.0 (56 reviews)
6.3
6.3
2 freelancers are bidding on average $30 USD for this job
User Avatar
ready for start
$30 USD in 0 day
5.0 (2 reviews)
2.0
2.0

About the client

Flag of FRANCE
Montpellier, France
4.9
5
Payment method verified
Member since Dec 12, 2010

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.