Data Scraping - Web Crawler - open to bidding

Closed Posted Oct 2, 2013 Paid on delivery
Closed Paid on delivery

Crawler for site Eleclerc

Base site and store finder: [url removed, login to view]

Store example: [url removed, login to view]

Flyer example: [url removed, login to view]

The Crawler parse the market or the supermarket site to find all the stores and all the promotional flyers related to that shop. To do this it is necessary that each store is uniquely identified within the site, and all the information and all the promotions (flyers) associated to this store will be recognized and recorded in the json. It’s possible that a single store has more flyers and also a single flyer can be associated with more stores.

You have to create a single cli php (php 5.4 standard) script (one single file and, if necessary, a few free libraries) that will be started on our server every x hours(2-3 time a day).

This script must be able to:

- crawl the site and parse all the information necessary for the Json

- create the Json like specific below

- download pdf (flyer) or create it in case of jpg or flash

- download the pdf (flyer) locally in a custom configurable directory

- we need the possibility to start a few command shell for every downloaded pdf

- automatic erase the expired flyer (if it’s not available an expiration date 3 month after the first download)

PHP

Project ID: #4989564

About the project

3 proposals Remote project Active Nov 8, 2013

3 freelancers are bidding on average €239 for this job

YLsaPPXlwmDe

Hi we are freelance software developers. If you contact us, we can give a quote and we can discuss further details of the project. w w w . s o l v e r . i o

€155 EUR in 3 days
(0 Reviews)
0.0