Crawler for site Eleclerc
Base site and store finder: [login to view URL]
Store example: [login to view URL]
Flyer example: [login to view URL]
The Crawler parse the market or the supermarket site to find all the stores and all the promotional flyers related to that shop. To do this it is necessary that each store is uniquely identified within the site, and all the information and all the promotions (flyers) associated to this store will be recognized and recorded in the json. It’s possible that a single store has more flyers and also a single flyer can be associated with more stores.
You have to create a single cli php (php 5.4 standard) script (one single file and, if necessary, a few free libraries) that will be started on our server every x hours(2-3 time a day).
This script must be able to:
- crawl the site and parse all the information necessary for the Json
- create the Json like specific below
- download pdf (flyer) or create it in case of jpg or flash
- download the pdf (flyer) locally in a custom configurable directory
- we need the possibility to start a few command shell for every downloaded pdf
- automatic erase the expired flyer (if it’s not available an expiration date 3 month after the first download)
Hi we are freelance software developers. If you contact us, we can give a quote and we can discuss further details of the project. w w w . s o l v e r . i o