I need a script to scrape product information and images from 3 to 4 selected website.
Looking for something I can run on a daily bases on the back ground on a XP PC that has MYSQL
It should have a list of proxy ip which can be edited
Once the information is collected would like it saved in a database maybe MYSQL so i can export the new products at a date in the future
Extracted data and allow it be manipulated to a file,
Saved data into a database for future scrapes so it will not have duplicated products scrapes
Keep records of products all ready added and already on my website so no duplicate products is added
When assigning Category information use my category code vs the website it scraped
For each product I need:
Category Assigned
Product name
Main product image small (all images should be downloaded)
Main product image large (all images should be downloaded)
Description
Weight
Product Price
Default Export Format will; (tab delimited)
PRODUCT_CODE
PRODUCT_NAME
CATEGORY_CODES
PRODUCT_PRICE
PRODUCT_COST
PRODUCT_WEIGHT
PRODUCT_DESC
PRODUCT_TAXABLE
PRODUCT_ACTIVE
PRODUCT_THUMBNAIL
PRODUCT_IMAGE