Crawl an entire website and convert to PDF or ODF

Виконано Опубліковано %project.relative_time Оплачується при отриманні
Виконано Оплачується при отриманні

Create a webform that accepts 2 required values (e-mail and website URL), 2 optional values that must be entered together (username and password), and an 'execute' process button.

For example, the e-mail addr provided is: [Posting contact details is Prohibited by [url removed, login to view] Admin] and the url is: http://stig.test.org.

Username: Password:

All email and URL values would have basic value validation checking. Username and PW fields should accept special characters.

Upon entering both values, user clicks 'execute' button

Also create web API that can accept the above (4) values.

Store email address

Crawl website URL ([url removed, login to view]) with no depth limit within the domain ([url removed, login to view])

Must also be able to enter pw protected areas with supplied username/pw credentials prompted by textbox or within url (http://username:[url removed, login to view])

Convert HTML, images, css, script (php/xml) into PDF or ODF(Open Document Format). In other words, generate a 'snapshot' of what a browser would display into a pdf/odf.

Combine all these pages into a single document.

Name file <server.domain.extension>-<mm-dd-yyyy>-<24hr:min:sec>.pdf

Upload (ftp) document onto supplied web server.

If work order entered through webform:

Generate retrieval URL

Send retrieval URL to stored email address [Posting contact details is Prohibited by [url removed, login to view] Admin] originally provided in step 1 with unique transaction number in subject line and body.

If work order entered through API:

Return document payload back over open http connection.

In case of timeout, fall back to email delivery described above.

Support:

We can provide server support but we prefer that you develop and test in your own environment and then provide instructions/support to deploy in our environment. Linux (Centos) OS Platform implementation is preferred.

Verification:

I will need 1 week to verify the completeness of the deliverable.

Example:

Please see attached example file.

Take note of source URL and timestamp at the bottom of each page.

If interested, please include example description of API call framework.

Apache Linux PHP Архітектура ПЗ Веб-дизайн

ID Проекту: #957356

Про проект

4 заявок(-ки) Дистанційний проект Остання активність Mar 4, 2011

Доручено:

ojno

I have several years experience in Linux/Unix and web development, mainly with Python, Java, C/C++, and PHP. My preferred framework is Django (Python based), but I learn quickly and would be willing to adapt to whateve Більше

$500 USD за 3 дні(-в)
(0 відгуків(-и))
3.4

4 фрілансерів(-и) готові виконати цю роботу у середньому за $663

aruhat

Hello Please see PM. Regards, Chandni

$750 USD за 5 дні(-в)
(12 відгуків(и))
6.0
mrt2410

Hello, Please check PM.

$700 USD за 5 дні(-в)
(8 відгуків(и))
3.6
jvetter

Please look PM.

$700 USD за 3 дні(-в)
(0 відгуків(и))
0.0