Data mining on Craigslist(repost)

Виконано Опубліковано %project.relative_time Оплачується при отриманні
Виконано Оплачується при отриманні

I would like to hire an experienced programmer to create a program that can do data mining on Craigslist.org. The project involves searching for a specific keyword throughout the entire US Craigslist site and extracting certain pieces of information for output into MS Excel.

?

The program deliverable should be in ready-for-use condition. The program and program code should be in the buyer’s name and will legally belong to the buyer of the code. Payment will be delivered after the program is tested. Program must be delivered to buyer by January 30, 2009.

## Deliverables

I would like to hire an experienced programmer to create a program that can do data mining on Craigslist.org. The project involves searching for a specific keyword throughout the entire US Craigslist site and extracting certain pieces of information for output into MS Excel.

?

The program deliverable should be in ready-for-use condition. The program and program code should be in the buyer’s name and will legally belong to the buyer of the code. Payment will be delivered after the program is tested.

?

This is what the program should do:

?

The program should search “gift card?? in every forum of Craigslist in the United States. In other words, the program will search for gift cards in every region of every US state ??" for example, within the state of Nebraska, the search should take place in Grand Island, Lincoln, and Omaha/Council Bluffs, but NOT Sioux City (Iowa), because that would be in the Iowa version.

?

The program should post output into an Excel spreadsheet, with the following variables as columns (I follow variables with the arrow for explanatory purposes):

- Value of the gift card ? How much money is the card worth?

- Price of the card ? How much is the seller willing to accept to part with the gift card (frequently lower than the card value)

- The unique sale ID ? the e-mail link to the right of “reply to??

- Trade ? this variable = 1 if “trade?? or “barter?? is anywhere in the text of the post, 0 otherwise

- Store/vendor ? Gap, Target, Wal-Mart, Banana Republic or other vendors

- The state ? New York, New Jersey, California, etc.

- The region ? Albany, New York City, Hudson Valley, etc.

- Location, if applicable

- Phone number, if applicable

- Picture ? Code 1 if there is a picture in the post, 0 otherwise.

- Date of post

- Time of post

- URL of post ? This is so I can go back to track observations

- If possible, an indication about whether the card expires or not. And if it DOES expire, the date of expiration (as a separate variable)

?

I do NOT want the entire body of the post. Each variable should output in a separate column in MS Excel (no later than 2003 version), with each observation occupying a new row. The program, when run, should update the observations without erasing or duplicating already-recorded data. Because posts expire after one month (or until removed), it is important that I can run this program on a daily basis without recording the same observations over and over for an entire month. Therefore, if, in two hours, there are only 50 new applicable posts, then if I run before and then two hours later, 50 additional rows in Excel will appear with information.

?

I am not particular about the face of the program. However, it should be relatively easy to use. Ideally, it would consist of a button to load the data mining system which outputs into an Excel spreadsheet. Somewhere in the code AND on the face of the program it should say “Property of A E Greenberg??

?

Please contact through the site for any questions.? The deadline for the deliverables (ready-for-use program, program code) will be no more than 4 days after the close of bidding.

Apple Safari Введення даних Техніка Google Chrome Microsoft MySQL PHP Управління проектом Архітектура ПЗ Тестування ПЗ Робочій стіл Windows

ID Проекту: #3562972

Про проект

7 заявок(-ки) Дистанційний проект Остання активність Jan 24, 2009

Доручено:

asadapt84

See private message.

$68 USD за 14 дні(-в)
(106 відгуків(-и))
5.8

7 фрілансерів(-и) готові виконати цю роботу у середньому за $827

gagikb6321

See private message.

$4250 USD за 14 дні(-в)
(5 відгуків(и))
5.9
dizyn

See private message.

$476 USD за 14 дні(-в)
(37 відгуків(и))
5.5
se13311

See private message.

$467.5 USD за 14 дні(-в)
(34 відгуків(и))
5.2
hassannasirvw

See private message.

$425 USD за 14 дні(-в)
(21 відгуків(и))
4.5
Anbusivam

See private message.

$51 USD за 14 дні(-в)
(13 відгуків(и))
3.3
openwareltd

See private message.

$51 USD за 14 дні(-в)
(3 відгуків(и))
0.5