Data analyzer + text similarity comparison

Виконано Опубліковано %project.relative_time Оплачується при отриманні
Виконано Оплачується при отриманні

Need to create software(script) that will be store, filter and analize data. The number of records in the database can be up to hundreds of thousands of records, you should use the most optimized algorithms and development technologies.(New data will be upload every day)

The process of working has next structure:

.CSV data file -> Database and text comparation analizer-> Processing macros and filters -> output in .TXT format

1 step) Loading data from a .CSV file with a fixed structure in the current database

2) After data uploaded, it should compare one field(text) for each record with another records in !ALL DATABASE for similarity

(!!! This is the most difficult part of this project, it need to compare two text(two records), and return similirity of it in percents)

Example of working you can find at:

[url removed, login to view]

[url removed, login to view] (There is russian language interface(can be translated in Google translate))

After current record was compared with all records in DB, it add info of MAXIMUM percent of similiry and ID of the record that is most similiar to.

So we saved this info for each record in db.

3) One record has next structure:

Field 1;Field 2;Field 3;...;Max percent of simility;ID of most similiar record

4)The ability to create flexible filters (macros) to sort the data (filters (macros) should be able to save)

Macro consists of several filters (fields has different types: date, text, numerical)..

For example

Macro =

(

Field 1 contains "John"

And

Field 4 is equal to "address" OR field 4 is equal to "Andy"

)

So macros has a complex structure with the logical relations between the filters inside AND \ OR

5) After processing the macro data that we received, export in .TXT file

!!!ALL ADDITIONAL INFO AND DATA SAMPLE WILL BE PROVIDED!!!

Big Data Sales Програмування на C# Delphi PHP Visual Basic

ID Проекту: #4030658

Про проект

12 заявок(-ки) Дистанційний проект Остання активність Dec 13, 2012

Доручено:

sveralex

Hello, here is my bid

$440 USD за 5 дні(-в)
(32 відгуків(-и))
6.2

12 фрілансерів(-и) готові виконати цю роботу у середньому за $584

AlosDeveloper

Hello, i have 11 years experince in Delphi. i am ready to start. Let's discuss your project more deeply in message board

$750 USD за 20 дні(-в)
(93 відгуків(и))
7.2
greggfletcher

Hello, professioanl C# programmer here. If you are interested in my bid, please contact me. Best Regards.

$1000 USD за 7 дні(-в)
(25 відгуків(и))
5.7
eugene2006

Hi. I interested in your project. Can discuss details.

$400 USD за 10 дні(-в)
(8 відгуків(и))
3.9
aegansys

c/c++/c# developer

$250 USD за 7 дні(-в)
(6 відгуків(и))
3.6
spyrosn

I have fully understood the requirements of your project and am ready to start ASAP. 10+ years of .NET experience guarantee swift project completion with quality results.

$420 USD за 5 дні(-в)
(1 відгук)
1.7
DeepSyaal

Having a team of Professionals. We Provide high quality work with accuracy.

$500 USD за 10 дні(-в)
(1 відгук)
2.2
boyet0911

Experienced programmer/developer here. Kindly check your pmb for my details. Thanks.

$500 USD за 20 дні(-в)
(1 відгук)
0.0
Eb2THqM14

We are freelance software developers. If you contact me I can give a quote for your project and we can discuss the details. www.<b><i>Removed by Admin</i></b>

$500 USD за 1 день
(0 відгуків(и))
0.0
rishijain83

Hi Alex, I have executed a similar project for a logistics company in which I had to match customer names and addresses. It required daily updation as in your case. Matching algorithm is certainly tricky, but I ha Більше

$500 USD за 30 дні(-в)
(0 відгуків(и))
0.0
pcman1ac

Hello. I'm interresting in this project. I have experience in analysing hudge amounts of data in SQL databases (hundreds of millions records) using Delphi.

$1000 USD за 30 дні(-в)
(0 відгуків(и))
0.0
ngcomp

Certified from CLoudera for Hadoop

$750 USD за 5 дні(-в)
(0 відгуків(и))
4.2