I have an access database of about 450,000 company & personal names, addresses, etc.
There are tons of duplicate records in the database, but they are not exactly duplicate.
Some records may have Joe Construction Co and another will be Joe Construction Co Inc, etc.
There may be times where there are 6-9 duplicate vendors each with slightly different information. What I am trying to is combine all the duplicate records into one single record. I have to make the single record contain the most data possible.
For example, in the database there are about 20 fields. Some duplicate vendors may have the companies ethnicity information in one record, the next duplicate record may have the company's vendor number, the next record may have a phone number that the previous two records did not have, etc. I want to combine all the data into one record with the most data.
Attached is a sample of the database I am working with. This one only contains about 4,000 names just to give you an idea of what I am up against. The solution needs to be able to be done on the complete database of 450,000 records eventually.
If you have any questions, please feel free to ask.
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. 2) Installation package that will install the software (in ready-to-run condition) on the platform(s) specified in this bid request. 3) Complete ownership and distribution copyrights to all work purchased.