An Introduction to Duplicate Detection
With the ever-increasing volume of data, data quality problems abound. Duplicates, that is, multiple yet different representations of the same real-world object in data, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, and catalogs are mailed multiple times to the same household. Automatically detecting duplicates is difficult: first, duplicate representations are usually not identical but differ slightly in their values. Second, in principle all pairs of records should be compared, which is…
CHF 39.90
Prices incl. VAT and shipping costs (free shipping from CHF 40.00)
Available in approx. 5 working days
Product details
Additional authors: Herschel, Melanie
- ISBN: 978-1-60845-220-0
- EAN: 9781608452200
- Product number: 6702278
- Publisher: Morgan & Claypool Publishers
- Language: English
- Year of publication: 2010
- Pages: 88
- Dimensions: H 23.5 cm x W 19.1 cm x D 0.5 cm, 183 g
- Illustrations: Paperback
- Weight: 183 g
10 more works by Felix Naumann: