Blar i NTNU Open på forfatter "Leland, Robert"
-
Duplicate Detection with PMC -- A Parallel Approach to Pattern Matching
Leland, Robert (Master thesis, 2007)Fuzzy duplicate detection is an integral part of data cleansing. It consists of finding a set of duplicate records, correctly identifying the original or most representative record and removing the rest. The rate of Internet ...