Data De-Duplication through Active Learning

Data de-duplication is the identification and eventual elimination of records in a dataset that refer to the same entity without necessarily sharing the same attribute values or the same identifying values. Machine learning techniques have been used to handle data de-duplication....

Bibliographic Details
Main Author: Muhivuwomunda, Divine
Format: Others
Language: en
Published: University of Ottawa (Canada) 2013
Online Access: http://hdl.handle.net/10393/28859
http://dx.doi.org/10.20381/ruor-19478