Effective and Fast Near Duplicate Detection via Signature-Based Compression Metrics

Detecting near duplicates on the web is challenging due to its volume and variety. Most of the previous studies require the setting of input parameters, making it difficult for them to achieve robustness across various scenarios without careful tuning. Recently, a universal and parameter-free simila...

Full description

Bibliographic Details
Main Authors: Xi Zhang, Yuntao Yao, Yingsheng Ji, Binxing Fang
Format: Article
Language:English
Published: Hindawi Limited 2016-01-01
Series:Mathematical Problems in Engineering
Online Access:http://dx.doi.org/10.1155/2016/3919043