The Effects of Tabular-Based Content Extraction on Patent Document Clustering

Data can be represented in many different ways within a particular document or set of documents. Hence, attempts to automatically process the relationships between documents or determine the relevance of certain document objects can be problematic. In this study, we have developed software to automa...

Full description

Bibliographic Details
Main Authors: Michael W. Berry, Bruce E. Kiefer, Denise R. Koessler, Benjamin W. Martin
Format: Article
Language:English
Published: MDPI AG 2012-10-01
Series:Algorithms
Subjects:
Online Access:http://www.mdpi.com/1999-4893/5/4/490