GENERATION OF A SET OF KEY TERMS CHARACTERISING TEXT DOCUMENTS

The presented paper describes statistical methods (information gain, mutual X^2 statistics, and TF-IDF method) for key words generation from a text document collection. These key words should characterise the content of text documents and can be used to retrieve relevant documents from a document co...

Full description

Bibliographic Details
Main Authors: Kristina Machova, Andrea Szaboova, Peter Bednar
Format: Article
Language:English
Published: University of Zagreb, Faculty of organization and informatics 2007-06-01
Series:Journal of Information and Organizational Sciences
Subjects:
Online Access:http://jios.foi.hr/index.php/jios/article/view/33