GENERATION OF A SET OF KEY TERMS CHARACTERISING TEXT DOCUMENTS
This paper describes statistical methods (information gain, mutual information, χ² statistics, and the TF-IDF method) for generating key words from a collection of text documents. These key words should characterise the content of the documents and can be used to retrieve relevant documents from a document co...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: | University of Zagreb, Faculty of Organization and Informatics, 2007-06-01 |
Series: | Journal of Information and Organizational Sciences |
Online Access: | http://jios.foi.hr/index.php/jios/article/view/33 |
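
To illustrate the TF-IDF method named in the abstract, the following is a minimal sketch of key term generation for a small document collection. It assumes one common variant of the weighting, term frequency multiplied by log(N / document frequency), together with simple whitespace tokenisation; the record does not show the exact formulation used in the paper, and the function name `tfidf_key_terms` and its `top_k` parameter are hypothetical.

```python
import math
from collections import Counter


def tfidf_key_terms(documents, top_k=3):
    """Return the top_k highest-scoring TF-IDF terms for each document.

    Assumption: tf = count / document length, idf = log(N / df).
    This is a common textbook variant, not necessarily the paper's.
    """
    # Naive tokenisation: lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: number of documents that contain each term.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))

    results = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        scores = {
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in term_freq.items()
        }
        # Rank by descending score; break ties alphabetically.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        results.append(ranked[:top_k])
    return results


if __name__ == "__main__":
    docs = [
        "the cat sat on the mat",
        "the dog sat on the log",
        "the cat chased the dog",
    ]
    for doc, terms in zip(docs, tfidf_key_terms(docs, top_k=1)):
        print(doc, "->", terms)
```

Terms that occur in every document, such as "the" in this toy collection, receive a score of zero, so the highest-ranked terms are those that distinguish one document from the rest.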