The Performance of Boolean Retrieval and Vector Space Model in Textual Information Retrieval

Boolean Retrieval (BR) and Vector Space Model (VSM) are very popular methods in information retrieval for creating an inverted index and querying terms. BR method searches the exact results of the textual information retrieval without ranking the results. VSM method searches and ranks the results. T...

Full description

Bibliographic Details
Main Authors: Budi Yulianto, Widodo Budiharto, Iman Herwidiana Kartowisastro
Format: Article
Language:English
Published: Bina Nusantara University 2017-08-01
Series:CommIT Journal
Subjects:
Online Access:https://journal.binus.ac.id/index.php/commit/article/view/2108
Description
Summary:Boolean Retrieval (BR) and Vector Space Model (VSM) are very popular methods in information retrieval for creating an inverted index and querying terms. BR method searches the exact results of the textual information retrieval without ranking the results. VSM method searches and ranks the results. This study empirically compares the two methods. The research utilizes a sample of the corpus data obtained from Reuters. The experimental results show that the required times to produce an inverted index by the two methods are nearly the same. However, a difference exists on the querying index. The results also show that the number of generated indexes, the sizes of the generated files, and the duration of reading and searching an index are proportional with the file number in the corpus and the file size.
ISSN:1979-2484
2460-7010