INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS

Information retrieval of text documents requires a method that can return a number of documents that are highly relevant to the user's request. One important step in the process is term weighting, which produces the text representation. The use of LCS in Tf-Idf weighting adjustments co...
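As a rough illustration of the weighting step mentioned above, the sketch below computes plain Tf-Idf weights for a toy corpus. The whitespace tokenizer, the function names `tokenize` and `tf_idf`, and the specific formula (raw term frequency times log(N / df)) are assumptions made for illustration; the article may use a different Tf-Idf variant.

```python
import math
from collections import Counter

def tokenize(text):
    # Assumption: lowercase whitespace tokenization, no stemming or stop-word removal.
    return text.lower().split()

def tf_idf(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))
    weights = []
    for tokens in tokenized:
        tf = Counter(tokens)
        weights.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return weights

# Toy corpus (hypothetical data, not taken from the article).
docs = [
    "information retrieval of text",
    "text weighting with lcs",
    "retrieval of documents",
]
weights = tf_idf(docs)
```

Under this formula a term that occurs in every document gets weight zero, while a term unique to one document gets the largest weight per occurrence.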

Bibliographic Details
Main Authors: Munjiah Nur Saadah, Rigga Widar Atmagi, Dyah S. Rahayu, Agus Zainal Arifin
Format: Article
Language: English
Published: Universitas Indonesia 2013-10-01
Series: Jurnal Ilmu Komputer dan Informasi
Online Access: http://jiki.cs.ui.ac.id/index.php/jiki/article/view/216
id doaj-85d931df9e324016944425bc3112f011
record_format Article
spelling doaj-85d931df9e324016944425bc3112f011 2020-11-24T23:07:38Z eng Universitas Indonesia Jurnal Ilmu Komputer dan Informasi 2088-7051 2502-9274 2013-10-01 61343710.21609/jiki.v6i1.216170 INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS Munjiah Nur Saadah, Rigga Widar Atmagi, Dyah S. Rahayu, Agus Zainal Arifin Information retrieval of text documents requires a method that can return a number of documents that are highly relevant to the user's request. One important step in the process is term weighting, which produces the text representation. Using the longest common subsequence (LCS) to adjust Tf-Idf weights takes into account words that appear in the same order in the query and in the document text. However, a very long but irrelevant document can still receive a weight that does not represent its relevance. This research proposes using LCS to weight word order while taking into account the length of each document relative to the average document length in the corpus. The method is able to return text documents effectively. Adding the word-order feature, normalized by the ratio of the document's overall length to that of the documents in the corpus, yields precision and recall values as good as those of the method of Tasi et al. http://jiki.cs.ui.ac.id/index.php/jiki/article/view/216
collection DOAJ
language English
format Article
sources DOAJ
author Munjiah Nur Saadah
Rigga Widar Atmagi
Dyah S. Rahayu
Agus Zainal Arifin
title INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS
publisher Universitas Indonesia
series Jurnal Ilmu Komputer dan Informasi
issn 2088-7051
2502-9274
publishDate 2013-10-01
description Information retrieval of text documents requires a method that can return a number of documents that are highly relevant to the user's request. One important step in the process is term weighting, which produces the text representation. Using the longest common subsequence (LCS) to adjust Tf-Idf weights takes into account words that appear in the same order in the query and in the document text. However, a very long but irrelevant document can still receive a weight that does not represent its relevance. This research proposes using LCS to weight word order while taking into account the length of each document relative to the average document length in the corpus. The method is able to return text documents effectively. Adding the word-order feature, normalized by the ratio of the document's overall length to that of the documents in the corpus, yields precision and recall values as good as those of the method of Tasi et al.
url http://jiki.cs.ui.ac.id/index.php/jiki/article/view/216
_version_ 1725617826969944064
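
The word-order idea in the description can be sketched as follows: compute the longest common subsequence (LCS) of query and document tokens, then damp the score for documents longer than the corpus average. This is only an illustration under stated assumptions. The record does not give the authors' formula, so the function `order_score`, its `avg_doc_len` parameter, and the way the LCS length is scaled and normalized are hypothetical.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists (dynamic programming)."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]

def order_score(query, doc, avg_doc_len):
    """Hypothetical word-order score, not the article's formula.

    The LCS length is divided by the query length, then damped when the
    document is longer than the average document in the corpus.
    """
    q, d = query.lower().split(), doc.lower().split()
    if not q or not d:
        return 0.0
    length_ratio = len(d) / avg_doc_len
    return (lcs_length(q, d) / len(q)) / max(1.0, length_ratio)

# Hypothetical query and document, for illustration only.
query = "text document retrieval"
doc = "retrieval of text document"
score_avg = order_score(query, doc, avg_doc_len=4)   # document of average length
score_long = order_score(query, doc, avg_doc_len=2)  # document twice the average length
```

In this sketch the same document scores lower when it is long relative to the corpus average, which mirrors the stated motivation that very long but irrelevant documents should not be over-weighted. A score like this could be added to or multiplied with a Tf-Idf similarity; which combination the authors use is not stated in this record.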