INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS
Information retrieval of text document requires a method that is able to restore a number of documents that have high relevance according to the user's request. One important step in the process is a text representation of the weighting process. The use of LCS in Tf-Idf weighting adjustments co...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Universitas Indonesia
2013-10-01
|
Series: | Jurnal Ilmu Komputer dan Informasi |
Online Access: | http://jiki.cs.ui.ac.id/index.php/jiki/article/view/216 |
id |
doaj-85d931df9e324016944425bc3112f011 |
---|---|
record_format |
Article |
spelling |
doaj-85d931df9e324016944425bc3112f0112020-11-24T23:07:38ZengUniversitas IndonesiaJurnal Ilmu Komputer dan Informasi2088-70512502-92742013-10-0161343710.21609/jiki.v6i1.216170INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCSMunjiah Nur SaadahRigga Widar AtmagiDyah S. RahayuAgus Zainal ArifinInformation retrieval of text document requires a method that is able to restore a number of documents that have high relevance according to the user's request. One important step in the process is a text representation of the weighting process. The use of LCS in Tf-Idf weighting adjustments considers the appearance of the same order of words between the query and the text in the document. There is a very long document but irrelevant cause weight produced is not able to represent the value relevance of documents. This research proposes the use of LCS which gives weight to the word order by considering long documents related to the average length of documents in the corpus. This method is able to return a text document effectively. Additional features of word order by normalizing the ratio of the overall length of the document to the documents in the corpus generate values of precision and recall as well as the method of Tasi et al.http://jiki.cs.ui.ac.id/index.php/jiki/article/view/216 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Munjiah Nur Saadah Rigga Widar Atmagi Dyah S. Rahayu Agus Zainal Arifin |
spellingShingle |
Munjiah Nur Saadah Rigga Widar Atmagi Dyah S. Rahayu Agus Zainal Arifin INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS Jurnal Ilmu Komputer dan Informasi |
author_facet |
Munjiah Nur Saadah Rigga Widar Atmagi Dyah S. Rahayu Agus Zainal Arifin |
author_sort |
Munjiah Nur Saadah |
title |
INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS |
title_short |
INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS |
title_full |
INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS |
title_fullStr |
INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS |
title_full_unstemmed |
INFORMATION RETRIEVAL OF TEXT DOCUMENT WITH WEIGHTING TF-IDF AND LCS |
title_sort |
information retrieval of text document with weighting tf-idf and lcs |
publisher |
Universitas Indonesia |
series |
Jurnal Ilmu Komputer dan Informasi |
issn |
2088-7051 2502-9274 |
publishDate |
2013-10-01 |
description |
Information retrieval of text document requires a method that is able to restore a number of documents that have high relevance according to the user's request. One important step in the process is a text representation of the weighting process. The use of LCS in Tf-Idf weighting adjustments considers the appearance of the same order of words between the query and the text in the document. There is a very long document but irrelevant cause weight produced is not able to represent the value relevance of documents. This research proposes the use of LCS which gives weight to the word order by considering long documents related to the average length of documents in the corpus. This method is able to return a text document effectively. Additional features of word order by normalizing the ratio of the overall length of the document to the documents in the corpus generate values of precision and recall as well as the method of Tasi et al. |
url |
http://jiki.cs.ui.ac.id/index.php/jiki/article/view/216 |
work_keys_str_mv |
AT munjiahnursaadah informationretrievaloftextdocumentwithweightingtfidfandlcs AT riggawidaratmagi informationretrievaloftextdocumentwithweightingtfidfandlcs AT dyahsrahayu informationretrievaloftextdocumentwithweightingtfidfandlcs AT aguszainalarifin informationretrievaloftextdocumentwithweightingtfidfandlcs |
_version_ |
1725617826969944064 |