Information Retrieval Document Classified with K-Nearest Neighbor

Along with the rapid advancement of technology development led to the amount of information available is also increasingly abundant. The aim of this study was to determine how the implementation of information retrieval system in the classification of the journal by using the cosine similarity and K...

Full description

Bibliographic Details
Main Authors:	Badruz Zaman, Endah Purwanti, Alifian Sukma
Format:	Article
Language:	Indonesian
Published:	Universitas Airlangga 2016-01-01
Series:	Record and Library Journal
Subjects:	information retrieval system, similarity, cosinus, k-nearest neighbor, document classification
Online Access:	http://e-journal.unair.ac.id/index.php/RLJ/article/view/1177

id	doaj-78de068c399748a2a248f72034037bc7
record_format	Article
spelling	doaj-78de068c399748a2a248f72034037bc72020-11-24T22:58:21ZindUniversitas AirlanggaRecord and Library Journal2442-51682016-01-011212913810.20473/rlj.v1i2.11771017Information Retrieval Document Classified with K-Nearest NeighborBadruz Zaman0Endah PurwantiAlifian SukmaFakultas Sains dan Teknologi Universitas AirlanggaAlong with the rapid advancement of technology development led to the amount of information available is also increasingly abundant. The aim of this study was to determine how the implementation of information retrieval system in the classification of the journal by using the cosine similarity and K-Nearest Neighbor (KNN). The data used as many as 160 documents with categories such as Physical Sciences and Engineering, Life Science, Health Science, and Social Sciences and Humanities. Construction stage begins with the use of text mining processing, the weighting of each token by using the term frequency-inverse document frequency (TF-IDF), calculate the degree of similarity of each document by using the cosine similarity and classification using k-Nearest Neighbor.Evaluation is done by using the testing documents as much as 20 documents, with a value of k = {37, 41, 43}. Evaluation system shows the level of success in classifying documents on the value of k = 43 with a value precision of 0501. System test results showed that 20 document testing used can be classified according to the actual category.http://e-journal.unair.ac.id/index.php/RLJ/article/view/1177information retrieval system, similarity, cosinus, k-nearest neighbor, document classification
collection	DOAJ
language	Indonesian
format	Article
sources	DOAJ
author	Badruz Zaman Endah Purwanti Alifian Sukma
spellingShingle	Badruz Zaman Endah Purwanti Alifian Sukma Information Retrieval Document Classified with K-Nearest Neighbor Record and Library Journal information retrieval system, similarity, cosinus, k-nearest neighbor, document classification
author_facet	Badruz Zaman Endah Purwanti Alifian Sukma
author_sort	Badruz Zaman
title	Information Retrieval Document Classified with K-Nearest Neighbor
title_short	Information Retrieval Document Classified with K-Nearest Neighbor
title_full	Information Retrieval Document Classified with K-Nearest Neighbor
title_fullStr	Information Retrieval Document Classified with K-Nearest Neighbor
title_full_unstemmed	Information Retrieval Document Classified with K-Nearest Neighbor
title_sort	information retrieval document classified with k-nearest neighbor
publisher	Universitas Airlangga
series	Record and Library Journal
issn	2442-5168
publishDate	2016-01-01
description	Along with the rapid advancement of technology development led to the amount of information available is also increasingly abundant. The aim of this study was to determine how the implementation of information retrieval system in the classification of the journal by using the cosine similarity and K-Nearest Neighbor (KNN). The data used as many as 160 documents with categories such as Physical Sciences and Engineering, Life Science, Health Science, and Social Sciences and Humanities. Construction stage begins with the use of text mining processing, the weighting of each token by using the term frequency-inverse document frequency (TF-IDF), calculate the degree of similarity of each document by using the cosine similarity and classification using k-Nearest Neighbor.Evaluation is done by using the testing documents as much as 20 documents, with a value of k = {37, 41, 43}. Evaluation system shows the level of success in classifying documents on the value of k = 43 with a value precision of 0501. System test results showed that 20 document testing used can be classified according to the actual category.
topic	information retrieval system, similarity, cosinus, k-nearest neighbor, document classification
url	http://e-journal.unair.ac.id/index.php/RLJ/article/view/1177
work_keys_str_mv	AT badruzzaman informationretrievaldocumentclassifiedwithknearestneighbor AT endahpurwanti informationretrievaldocumentclassifiedwithknearestneighbor AT alifiansukma informationretrievaldocumentclassifiedwithknearestneighbor
_version_	1725647425537835008

Information Retrieval Document Classified with K-Nearest Neighbor

Similar Items