Short Text Document Clustering using Distributed Word Representation and Document Distance

This paper presents a method for clustering short text documents, such as instant messages, SMS, or news headlines. Vocabularies in the texts are expanded using external knowledge sources and represented by a Distributed Word Representation. Clustering is done using the K-means algorithm with Word...

Full description

Bibliographic Details
Main Authors: Supavit KONGWUDHIKUNAKORN, Kitsana WAIYAMAI
Format: Article
Language:English
Published: Walailak University 2018-03-01
Series:Walailak Journal of Science and Technology
Subjects:
Online Access:http://wjst.wu.ac.th/index.php/wjst/article/view/4133