Short Text Document Clustering using Distributed Word Representation and Document Distance
This paper presents a method for clustering short text documents, such as instant messages, SMS, or news headlines. Vocabularies in the texts are expanded using external knowledge sources and represented by a Distributed Word Representation. Clustering is done using the K-means algorithm with Word...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Walailak University
2018-03-01
|
Series: | Walailak Journal of Science and Technology |
Subjects: | |
Online Access: | http://wjst.wu.ac.th/index.php/wjst/article/view/4133 |