Short Text Document Clustering using Distributed Word Representation and Document Distance
This paper presents a method for clustering short text documents, such as instant messages, SMS, or news headlines. Vocabularies in the texts are expanded using external knowledge sources and represented by a Distributed Word Representation. Clustering is done using the K-means algorithm with Word...
Main Authors: | Supavit KONGWUDHIKUNAKORN, Kitsana WAIYAMAI |
---|---|
Format: | Article |
Language: | English |
Published: |
Walailak University
2018-03-01
|
Series: | Walailak Journal of Science and Technology |
Subjects: | |
Online Access: | http://wjst.wu.ac.th/index.php/wjst/article/view/4133 |
Similar Items
-
A Nested Chinese Restaurant Topic Model for Short Texts with Document Embeddings
by: Yue Niu, et al.
Published: (2021-09-01) -
Document Nature as a Text Feature (Exemplified by PR Texts)
by: Ekaterina Sergeevna Buslaeva
Published: (2016-04-01) -
Topic Modeling for Short Texts via Word Embedding and Document Correlation
by: Feng Yi, et al.
Published: (2020-01-01) -
A Semantic Graph Model for Text Representation and Matching in Document Mining
by: Shaban, Khaled
Published: (2007) -
A Semantic Graph Model for Text Representation and Matching in Document Mining
by: Shaban, Khaled
Published: (2007)