Fine‐Grained Mobile Application Clustering Model Using Retrofitted Document Embedding

In this paper, we propose a fine‐grained mobile application clustering model using retrofitted document embedding. To automatically determine the clusters and their numbers with no predefined categories, the proposed model initializes the clusters based on title keywords and then merges similar clus...

Full description

Bibliographic Details
Main Authors: Yeo‐Chan Yoon, Junwoo Lee, So‐Young Park, Changki Lee
Format: Article
Language:English
Published: Electronics and Telecommunications Research Institute (ETRI) 2017-08-01
Series:ETRI Journal
Subjects:
Online Access:https://doi.org/10.4218/etrij.17.0116.0936
Description
Summary:In this paper, we propose a fine‐grained mobile application clustering model using retrofitted document embedding. To automatically determine the clusters and their numbers with no predefined categories, the proposed model initializes the clusters based on title keywords and then merges similar clusters. For improved clustering performance, the proposed model distinguishes between an accurate clustering step with titles and an expansive clustering step with descriptions. During the accurate clustering step, an automatically tagged set is constructed as a result. This set is utilized to learn a high‐performance document vector. During the expansive clustering step, more applications are then classified using this document vector. Experimental results showed that the purity of the proposed model increased by 0.19, and the entropy decreased by 1.18, compared with the K‐means algorithm. In addition, the mean average precision improved by more than 0.09 in a comparison with a support vector machine classifier.
ISSN:1225-6463
2233-7326