ETRI Journal (Aug 2017)

Fine‐Grained Mobile Application Clustering Model Using Retrofitted Document Embedding

  • Yeo‐Chan Yoon,
  • Junwoo Lee,
  • So‐Young Park,
  • Changki Lee

DOI
https://doi.org/10.4218/etrij.17.0116.0936
Journal volume & issue
Vol. 39, no. 4
pp. 443 – 454

Abstract

In this paper, we propose a fine‐grained mobile application clustering model using retrofitted document embedding. To determine the clusters and their number automatically, without predefined categories, the proposed model initializes the clusters based on title keywords and then merges similar clusters. To improve clustering performance, the model separates an accurate clustering step that uses titles from an expansive clustering step that uses descriptions. The accurate clustering step produces an automatically tagged set, which is used to learn a high‐performance document vector. During the expansive clustering step, more applications are then classified using this document vector. Experimental results showed that, compared with the K‐means algorithm, the purity of the proposed model increased by 0.19 and the entropy decreased by 1.18. In addition, the mean average precision improved by more than 0.09 compared with a support vector machine classifier.
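
The purity and entropy figures quoted in the abstract are standard external clustering measures. As a rough illustration only, the sketch below computes both under commonly used definitions; the function name `purity_and_entropy`, the size-weighted average over clusters, and the base-2 logarithm are assumptions here, and the paper's exact formulation may differ.

```python
import math
from collections import Counter, defaultdict


def purity_and_entropy(cluster_ids, gold_labels):
    """Return (purity, entropy) of a clustering measured against gold labels.

    Assumed definitions (not taken from the paper):
      purity  = (1/N) * sum over clusters of the majority-label count
      entropy = sum over clusters of (|c|/N) * H(labels in c), with log base 2
    """
    if len(cluster_ids) != len(gold_labels):
        raise ValueError("cluster_ids and gold_labels must have equal length")
    if not cluster_ids:
        raise ValueError("at least one item is required")

    # Count how often each gold label occurs inside each cluster.
    label_counts = defaultdict(Counter)
    for cluster, label in zip(cluster_ids, gold_labels):
        label_counts[cluster][label] += 1

    n = len(cluster_ids)
    purity = sum(max(c.values()) for c in label_counts.values()) / n

    entropy = 0.0
    for counts in label_counts.values():
        size = sum(counts.values())
        # Shannon entropy of the label distribution within this cluster.
        h = -sum((k / size) * math.log2(k / size) for k in counts.values())
        entropy += (size / n) * h
    return purity, entropy


if __name__ == "__main__":
    # Toy data: six apps, two clusters, two hypothetical gold categories.
    clusters = [0, 0, 0, 1, 1, 1]
    gold = ["game", "game", "tool", "tool", "tool", "tool"]
    print(purity_and_entropy(clusters, gold))
```

Higher purity and lower entropy both indicate clusters that align more closely with the reference categories, which is the direction of the improvements reported over K‐means.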

Keywords