Deep autoencoder-based fuzzy c-means for topic detection

Hendri Murfi; Natasha Rosaline; Nora Hariadi

Array (Mar 2022)

Deep autoencoder-based fuzzy c-means for topic detection

Hendri Murfi,
Natasha Rosaline,
Nora Hariadi

Affiliations

Hendri Murfi: Corresponding author.; Department of Mathematics, Universitas Indonesia, Depok, 16424, Indonesia
Natasha Rosaline: Department of Mathematics, Universitas Indonesia, Depok, 16424, Indonesia
Nora Hariadi: Department of Mathematics, Universitas Indonesia, Depok, 16424, Indonesia

Journal volume & issue: Vol. 13
p. 100124

Abstract

Read online

Topic detection is a process for determining topics from a collection of textual data. One of the topic detection methods is clustering based, which assumes that the centroids are topics. The clustering method has the advantage that it can process data with negative representations. Therefore, the clustering method allows a combination with a broader-representation learning method. In this paper, we adopt deep learning for topic detection by using a deep autoencoder and fuzzy c-means called “deep autoencoder-based fuzzy c-means”. The encoder of the autoencoder performs a lower-dimensional representation learning. Fuzzy c-means groups the lower-dimensional representation to identify the centroids. The autoencoder's decoder transforms the centroids back into the original representation to be interpreted as the topics. Our simulation shows that deep autoencoder-based fuzzy c-means improves the coherence score of eigenspace-based fuzzy c-means and is comparable to the leading standard methods, i.e., nonnegative matrix factorization or latent Dirichlet allocation.

Published in Array

ISSN: 2590-0056 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.journals.elsevier.com/array

About the journal

Abstract

Keywords