Axioms (Aug 2022)

The Performance of Topic Evolution Based on a Feature Maximization Measurement for the Linguistics Domain

  • Junchao Feng,
  • Jianjun Miao,
  • Yue Tang,
  • Yuechen Li,
  • Jundong Feng

DOI
https://doi.org/10.3390/axioms11080412
Journal volume & issue
Vol. 11, no. 8
p. 412

Abstract

Read online

Understanding the performance of the data mining approach and topic evolution in a certain scientific domain is imperative to capturing key domain developments and facilitating knowledge transfer within and across domains. Our research selects linguistics as an exploratory domain and exploits the feature maximization (FM) measurement for feature selection, combined with the contrast ratio to conduct the diachronic analysis for the linguistics domain’s topics. To accurately mine the linguistics domain’s topics and obtain the optimal clustering model selection, we exploit an integrated method associated with the deep embedding for clustering (DEC) algorithm based on the keywords-based Text Representation Matrix (KTRM) and Lamirel’s EC index and test the performance of this method. The results show that the FM measurement is applicable in the linguistics domain for topic mining, and the combinatory method has the advantage of an unbiased clustering optimization model and applies to the design of non-parameter clustering and algorithms from the low dimension to the high dimension of datasets. The findings suggest that this approach could be suitable for a diachronic analysis of topic evolution and facilitate the performance of topic detection. In addition, these findings of text detection can rise to knowledge fusion cognition with the factor of language as an available research objective in interdisciplinary research.

Keywords