IEEE Access (Jan 2023)

Detecting Favorite Topics in Computing Scientific Literature via Dynamic Topic Modeling

  • Rosa Virginia Encinas Quille,
  • Jose Melendez Barros,
  • Marcio Barbado Junior,
  • Felipe Valencia De Almeida,
  • Pedro Luiz Pizzigatti Correa

DOI
https://doi.org/10.1109/ACCESS.2023.3269660
Journal volume & issue
Vol. 11
pp. 41535 – 41545

Abstract

Read online

Topic modeling comprises a set of machine learning algorithms that allow topics to be extracted from a collection of documents. These algorithms have been widely used in many areas, such as identifying dominant topics in scientific research. However, works addressing such problems focus on identifying static topics, providing snapshots that cannot show how those topics evolve. Aiming to close this gap, in this article, we describe an approach for dynamic article set analysis and classification. This is accomplished by querying open data of notable scientific databases via representational state transfers. After that, we enforce data management practices with a dynamic topic modeling approach on the associated metadata available. As a result, we identify research trends for a given field at specific instants and the referred terminology trends evolution throughout the years. It was possible to detect the associated lexical variation over time in published content, ultimately determining the so-called “hot topics” in arbitrary instants and how they correlate.

Keywords