PeerJ Computer Science (Apr 2024)

Research on the prediction of English topic richness in the context of multimedia data

  • Jie Jiao,
  • Hanan Aljuaid

DOI
https://doi.org/10.7717/peerj-cs.1967
Journal volume & issue
Vol. 10
p. e1967

Abstract


As the Internet and multimedia technologies evolve, mining multimedia data to predict topic richness has practical value for public opinion monitoring and for competition over data discourse power. This study introduces a Transformer-based algorithm for predicting English topic richness, applied specifically to the Twitter platform. First, relevant data are extracted and organized following an analysis of Twitter’s characteristics. Next, a feature fusion approach is used to mine, extract, and construct features from Twitter blog posts and users, covering blog features, topic features, and user features, which are combined into multimodal features. Finally, the combined features are used to train a Transformer model. In experiments on the Twitter topic richness dataset, our algorithm achieves an accuracy of 82.3%, supporting the effectiveness of the proposed approach.
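
The abstract does not specify how the three feature groups are fused or how the Transformer is configured. Purely as an illustration, the sketch below treats the blog, topic, and user features as three equal-length tokens, applies one scaled dot-product self-attention step (the core operation of a Transformer), and mean-pools the result into one fused vector. The function names, the feature values, the equal dimensions, and the absence of learned projection weights are all assumptions made for this example, not details taken from the paper.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention with identity projections.

    A real Transformer layer learns query/key/value matrices; they are
    omitted here (assumption) to keep the sketch dependency-free.
    """
    d = len(tokens[0])
    attended = []
    for query in tokens:
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
            for key in tokens
        ]
        weights = softmax(scores)
        attended.append(
            [sum(w * value[i] for w, value in zip(weights, tokens)) for i in range(d)]
        )
    return attended


def fuse(blog, topic, user):
    """Attend across the three feature groups, then mean-pool into one vector."""
    tokens = [blog, topic, user]
    if len({len(t) for t in tokens}) != 1:
        raise ValueError("all feature groups must share one dimension")
    attended = self_attention(tokens)
    d = len(blog)
    return [sum(tok[i] for tok in attended) / len(attended) for i in range(d)]


# Hypothetical, hand-made feature values for illustration only.
blog_features = [0.2, 0.7, 0.1, 0.0]
topic_features = [0.9, 0.1, 0.3, 0.5]
user_features = [0.4, 0.4, 0.8, 0.2]

fused = fuse(blog_features, topic_features, user_features)
```

In a full model, a vector like `fused` would be passed to a classification head that outputs a richness label. Each fused component is a weighted average of the inputs in the same dimension, so it always lies between their minimum and maximum.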

Keywords