Applied Sciences (Nov 2021)

Improved Text Summarization of News Articles Using GA-HC and PSO-HC

  • Muhammad Mohsin,
  • Shazad Latif,
  • Muhammad Haneef,
  • Usman Tariq,
  • Muhammad Attique Khan,
  • Sefedine Kadry,
  • Hwan-Seung Yong,
  • Jung-In Choi

DOI
https://doi.org/10.3390/app112210511
Journal volume & issue
Vol. 11, no. 22
p. 10511

Abstract

Read online

Automatic Text Summarization (ATS) is gaining attention because a large volume of data is being generated at an exponential rate. Due to easy internet availability globally, a large amount of data is being generated from social networking websites, news websites and blog websites. Manual summarization is time consuming, and it is difficult to read and summarize a large amount of content. Automatic text summarization is the solution to deal with this problem. This study proposed two automatic text summarization models which are Genetic Algorithm with Hierarchical Clustering (GA-HC) and Particle Swarm Optimization with Hierarchical Clustering (PSO-HC). The proposed models use a word embedding model with Hierarchal Clustering Algorithm to group sentences conveying almost same meaning. Modified GA and adaptive PSO based sentence ranking models are proposed for text summary in news text documents. Simulations are conducted and compared with other understudied algorithms to evaluate the performance of proposed methodology. Simulations results validate the superior performance of the proposed methodology.

Keywords