Mathematics (Apr 2023)

Multi-Document News Web Page Summarization Using Content Extraction and Lexical Chain Based Key Phrase Extraction

  • Chandrakala Arya,
  • Manoj Diwakar,
  • Prabhishek Singh,
  • Vijendra Singh,
  • Seifedine Kadry,
  • Jungeun Kim

DOI
https://doi.org/10.3390/math11081762
Journal volume & issue
Vol. 11, no. 8
p. 1762

Abstract

Read online

In the area of text summarization, there have been significant advances recently. In the meantime, the current trend in text summarization is focused more on news summarization. Therefore, developing a synthesis approach capable of extracting, comparing, and ranking sentences is vital to create a summary of various news articles in the context of erroneous online data. It is necessary, however, for the news summarization system to be able to deal with multi-document summaries due to content redundancy. This paper presents a method for summarizing multi-document news web pages based on similarity models and sentence ranking, where relevant sentences are extracted from the original article. English-language articles are collected from five news websites that cover the same topic and event. According to our experimental results, our approach provides better results than other recent methods for summarizing news.

Keywords