Journal of Big Data (Oct 2023)

EXABSUM: a new text summarization approach for generating extractive and abstractive summaries

  • Zakariae Alami Merrouni,
  • Bouchra Frikh,
  • Brahim Ouhbi

DOI
https://doi.org/10.1186/s40537-023-00836-y
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 34

Abstract

Read online

Abstract Due to the exponential growth of online information, the ability to efficiently extract the most informative content and target specific information without extensive reading is becoming increasingly valuable to readers. In this paper, we present 'EXABSUM,' a novel approach to Automatic Text Summarization (ATS), capable of generating the two primary types of summaries: extractive and abstractive. We propose two distinct approaches: (1) an extractive technique (EXABSUMExtractive), which integrates statistical and semantic scoring methods to select and extract relevant, non-repetitive sentences from a text unit, and (2) an abstractive technique (EXABSUMAbstractive), which employs a word graph approach (including compression and fusion stages) and re-ranking based on keyphrases to generate abstractive summaries using the source document as an input. In the evaluation conducted on multi-domain benchmarks, EXABSUM outperformed extractive summarization methods and demonstrated competitiveness against abstractive baselines.

Keywords