Who Needs External References?—Text Summarization Evaluation Using Original Documents

Abdullah Al Foysal; Ronald Böck

doi:10.3390/ai4040049

AI (Nov 2023)

Who Needs External References?—Text Summarization Evaluation Using Original Documents

Abdullah Al Foysal,
Ronald Böck

Affiliations

Abdullah Al Foysal: Research Division, Genie Enterprise, Donnersbergweg 1, 67059 Ludwigshafen, Germany
Ronald Böck: Research Division, Genie Enterprise, Donnersbergweg 1, 67059 Ludwigshafen, Germany

DOI: https://doi.org/10.3390/ai4040049
Journal volume & issue: Vol. 4, no. 4
pp. 970 – 995

Abstract

Read online

Nowadays, individuals can be overwhelmed by a huge number of documents being present in daily life. Capturing the necessary details is often a challenge. Therefore, it is rather important to summarize documents to obtain the main information quickly. There currently exist automatic approaches to this task, but their quality is often not properly assessed. State-of-the-art metrics rely on human-generated summaries as a reference for the evaluation. If no reference is given, the assessment will be challenging. Therefore, in the absence of human-generated reference summaries, we investigated an alternative approach to how machine-generated summaries can be evaluated. For this, we focus on the original text or document to retrieve a metric that allows a direct evaluation of automatically generated summaries. This approach is particularly helpful in cases where it is difficult or costly to find reference summaries. In this paper, we present a novel metric called Summary Score without Reference—SUSWIR—which is based on four factors already known in the text summarization community: Semantic Similarity, Redundancy, Relevance, and Bias Avoidance Analysis, overcoming drawbacks of common metrics. Therefore, we aim to close a gap in the current evaluation environment for machine-generated text summaries. The novel metric is introduced theoretically and tested on five datasets from their respective domains. The conducted experiments yielded noteworthy outcomes, employing the utilization of SUSWIR.

Published in AI

ISSN: 2673-2688 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/ai

About the journal

Abstract

Keywords