Exploring the Cognitive Neural Basis of Factuality in Abstractive Text Summarization Models: Interpretable Insights from EEG Signals

Zhejun Zhang; Yingqi Zhu; Yubo Zheng; Yingying Luo; Hengyi Shao; Shaoting Guo; Liang Dong; Lin Zhang; Lei Li

doi:10.3390/app14020875

Applied Sciences (Jan 2024)

Affiliations

Zhejun Zhang: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Yingqi Zhu: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Yubo Zheng: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Yingying Luo: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Hengyi Shao: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Shaoting Guo: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Liang Dong: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Lin Zhang: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
Lei Li: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China

DOI: https://doi.org/10.3390/app14020875
Journal volume & issue: Vol. 14, no. 2
p. 875

Abstract

(1) Background: Information overload challenges decision-making in the Industry 4.0 era. While Natural Language Processing (NLP), especially Automatic Text Summarization (ATS), offers solutions, issues with factual accuracy persist. This research bridges cognitive neuroscience and NLP, aiming to improve model interpretability. (2) Methods: This research examined four fact extraction techniques: dependency relation, named entity recognition, part-of-speech tagging, and TF-IDF, in order to explore their correlation with human EEG signals. Representational Similarity Analysis (RSA) was applied to gauge the relationship between language models and brain activity. (3) Results: Named entity recognition showed the highest sensitivity to EEG signals, marking the most significant differentiation between factual and non-factual words with a score of −0.99. The dependency relation followed with −0.90, while part-of-speech tagging and TF-IDF resulted in 0.07 and −0.52, respectively. Deep language models such as GloVe, BERT, and GPT-2 exhibited noticeable influences on RSA scores, highlighting the nuanced interplay between brain activity and these models. (4) Conclusions: Our findings emphasize the crucial role of named entity recognition and dependency relations in fact extraction and demonstrate the independent effects of different models and time windows of interest (TOIs) on RSA scores. These insights aim to refine algorithms to reflect human text processing better, thereby enhancing ATS models’ factual integrity.
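
To illustrate the kind of comparison RSA performs, the following is a minimal sketch, not the authors' pipeline. It assumes one feature vector per word from a language model and one EEG response pattern per word, builds a representational dissimilarity matrix (RDM) for each using 1 minus Pearson correlation, and rank-correlates the two RDMs. The function names, array shapes, and random data are all hypothetical, and the distance and correlation measures are common defaults rather than choices confirmed by the abstract.

```python
import numpy as np


def rdm(patterns):
    """Dissimilarity matrix: 1 - Pearson r between every pair of item rows."""
    return 1.0 - np.corrcoef(patterns)


def upper_triangle(matrix):
    """Off-diagonal upper triangle as a flat vector (each item pair once)."""
    rows, cols = np.triu_indices(matrix.shape[0], k=1)
    return matrix[rows, cols]


def spearman(a, b):
    """Spearman rank correlation; ties are not averaged (assumes continuous data)."""
    rank_a = np.argsort(np.argsort(a)).astype(float)
    rank_b = np.argsort(np.argsort(b)).astype(float)
    return float(np.corrcoef(rank_a, rank_b)[0, 1])


def rsa_score(model_patterns, brain_patterns):
    """Rank correlation between the model RDM and the brain RDM."""
    return spearman(upper_triangle(rdm(model_patterns)),
                    upper_triangle(rdm(brain_patterns)))


# Synthetic stand-ins (assumptions): 12 words, 50-dim embeddings, 32 EEG channels.
rng = np.random.default_rng(0)
n_words = 12
model_embeddings = rng.normal(size=(n_words, 50))
eeg_patterns = rng.normal(size=(n_words, 32))

score = rsa_score(model_embeddings, eeg_patterns)
self_score = rsa_score(model_embeddings, model_embeddings)
print(f"model vs. EEG RSA score: {score:.3f}")
print(f"model vs. itself RSA score: {self_score:.3f}")
```

In a setup like the one the abstract describes, such a score would presumably be computed separately for each model and each time window, which is how effects of model and TOI on RSA scores could be compared.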

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
