Shortcut Learning Explanations for Deep Natural Language Processing: A Survey on Dataset Biases

Varun Dogra; Sahil Verma; Kavita; Marcin Wozniak; Jana Shafi; Muhammad Fazal Ijaz

doi:10.1109/ACCESS.2024.3360306

IEEE Access (Jan 2024)

Affiliations

Varun Dogra: School of Computer Science and Engineering, Lovely Professional University, Phagwara, Punjab, India
Sahil Verma: Uttaranchal University, Dehradun, India
Kavita: Uttaranchal University, Dehradun, India
Marcin Wozniak: Faculty of Applied Mathematics, Silesian University of Technology, Gliwice, Poland
Jana Shafi: Department of Computer Engineering and Information, College of Engineering in Wadi Alddawasir, Prince Sattam Bin Abdulaziz University, Wadi Alddawasir, Saudi Arabia
Muhammad Fazal Ijaz: School of IT and Engineering, Melbourne Institute of Technology, Melbourne, VIC, Australia

DOI: https://doi.org/10.1109/ACCESS.2024.3360306
Journal volume & issue: Vol. 12, pp. 26183–26195

Abstract

Pre-trained large language models (LLMs), fine-tuned on task-specific datasets, have transformed NLP and enabled notable advances in news classification, language translation, and sentiment analysis. However, bias in textual data has become a critical focus in the NLP community, because it reveals the inherent limitations of models trained on specific datasets. LLMs exploit dataset biases and artifacts as expedient shortcuts for prediction, and this reliance has hindered their generalizability and adversarial robustness. Addressing this issue is crucial to enhancing the reliability and resilience of LLMs in various contexts. This survey provides a comprehensive overview of the rapidly growing body of research on shortcut learning in language models, classifying it into four main areas: the factors behind shortcut learning, the origins of bias, methods for detecting dataset biases, and mitigation strategies for addressing them. The goal of this study is to offer a contextualized, in-depth look at the state of learning models, highlighting the major areas of attention and suggesting possible directions for further research.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
