Explaining Misinformation Detection Using Large Language Models

Vishnu S. Pendyala; Christopher E. Hall

doi:10.3390/electronics13091673

Electronics (Apr 2024)

Explaining Misinformation Detection Using Large Language Models

Vishnu S. Pendyala,
Christopher E. Hall

Affiliations

Vishnu S. Pendyala: Department of Applied Data Science, San Jose State University, San Jose, CA 95192, USA
Christopher E. Hall: Department of Computer Science, San Jose State University, San Jose, CA 95192, USA

DOI: https://doi.org/10.3390/electronics13091673
Journal volume & issue: Vol. 13, no. 9
p. 1673

Abstract

Read online

Large language models (LLMs) are a compressed repository of a vast corpus of valuable information on which they are trained. Therefore, this work hypothesizes that LLMs such as Llama, Orca, Falcon, and Mistral can be used for misinformation detection by making them cross-check new information with the repository on which they are trained. Accordingly, this paper describes the findings from the investigation of the abilities of LLMs in detecting misinformation on multiple datasets. The results are interpreted using explainable AI techniques such as Local Interpretable Model-Agnostic Explanations (LIME), SHapley Additive exPlanations (SHAP), and Integrated Gradients. The LLMs themselves are also asked to explain their classification. These complementary approaches aid in better understanding the inner workings of misinformation detection using LLMs and lead to conclusions about their effectiveness at the task. The methodology is generic and nothing specific is assumed for any of the LLMs, so the conclusions apply generally. Primarily, when it comes to misinformation detection, the experiments show that the LLMs are limited by the data on which they are trained.

Published in Electronics

ISSN: 2079-9292 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics
Website: http://www.mdpi.com/journal/electronics

About the journal

Abstract

Keywords