Fabrication and errors in the bibliographic citations generated by ChatGPT

William H. Walters; Esther Isabelle Wilder

doi:10.1038/s41598-023-41032-5

Scientific Reports (Sep 2023)

Fabrication and errors in the bibliographic citations generated by ChatGPT

William H. Walters,
Esther Isabelle Wilder

Affiliations

William H. Walters: Mary Alice & Tom O’Malley Library, Manhattan College
Esther Isabelle Wilder: Department of Sociology, Lehman College, The City University of New York

DOI: https://doi.org/10.1038/s41598-023-41032-5
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Although chatbots such as ChatGPT can facilitate cost-effective text generation and editing, factually incorrect responses (hallucinations) limit their utility. This study evaluates one particular type of hallucination: fabricated bibliographic citations that do not represent actual scholarly works. We used ChatGPT-3.5 and ChatGPT-4 to produce short literature reviews on 42 multidisciplinary topics, compiling data on the 636 bibliographic citations (references) found in the 84 papers. We then searched multiple databases and websites to determine the prevalence of fabricated citations, to identify errors in the citations to non-fabricated papers, and to evaluate adherence to APA citation format. Within this set of documents, 55% of the GPT-3.5 citations but just 18% of the GPT-4 citations are fabricated. Likewise, 43% of the real (non-fabricated) GPT-3.5 citations but just 24% of the real GPT-4 citations include substantive citation errors. Although GPT-4 is a major improvement over GPT-3.5, problems remain.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal