Machine-Generated Text: A Comprehensive Survey of Threat Models and Detection Methods

Evan N. Crothers; Nathalie Japkowicz; Herna L. Viktor

doi:10.1109/ACCESS.2023.3294090

IEEE Access (Jan 2023)

Machine-Generated Text: A Comprehensive Survey of Threat Models and Detection Methods

Evan N. Crothers,
Nathalie Japkowicz,
Herna L. Viktor

Affiliations

Evan N. Crothers: ORCiD; School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, Canada
Nathalie Japkowicz: ORCiD; Department of Computer Science, American University, Washington, DC, USA
Herna L. Viktor: ORCiD; School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, Canada

DOI: https://doi.org/10.1109/ACCESS.2023.3294090
Journal volume & issue: Vol. 11
pp. 70977 – 71002

Abstract

Read online

Machine-generated text is increasingly difficult to distinguish from text authored by humans. Powerful open-source models are freely available, and user-friendly tools that democratize access to generative models are proliferating. ChatGPT, which was released shortly after the first edition of this survey, epitomizes these trends. The great potential of state-of-the-art natural language generation (NLG) systems is tempered by the multitude of avenues for abuse. Detection of machine-generated text is a key countermeasure for reducing the abuse of NLG models, and presents significant technical challenges and numerous open problems. We provide a survey that includes 1) an extensive analysis of threat models posed by contemporary NLG systems and 2) the most complete review of machine-generated text detection methods to date. This survey places machine-generated text within its cybersecurity and social context, and provides strong guidance for future work addressing the most critical threat models. While doing so, we highlight the importance that detection systems themselves demonstrate trustworthiness through fairness, robustness, and accountability.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords