A survey on large language model (LLM) security and privacy: The Good, The Bad, and The Ugly

Yifan Yao; Jinhao Duan; Kaidi Xu; Yuanfang Cai; Zhibo Sun; Yue Zhang

High-Confidence Computing (Jun 2024)

A survey on large language model (LLM) security and privacy: The Good, The Bad, and The Ugly

Yifan Yao,
Jinhao Duan,
Kaidi Xu,
Yuanfang Cai,
Zhibo Sun,
Yue Zhang

Affiliations

Yifan Yao: Department of Computer Science, Drexel University, Philadelphia, PA 19104, USA
Jinhao Duan: Department of Computer Science, Drexel University, Philadelphia, PA 19104, USA
Kaidi Xu: Department of Computer Science, Drexel University, Philadelphia, PA 19104, USA
Yuanfang Cai: Department of Computer Science, Drexel University, Philadelphia, PA 19104, USA
Zhibo Sun: Department of Computer Science, Drexel University, Philadelphia, PA 19104, USA
Yue Zhang: Corresponding author.; Department of Computer Science, Drexel University, Philadelphia, PA 19104, USA

Journal volume & issue: Vol. 4, no. 2
p. 100211

Abstract

Read online

Large Language Models (LLMs), such as ChatGPT and Bard, have revolutionized natural language understanding and generation. They possess deep language comprehension, human-like text generation capabilities, contextual awareness, and robust problem-solving skills, making them invaluable in various domains (e.g., search engines, customer support, translation). In the meantime, LLMs have also gained traction in the security community, revealing security vulnerabilities and showcasing their potential in security-related tasks. This paper explores the intersection of LLMs with security and privacy. Specifically, we investigate how LLMs positively impact security and privacy, potential risks and threats associated with their use, and inherent vulnerabilities within LLMs. Through a comprehensive literature review, the paper categorizes the papers into “The Good” (beneficial LLM applications), “The Bad” (offensive applications), and “The Ugly” (vulnerabilities of LLMs and their defenses). We have some interesting findings. For example, LLMs have proven to enhance code security (code vulnerability detection) and data privacy (data confidentiality protection), outperforming traditional methods. However, they can also be harnessed for various attacks (particularly user-level attacks) due to their human-like reasoning abilities. We have identified areas that require further research efforts. For example, Research on model and parameter extraction attacks is limited and often theoretical, hindered by LLM parameter scale and confidentiality. Safe instruction tuning, a recent development, requires more exploration. We hope that our work can shed light on the LLMs’ potential to both bolster and jeopardize cybersecurity.

Published in High-Confidence Computing

ISSN: 2667-2952 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.journals.elsevier.com/high-confidence-computing

About the journal

Abstract

Keywords