GDPR and Large Language Models: Technical and Legal Obstacles

Georgios Feretzakis; Evangelia Vagena; Konstantinos Kalodanis; Paraskevi Peristera; Dimitris Kalles; Athanasios Anastasiou

doi:10.3390/fi17040151

Future Internet (Mar 2025)

GDPR and Large Language Models: Technical and Legal Obstacles

Georgios Feretzakis,
Evangelia Vagena,
Konstantinos Kalodanis,
Paraskevi Peristera,
Dimitris Kalles,
Athanasios Anastasiou

Affiliations

Georgios Feretzakis: School of Science and Technology, Hellenic Open University, 26335 Patras, Greece
Evangelia Vagena: Athens University of Economics and Business, 10434 Athens, Greece
Konstantinos Kalodanis: Department of Informatics and Telematics, Harokopio University of Athens, 17676 Kallithea, Greece
Paraskevi Peristera: Division of Psychobiology and Epidemiology, Department of Psychology, Stockholm University, 10691 Stockholm, Sweden
Dimitris Kalles: School of Science and Technology, Hellenic Open University, 26335 Patras, Greece
Athanasios Anastasiou: Biomedical Engineering Laboratory, National Technical University of Athens, 15780 Athens, Greece

DOI: https://doi.org/10.3390/fi17040151
Journal volume & issue: Vol. 17, no. 4
p. 151

Abstract

Read online

Large Language Models (LLMs) have revolutionized natural language processing but present significant technical and legal challenges when confronted with the General Data Protection Regulation (GDPR). This paper examines the complexities involved in reconciling the design and operation of LLMs with GDPR requirements. In particular, we analyze how key GDPR provisions—including the Right to Erasure, Right of Access, Right to Rectification, and restrictions on Automated Decision-Making—are challenged by the opaque and distributed nature of LLMs. We discuss issues such as the transformation of personal data into non-interpretable model parameters, difficulties in ensuring transparency and accountability, and the risks of bias and data over-collection. Moreover, the paper explores potential technical solutions such as machine unlearning, explainable AI (XAI), differential privacy, and federated learning, alongside strategies for embedding privacy-by-design principles and automated compliance tools into LLM development. The analysis is further enriched by considering the implications of emerging regulations like the EU’s Artificial Intelligence Act. In addition, we propose a four-layer governance framework that addresses data governance, technical privacy enhancements, continuous compliance monitoring, and explainability and oversight, thereby offering a practical roadmap for GDPR alignment in LLM systems. Through this comprehensive examination, we aim to bridge the gap between the technical capabilities of LLMs and the stringent data protection standards mandated by GDPR, ultimately contributing to more responsible and ethical AI practices.

Published in Future Internet

ISSN: 1999-5903 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/futureinternet/

About the journal

Abstract

Keywords