GDPR and Large Language Models: Technical and Legal Obstacles

被引:0
作者
Feretzakis, Georgios [1 ]
Vagena, Evangelia [2 ]
Kalodanis, Konstantinos [3 ]
Peristera, Paraskevi [4 ]
Kalles, Dimitris [1 ]
Anastasiou, Athanasios [5 ]
机构
[1] Hellen Open Univ, Sch Sci & Technol, Patras 26335, Greece
[2] Athens Univ Econ & Business, Athens 10434, Greece
[3] Harokopio Univ Athens, Dept Informat & Telemat, Kallithea 17676, Greece
[4] Stockholm Univ, Dept Psychol, Div Psychobiol & Epidemiol, S-10691 Stockholm, Sweden
[5] Natl Tech Univ Athens, Biomed Engn Lab, Athens 15780, Greece
关键词
GDPR; artificial intelligence; large language models; AI Act; LLM; LLMs; data privacy; AI; Legal Obstacles;
D O I
10.3390/fi17040151; 10.3390/fi17040151
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large Language Models (LLMs) have revolutionized natural language processing but present significant technical and legal challenges when confronted with the General Data Protection Regulation (GDPR). This paper examines the complexities involved in reconciling the design and operation of LLMs with GDPR requirements. In particular, we analyze how key GDPR provisions-including the Right to Erasure, Right of Access, Right to Rectification, and restrictions on Automated Decision-Making-are challenged by the opaque and distributed nature of LLMs. We discuss issues such as the transformation of personal data into non-interpretable model parameters, difficulties in ensuring transparency and accountability, and the risks of bias and data over-collection. Moreover, the paper explores potential technical solutions such as machine unlearning, explainable AI (XAI), differential privacy, and federated learning, alongside strategies for embedding privacy-by-design principles and automated compliance tools into LLM development. The analysis is further enriched by considering the implications of emerging regulations like the EU's Artificial Intelligence Act. In addition, we propose a four-layer governance framework that addresses data governance, technical privacy enhancements, continuous compliance monitoring, and explainability and oversight, thereby offering a practical roadmap for GDPR alignment in LLM systems. Through this comprehensive examination, we aim to bridge the gap between the technical capabilities of LLMs and the stringent data protection standards mandated by GDPR, ultimately contributing to more responsible and ethical AI practices.
引用
收藏
页数:26
相关论文
共 90 条
[51]  
Greenleaf Graham, 2019, Privacy Laws Business International Report, V157, P14
[52]  
Gupta V., 2021, P ADV NEURAL INFORM, VVolume 34
[53]  
Hamburg Commissioner for Data Protection and Freedom of Information (HmbBfDI), 2024, Discussion Paper: Large Language Models and Personal Data
[54]   The global landscape of AI ethics guidelines [J].
Jobin, Anna ;
Ienca, Marcello ;
Vayena, Effy .
NATURE MACHINE INTELLIGENCE, 2019, 1 (09) :389-399
[55]   Advances and Open Problems in Federated Learning [J].
Kairouz, Peter ;
McMahan, H. Brendan ;
Avent, Brendan ;
Bellet, Aurelien ;
Bennis, Mehdi ;
Bhagoji, Arjun Nitin ;
Bonawitz, Kallista ;
Charles, Zachary ;
Cormode, Graham ;
Cummings, Rachel ;
D'Oliveira, Rafael G. L. ;
Eichner, Hubert ;
El Rouayheb, Salim ;
Evans, David ;
Gardner, Josh ;
Garrett, Zachary ;
Gascon, Adria ;
Ghazi, Badih ;
Gibbons, Phillip B. ;
Gruteser, Marco ;
Harchaoui, Zaid ;
He, Chaoyang ;
He, Lie ;
Huo, Zhouyuan ;
Hutchinson, Ben ;
Hsu, Justin ;
Jaggi, Martin ;
Javidi, Tara ;
Joshi, Gauri ;
Khodak, Mikhail ;
Konecny, Jakub ;
Korolova, Aleksandra ;
Koushanfar, Farinaz ;
Koyejo, Sanmi ;
Lepoint, Tancrede ;
Liu, Yang ;
Mittal, Prateek ;
Mohri, Mehryar ;
Nock, Richard ;
Ozgur, Ayfer ;
Pagh, Rasmus ;
Qi, Hang ;
Ramage, Daniel ;
Raskar, Ramesh ;
Raykova, Mariana ;
Song, Dawn ;
Song, Weikang ;
Stich, Sebastian U. ;
Sun, Ziteng ;
Suresh, Ananda Theertha .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2021, 14 (1-2) :1-210
[56]   High-Risk AI Systems-Lie Detection Application [J].
Kalodanis, Konstantinos ;
Rizomiliotis, Panagiotis ;
Feretzakis, Georgios ;
Papapavlou, Charalampos ;
Anagnostopoulos, Dimosthenis .
FUTURE INTERNET, 2025, 17 (01)
[57]   European Artificial Intelligence Act: an AI security approach [J].
Kalodanis, Konstantinos ;
Rizomiliotis, Panagiotis ;
Anagnostopoulos, Dimosthenis .
INFORMATION AND COMPUTER SECURITY, 2024, 32 (03) :265-281
[58]  
Kamarinou D., 2016, Machine Learning with Personal Data
[59]  
Kaneko M, 2021, 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), P1256
[60]   Natural Questions: A Benchmark for Question Answering Research [J].
Kwiatkowski, Tom ;
Palomaki, Jennimaria ;
Redfield, Olivia ;
Collins, Michael ;
Parikh, Ankur ;
Alberti, Chris ;
Epstein, Danielle ;
Polosukhin, Illia ;
Devlin, Jacob ;
Lee, Kenton ;
Toutanova, Kristina ;
Jones, Llion ;
Kelcey, Matthew ;
Chang, Ming-Wei ;
Dai, Andrew M. ;
Uszkoreit, Jakob ;
Quoc Le ;
Petrov, Slav .
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2019, 7 :453-466