Tampering with Generative Artificial Intelligence by Jailbreaking

被引：0

作者：

Claverini, Corrado ^{[1
]}

机构：

[1] Univ Salento, Lecce, Italy

来源：

TEORIA-RIVISTA DI FILOSOFIA | 2024年 / 44卷 / 01期

关键词：

ChatGPT; ethics of artificial intelligence; generative artificial in- telligence; jailbreaking; regulation of artificial intelligence;

D O I：

10.4454/mg6wax06

中图分类号：

B [哲学、宗教];

学科分类号：

01 ; 0101 ;

摘要：

In this paper, I will analyse the risks linked to the use of generative artificial intelligence systems and relative risk-reduction strategies, while concentrating in particular on the possibility of tampering with the chatbot ChatGPT by jailbreaking. After examining how a user can tamper with this generative AI, bypassing its ethical and legal restrictions, through a series of prompts, I will turn my focus to the ethical issues raised by the malicious use of this technology: are the transparency requirements requested of generative AI sufficient or should there be tighter restrictions that do not hinder the innovation and development of these technologies? How can the risk of tampering with these AI tools be lowered? And, should a breach take place, who is responsible: the AI developer or the jailbreaker? To what extent could the changes needed to prevent jailbreaking involuntarily generate or strengthen certain biases? In conclusion, I will uphold the necessity of ethical reflection for the sustainable and "human-centric" development of AI.

引用

页码：159 / 171

页数：172

共 50 条

[21] Towards a Definition of Generative Artificial Intelligence
Raphael Ronge
Markus Maier
Benjamin Rathgeber
Philosophy & Technology, 2025, 38 (1)
[22] Generative artificial intelligence: Can ChatGPT write a quality abstract?
Babl, Franz E.
Babl, Maximilian P.
EMERGENCY MEDICINE AUSTRALASIA, 2023, 35 (05) : 809 - 811
[23] Generative Artificial Intelligence in Education, Part One: the Dynamic Frontier
Yu-Chang Hsu
Yu-Hui Ching
TechTrends, 2023, 67 : 603 - 607
[24] Generative Artificial Intelligence Terminology: A Primer for Clinicians and Medical Researchers
Melnyk, Oleksiy
Ismail, Ahmed
Ghorashi, Nima S.
Heekin, Mary
Javan, Ramin
CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (12)
[25] Unveiling the potential of generative artificial intelligence: a multidimensional journey into the future
Ooi, Keng-Boon
Koohang, Alex
Aw, Eugene Cheng-Xi
Cham, Tat-Huei
Cobanoglu, Cihan
Dennis, Charles
Dwivedi, Yogesh K.
Hew, Jun-Jie
Linton Kelly, Heather
Hughes, Laurie
Lin, Chieh-Yu
Mishra, Anubhav
Phau, Ian
Raman, Ramakrishnan
Sigala, Marianna
Tang, Yun-Chia
Wong, Lai-Wan
Tan, Garry Wei-Han
INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2025, 125 (02) : 417 - 432
[26] The new reality of education in the face of advances in generative artificial intelligence
Garcia-Penalvo, Francisco Jose
Llorens-Largo, Faraon
Vidal, Javier
RIED-REVISTA IBEROAMERICANA DE EDUCACION A DISTANCIA, 2024, 27 (01): : 9 - 39
[27] Integration of Generative Artificial Intelligence in Higher Education: Best Practices
Cordero, Jorge
Torres-Zambrano, Jonathan
Cordero-Castillo, Alison
EDUCATION SCIENCES, 2025, 15 (01):
[28] Generative Artificial Intelligence in Education, Part One: the Dynamic Frontier
Hsu, Yu-Chang
Ching, Yu-Hui
TECHTRENDS, 2023, 67 (04) : 603 - 607
[29] Generative Artificial Intelligence in Education, Part Two: International Perspectives
Hsu, Yu-Chang
Ching, Yu-Hui
TECHTRENDS, 2023, 67 (06) : 885 - 890
[30] Generative Artificial Intelligence, Human Agency and the Future of Cultural Heritage
Spennemann, Dirk H. R.
HERITAGE, 2024, 7 (07): : 3597 - 3609

← 1 2 3 4 5 →