Tampering with Generative Artificial Intelligence by Jailbreaking

被引:0
作者
Claverini, Corrado [1 ]
机构
[1] Univ Salento, Lecce, Italy
来源
TEORIA-RIVISTA DI FILOSOFIA | 2024年 / 44卷 / 01期
关键词
ChatGPT; ethics of artificial intelligence; generative artificial in- telligence; jailbreaking; regulation of artificial intelligence;
D O I
10.4454/mg6wax06
中图分类号
B [哲学、宗教];
学科分类号
01 ; 0101 ;
摘要
In this paper, I will analyse the risks linked to the use of generative artificial intelligence systems and relative risk-reduction strategies, while concentrating in particular on the possibility of tampering with the chatbot ChatGPT by jailbreaking. After examining how a user can tamper with this generative AI, bypassing its ethical and legal restrictions, through a series of prompts, I will turn my focus to the ethical issues raised by the malicious use of this technology: are the transparency requirements requested of generative AI sufficient or should there be tighter restrictions that do not hinder the innovation and development of these technologies? How can the risk of tampering with these AI tools be lowered? And, should a breach take place, who is responsible: the AI developer or the jailbreaker? To what extent could the changes needed to prevent jailbreaking involuntarily generate or strengthen certain biases? In conclusion, I will uphold the necessity of ethical reflection for the sustainable and "human-centric" development of AI.
引用
收藏
页码:159 / 171
页数:172
相关论文
共 50 条
  • [1] Generative artificial intelligence in oncology
    Ganjavi, Conner
    Melamed, Sam
    Biedermann, Brett
    Eppler, Michael B.
    Rodler, Severin
    Layne, Ethan
    Cei, Francesco
    Gill, Inderbir
    Cacciamani, Giovanni E.
    CURRENT OPINION IN UROLOGY, 2025, 35 (03) : 205 - 213
  • [2] A Primer on Generative Artificial Intelligence
    Kalota, Faisal
    EDUCATION SCIENCES, 2024, 14 (02):
  • [3] Cybersecurity in the generative artificial intelligence era
    Teo, Zhen Ling
    Quek, Chrystie Wan Ning
    Wong, Joy Le Yi
    Ting, Daniel Shu Wei
    ASIA-PACIFIC JOURNAL OF OPHTHALMOLOGY, 2024, 13 (04):
  • [4] Generative artificial intelligence in ophthalmology
    Waisberg, Ethan
    Ong, Joshua
    Kamran, Sharif Amit
    Masalkhi, Mouayad
    Paladugu, Phani
    Zaman, Nasif
    Lee, Andrew G.
    Tavakkoli, Alireza
    SURVEY OF OPHTHALMOLOGY, 2025, 70 (01) : 1 - 11
  • [5] Applications of Generative Artificial Intelligence in the Software Industry
    Damyanov, Ivo
    Tsankov, Nikolay
    Nedyalkov, Iliya
    TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2024, 13 (04): : 2724 - 2733
  • [6] Generative artificial intelligence and ELT
    Moorhouse, Benjamin Luke
    ELT JOURNAL, 2024, 78 (04) : 378 - 392
  • [7] Generative artificial intelligence in surgery
    Rodler, Severin
    Ganjavi, Conner
    De Backer, Pieter
    Magoulianitis, Vasileios
    Ramacciotti, Lorenzo Storino
    Abreu, Andre Luis De Castro
    Gill, Inderbir S.
    Cacciamani, Giovanni E.
    SURGERY, 2024, 175 (06) : 1496 - 1502
  • [8] Applications of Generative Artificial Intelligence in the Judiciary: The Case of ChatGPT
    Huang, Huiyao
    INTERNATIONAL JOURNAL OF MULTIPHYSICS, 2024, 18 (02) : 403 - 411
  • [9] The Societal Impacts of Generative Artificial Intelligence: A Balanced Perspective
    Sabherwal, Rajiv
    Grover, Varun
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2024, 25 (01): : 13 - 22
  • [10] Earth Science Simulations with Generative Artificial Intelligence (GenAI)
    Choi, Yoon-Sung
    JOURNAL OF UNIVERSITY TEACHING AND LEARNING PRACTICE, 2025, 22 (01)