ConstitutionMaker: Interactively Critiquing Large Language Models by Converting Feedback into Principles

被引:0
|
作者
Petridis, Savvas [1 ]
Wedin, Ben [2 ]
Wexler, James [2 ]
Donsbach, Aaron [3 ]
Pushkarna, Mahima [2 ]
Goyal, Nitesh [1 ]
Cai, Carrie J. [4 ]
Terry, Michael [2 ]
机构
[1] Google Res, New York, NY 10011 USA
[2] Google Res, Cambridge, MA USA
[3] Google Res, Seattle, WA USA
[4] Google Res, Mountain View, CA USA
来源
PROCEEDINGS OF 2024 29TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2024 | 2024年
关键词
Large Language Models; Generative AI; Conversational AI; Interactive Critique; Feedback; CHATBOT;
D O I
10.1145/3640543.3645144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language model (LLM) prompting is a promising new approach for users to create and customize their own chatbots. However, current methods for steering a chatbot's outputs, such as prompt engineering and fine-tuning, do not support users in converting their natural feedback on the model's outputs to changes in the prompt or model. In this work, we explore how to enable users to interactively refine model outputs through their feedback, by helping them convert their feedback into a set of principles (i.e. a constitution) that dictate the model's behavior. From a formative study, we (1) found that users needed support converting their feedback into principles for the chatbot and (2) classified the different principle types desired by users. Inspired by these findings, we developed ConstitutionMaker, an interactive tool for converting user feedback into principles, to steer LLM-based chatbots. With ConstitutionMaker, users can provide either positive or negative feedback in natural language, select auto-generated feedback, or rewrite the chatbot's response; each mode of feedback automatically generates a principle that is inserted into the chatbot's prompt. In a user study with 14 participants, we compare ConstitutionMaker to an ablated version, where users write their own principles. With ConstitutionMaker, participants felt that their principles could better guide the chatbot, that they could more easily convert their feedback into principles, and that they could write principles more efficiently, with less mental demand. ConstitutionMaker helped users identify ways to improve the chatbot, formulate their intuitive responses to the model into feedback, and convert this feedback into specific and clear principles. Together, these findings inform future tools that support the interactive critiquing of LLM outputs.
引用
收藏
页码:853 / 868
页数:16
相关论文
共 50 条
  • [41] Large language models: a survey of their development, capabilities, and applications
    Annepaka, Yadagiri
    Pakray, Partha
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (03) : 2967 - 3022
  • [42] Natural language processing in the era of large language models
    Zubiaga, Arkaitz
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 6
  • [43] ChemGen: Towards Understanding First-Principles Calculation Code Generation Based on Large Language Models
    Gao, Peng
    Qiu, Feng
    Hua, Baojian
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 281 - 287
  • [44] Structuring Natural Language Requirements with Large Language Models
    Norheim, Johannes J.
    Rebentisch, Eric
    32ND INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS, REW 2024, 2024, : 68 - 71
  • [45] The Breakthrough of Large Language Models Release for Medical Applications: 1-Year Timeline and Perspectives
    Cascella, Marco
    Semeraro, Federico
    Montomoli, Jonathan
    Bellini, Valentina
    Piazza, Ornella
    Bignami, Elena
    JOURNAL OF MEDICAL SYSTEMS, 2024, 48 (01)
  • [46] Evolution and Prospects of Foundation Models: From Large Language Models to Large Multimodal Models
    Chen, Zheyi
    Xu, Liuchang
    Zheng, Hongting
    Chen, Luyao
    Tolba, Amr
    Zhao, Liang
    Yu, Keping
    Feng, Hailin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 1753 - 1808
  • [47] Embracing Large Language Models for Medical Applications: Opportunities and Challenges
    Karabacak, Mert
    Margetis, Konstantinos
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (05)
  • [48] Position Paper: Leveraging Large Language Models for Cybersecurity Compliance
    Salman, Ahmed
    Creese, Sadie
    Goldsmith, Michael
    9TH IEEE EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS, EUROS&PW 2024, 2024, : 496 - 503
  • [49] Large Language Models for Software Engineering: Survey and Open Problems
    Fan, Angela
    Gokkaya, Beliz
    Harman, Mark
    Lyubarskiy, Mitya
    Sengupta, Shubho
    Yoo, Shin
    Zhang, Jie M.
    2023 IEEE/ACM INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: FUTURE OF SOFTWARE ENGINEERING, ICSE-FOSE, 2023, : 31 - 53
  • [50] Utility of large language models for creating clinical assessment items
    Lam, George
    Shammoon, Yusra
    Coulson, Anna
    Lalloo, Felicity
    Maini, Arti
    Amin, Anjali
    Brown, Celia
    Sam, Amir H.
    MEDICAL TEACHER, 2024,