ConstitutionMaker: Interactively Critiquing Large Language Models by Converting Feedback into Principles

被引:0
|
作者
Petridis, Savvas [1 ]
Wedin, Ben [2 ]
Wexler, James [2 ]
Donsbach, Aaron [3 ]
Pushkarna, Mahima [2 ]
Goyal, Nitesh [1 ]
Cai, Carrie J. [4 ]
Terry, Michael [2 ]
机构
[1] Google Res, New York, NY 10011 USA
[2] Google Res, Cambridge, MA USA
[3] Google Res, Seattle, WA USA
[4] Google Res, Mountain View, CA USA
来源
PROCEEDINGS OF 2024 29TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2024 | 2024年
关键词
Large Language Models; Generative AI; Conversational AI; Interactive Critique; Feedback; CHATBOT;
D O I
10.1145/3640543.3645144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language model (LLM) prompting is a promising new approach for users to create and customize their own chatbots. However, current methods for steering a chatbot's outputs, such as prompt engineering and fine-tuning, do not support users in converting their natural feedback on the model's outputs to changes in the prompt or model. In this work, we explore how to enable users to interactively refine model outputs through their feedback, by helping them convert their feedback into a set of principles (i.e. a constitution) that dictate the model's behavior. From a formative study, we (1) found that users needed support converting their feedback into principles for the chatbot and (2) classified the different principle types desired by users. Inspired by these findings, we developed ConstitutionMaker, an interactive tool for converting user feedback into principles, to steer LLM-based chatbots. With ConstitutionMaker, users can provide either positive or negative feedback in natural language, select auto-generated feedback, or rewrite the chatbot's response; each mode of feedback automatically generates a principle that is inserted into the chatbot's prompt. In a user study with 14 participants, we compare ConstitutionMaker to an ablated version, where users write their own principles. With ConstitutionMaker, participants felt that their principles could better guide the chatbot, that they could more easily convert their feedback into principles, and that they could write principles more efficiently, with less mental demand. ConstitutionMaker helped users identify ways to improve the chatbot, formulate their intuitive responses to the model into feedback, and convert this feedback into specific and clear principles. Together, these findings inform future tools that support the interactive critiquing of LLM outputs.
引用
收藏
页码:853 / 868
页数:16
相关论文
共 50 条
  • [21] Evaluating Language Models for Generating and Judging Programming Feedback
    Koutcheme, Charles
    Dainese, Nicola
    Sarsa, Sami
    Hellas, Arto
    Leinonen, Juho
    Ashraf, Syed
    Denny, Paul
    PROCEEDINGS OF THE 56TH ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, SIGCSE TS 2025, VOL 2, 2025, : 624 - 630
  • [22] Implementing Artificial Intelligence in Physiotherapy Education: A Case Study on the Use of Large Language Models (LLM) to Enhance Feedback
    Villagran, Ignacio
    Hernandez, Rocio
    Schuit, Gregory
    Neyem, Andres
    Fuentes-Cimma, Javiera
    Miranda, Constanza
    Hilliger, Isabel
    Duran, Valentina
    Escalona, Gabriel
    Varas, Julian
    IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2024, 17 : 2079 - 2090
  • [23] Knowledge management in organization and the large language models
    Zelenkov, Yu. A.
    ROSSIISKII ZHURNAL MENEDZHMENTA-RUSSIAN MANAGEMENT JOURNAL, 2024, 22 (03): : 573 - 601
  • [24] Software Modeling Assistance with Large Language Models
    Ben Chaaben, Meriem
    ACM/IEEE 27TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS: COMPANION PROCEEDINGS, MODELS 2024, 2024, : 188 - 191
  • [25] Debiasing large language models: research opportunities
    Yogarajan, Vithya
    Dobbie, Gillian
    Keegan, Te Taka
    JOURNAL OF THE ROYAL SOCIETY OF NEW ZEALAND, 2025, 55 (02) : 372 - 395
  • [26] Applying Large Language Models for intelligent industrial automation From theory to application: Towards autonomous systems with Large Language Models
    Xia, Yuchen
    Jazdi, Nasser
    Weyrich, Michael
    ATP MAGAZINE, 2024, (6-7):
  • [27] A Generative Artificial Intelligence Using Multilingual Large Language Models for ChatGPT Applications
    Tuan, Nguyen Trung
    Moore, Philip
    Thanh, Dat Ha Vu
    Pham, Hai Van
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [28] The Promises and Pitfalls of Large Language Models as Feedback Providers: A Study of Prompt Engineering and the Quality of AI-Driven Feedback
    Jacobsen, Lucas Jasper
    Weber, Kira Elena
    AI, 2025, 6 (02)
  • [29] Imitation and Large Language Models
    Boisseau, Eloise
    MINDS AND MACHINES, 2024, 34 (04)
  • [30] Large language models and psychiatry
    Orru, Graziella
    Melis, Giulia
    Sartori, Giuseppe
    INTERNATIONAL JOURNAL OF LAW AND PSYCHIATRY, 2025, 101