Implicit Human Feedback for Large Language Models: A Passive-Brain Computer Interfaces Study Proposal

被引:0
作者
Gherman, Diana E. [1 ]
Zander, Thorsten O. [1 ]
机构
[1] Brandenburg Tech Univ Cottbus, Senftenberg, Germany
来源
INFORMATION SYSTEMS AND NEUROSCIENCE, NEUROIS RETREAT 2024 | 2025年 / 66卷
关键词
Passive BCI; LLM; Error-processing; Moral judgement;
D O I
10.1007/978-3-031-71385-9_24
中图分类号
学科分类号
摘要
Large language models (LLMs) are transforming the way we work, learn, and access information. As our dependence on these tools grows, it becomes crucial to enhance their accuracy and ensure they align with our ethical standards. The most high-performing language models are currently trained and refined with the help of explicit human feedback. Here we propose a study that investigates the feasibility of implicit human feedback through passive brain-computer interfaces (pBCIs). Two calibration paradigms for moral judgment and error-perception elicitation and detection are described. The obtained classification models will be tested in an application phase with simulated chatbot conversations. If proven successful, pBCIs could provide novel and informative human implicit feedback in the process of LLM development.
引用
收藏
页码:279 / 286
页数:8
相关论文
共 27 条
[1]  
Andreessen L. M., 2023, Towards real-world applicability of neuroadaptive technologies: Investigating subject-independence, task-independence and versatility of passive braincomputer interfaces
[2]   Single-trial analysis and classification of ERP components - A tutorial [J].
Blankertz, Benjamin ;
Lemm, Steven ;
Treder, Matthias ;
Haufe, Stefan ;
Mueller, Klaus-Robert .
NEUROIMAGE, 2011, 56 (02) :814-825
[3]  
Brown D. S., 2019, Extrapolating beyond suboptimal demonstrations via inverse reinforcement learning from observations
[4]   Visual constraints in written word recognition: evidence from the optimal viewing-position effect [J].
Brysbaert, M ;
Nazir, T .
JOURNAL OF RESEARCH IN READING, 2005, 28 (03) :216-228
[5]  
Casper S, 2023, Arxiv, DOI [arXiv:2307.15217, DOI 10.48550/ARXIV.2307.15217]
[6]   Accessing world knowledge: Evidence from N400 and reaction time priming [J].
Chwilla, DJ ;
Kolk, HHJ .
COGNITIVE BRAIN RESEARCH, 2005, 25 (03) :589-606
[7]   Reinforcement learning: A survey [J].
Kaelbling, LP ;
Littman, ML ;
Moore, AW .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :237-285
[8]  
Kaufmann T., 2023, ECML PKDD 2023 WORKS
[9]   EVENT-RELATED BRAIN POTENTIALS TO GRAMMATICAL ERRORS AND SEMANTIC ANOMALIES [J].
KUTAS, M ;
HILLYARD, SA .
MEMORY & COGNITION, 1983, 11 (05) :539-550
[10]   Online processing of moral transgressions: ERP evidence for spontaneous evaluation [J].
Leuthold, Hartmut ;
Kunkel, Angelika ;
Mackenzie, Ian G. ;
Filik, Ruth .
SOCIAL COGNITIVE AND AFFECTIVE NEUROSCIENCE, 2015, 10 (08) :1021-1029