Implicit Human Feedback for Large Language Models: A Passive-Brain Computer Interfaces Study Proposal

Cited by: 0

Authors:

Gherman, Diana E. [1]

Zander, Thorsten O. [1]

Affiliations:

[1] Brandenburg Tech Univ Cottbus, Senftenberg, Germany

Source:

INFORMATION SYSTEMS AND NEUROSCIENCE, NEUROIS RETREAT 2024 | 2025 / Vol. 66

Keywords:

Passive BCI; LLM; Error-processing; Moral judgement;

DOI:

10.1007/978-3-031-71385-9_24

Abstract:

Large language models (LLMs) are transforming the way we work, learn, and access information. As our dependence on these tools grows, it becomes crucial to enhance their accuracy and ensure they align with our ethical standards. The most high-performing language models are currently trained and refined with the help of explicit human feedback. Here we propose a study that investigates the feasibility of implicit human feedback through passive brain-computer interfaces (pBCIs). Two calibration paradigms for moral judgment and error-perception elicitation and detection are described. The obtained classification models will be tested in an application phase with simulated chatbot conversations. If proven successful, pBCIs could provide novel and informative human implicit feedback in the process of LLM development.

Pages: 279-286

Page count: 8
