Engineering conversation: Understanding the control requirements of language production in monologue and dialogue

被引:0
作者
Gambi, Chiara [1 ,2 ]
Zhang, Fan [3 ]
Pickering, Martin J. [4 ]
机构
[1] Univ Warwick, Dept Psychol, Warwick CV4 7AL, England
[2] Cardiff Univ, Sch Psychol, Cardiff CF10 3AT, Wales
[3] Univ Sussex, Sch Engn & Informat, Dept Engn & Design, Brighton BN1 9QJ, England
[4] Univ Edinburgh, Dept Psychol, Edinburgh EH8 9JZ, Scotland
关键词
Control theory; Forward models; Language production; Planning; Dialogue; INTERMITTENT CONTROL; PREDICTION ERROR; TURN-TAKING; SPEECH; MODEL; COMPREHENSION; EXPECTATION; PERCEPTION; BRAIN; TEXT;
D O I
10.1016/j.jneuroling.2024.101229
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Both artificial and biological systems are faced with the challenge of noisy and uncertain estimation of the state of the world, in contexts where feedback is often delayed. This challenge also applies to the processes of language production and comprehension, both when they take place in isolation (e.g., in monologue or solo reading) and when they are combined as is the case in dialogue. Crucially, we argue, dialogue brings with it some unique challenges. In this paper, we describe three such challenges within the general framework of control theory, drawing analogies to mechanical and biological systems where possible: (1) the need to distinguish between self- and other-generated utterances; (2) the need to adjust the amount of advance planning (i.e., the degree to which planning precedes articulation) flexibly to achieve timely turn-taking; (3) the need to track changing conversational goals. We show that message-to-sound models of language production (i.e., those that cover the whole process from message generation to articulation) tend to implement fairly simple control architectures. However, we argue that more sophisticated control architectures are necessary to build language production models that can account for both monologue and dialogue.
引用
收藏
页数:12
相关论文
共 96 条
[21]   You cannot speak and listen at the same time: a probabilistic model of turn-taking [J].
Donnarumma, Francesco ;
Dindo, Haris ;
Iodice, Pierpaolo ;
Pezzulo, Giovanni .
BIOLOGICAL CYBERNETICS, 2017, 111 (02) :165-183
[22]   The Influence of Categories on Perception: Explaining the Perceptual Magnet Effect as Optimal Statistical Inference [J].
Feldman, Naomi H. ;
Griffiths, Thomas L. ;
Morgan, James L. .
PSYCHOLOGICAL REVIEW, 2009, 116 (04) :752-782
[23]   Language ERPs reflect learning through prediction error propagation [J].
Fitz, Hartmut ;
Chang, Franklin .
COGNITIVE PSYCHOLOGY, 2019, 111 :15-52
[24]   Using Speakers' Referential Intentions to Model Early Cross-Situational Word Learning [J].
Frank, Michael C. ;
Goodman, Noah D. ;
Tenenbaum, Joshua B. .
PSYCHOLOGICAL SCIENCE, 2009, 20 (05) :578-585
[25]   A Duet for one [J].
Friston, Karl ;
Frith, Christopher .
CONSCIOUSNESS AND COGNITION, 2015, 36 :390-405
[26]   Active inference, communication and hermeneutics [J].
Friston, Karl J. ;
Frith, Christopher D. .
CORTEX, 2015, 68 :129-143
[27]   Temporal Predictive Codes for Spoken Words in Auditory Cortex [J].
Gagnepain, Pierre ;
Henson, Richard N. ;
Davis, Matthew H. .
CURRENT BIOLOGY, 2012, 22 (07) :615-621
[28]  
Gambi C., 2017, The Handbook of Psycholinguistics, P157
[29]   The use of content and timing to predict turn transitions [J].
Garrod, Simon ;
Pickering, Martin J. .
FRONTIERS IN PSYCHOLOGY, 2015, 6
[30]  
Gauvin Hanna S, 2020, J Cogn, V3, P17, DOI 10.5334/joc.81