Real-time decoding of question-and-answer speech dialogue using human cortical activity

被引:151
作者
Moses, David A. [1 ,2 ]
Leonard, Matthew K. [1 ,2 ]
Makin, Joseph G. [1 ,2 ]
Chang, Edward F. [1 ,2 ]
机构
[1] UC San Francisco, Dept Neurol Surg, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
[2] UC San Francisco, Ctr Integrat Neurosci, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
关键词
HUMAN SENSORIMOTOR CORTEX; BRAIN-COMPUTER INTERFACE; ERROR;
D O I
10.1038/s41467-019-10994-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate.
引用
收藏
页数:14
相关论文
共 58 条
[1]  
[Anonymous], ICML WORKSH STAT MAC
[2]  
Bergstra J, 2013, INT C MACHINE LEARNI, P115
[3]  
Bergstra J, 2011, ADV NEURAL INFORM PR, P2546, DOI 10.5555/2986459.2986743
[4]   Human temporal lobe activation by speech and nonspeech sounds [J].
Binder, JR ;
Frost, JA ;
Hammeke, TA ;
Bellgowan, PSF ;
Springer, JA ;
Kaufman, JN ;
Possing, ET .
CEREBRAL CORTEX, 2000, 10 (05) :512-528
[5]   Neuroperceptual differences in consonant and vowel discrimination: As revealed by direct cortical electrical interference [J].
Boatman, D ;
Hall, C ;
Goldstein, MH ;
Lesser, R ;
Gordon, B .
CORTEX, 1997, 33 (01) :83-98
[6]  
Boersma P., 2001, GLOT INT, V5, P341
[7]   Functional organization of human sensorimotor cortex for speech articulation [J].
Bouchard, Kristofer E. ;
Mesgarani, Nima ;
Johnson, Keith ;
Chang, Edward F. .
NATURE, 2013, 495 (7441) :327-332
[8]   A survey on self-assessed well-being in a cohort of chronic locked-in syndrome patients: happy majority, miserable minority [J].
Bruno, Marie-Aurelie ;
Bernheim, Jan L. ;
Ledoux, Didier ;
Pellas, Frederic ;
Demertzi, Athena ;
Laureys, Steven .
BMJ OPEN, 2011, 1 (01)
[9]   Spatiotemporal dynamics of word processing in the human brain [J].
Canolty, Ryan T. ;
Soltani, Maryam ;
Dalal, Sarang S. ;
Edwards, Erik ;
Dronkers, Nina F. ;
Nagarajan, Srikantan S. ;
Kirsch, Heidi E. ;
Barbaro, Nicholas M. ;
Knight, Robert T. .
FRONTIERS IN NEUROSCIENCE, 2007, 1 (01) :185-196
[10]   Functional and Quantitative MRI Mapping of Somatomotor Representations of Human Supralaryngeal Vocal Tract [J].
Carey, Daniel ;
Krishnan, Saloni ;
Callaghan, Martina F. ;
Sereno, Martin I. ;
Dick, Frederic .
CEREBRAL CORTEX, 2017, 27 (01) :265-278