Neuroprosthesis for Decoding Speech in a Paralyzed Person with Anarthria

被引:255
作者
Moses, David A. [1 ,2 ]
Metzger, Sean L. [1 ,2 ,5 ]
Liu, Jessie R. [1 ,2 ,5 ]
Anumanchipalli, Gopala K. [1 ,2 ]
Makin, Joseph G. [1 ,2 ]
Sun, Pengfei F. [1 ,2 ]
Chartier, Josh [1 ,2 ]
Dougherty, Maximilian E. [1 ]
Liu, Patricia M. [3 ]
Abrams, Gary M. [4 ]
Tu-Chan, Adelyn [4 ]
Ganguly, Karunesh [2 ,4 ]
Chang, Edward F. [1 ,2 ,5 ]
机构
[1] Univ Calif San Francisco, Dept Neurol Surg, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Weill Inst Neurosci, San Francisco, CA 94143 USA
[3] Univ Calif San Francisco, Dept Rehabil Serv, San Francisco, CA 94143 USA
[4] Univ Calif San Francisco, Dept Neurol, San Francisco, CA USA
[5] UCSF, Univ Calif Berkeley, Grad Program Bioengn, Berkeley, CA 94143 USA
关键词
COMMUNICATION;
D O I
10.1056/NEJMoa2027540
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Technology to restore the ability to communicate in paralyzed persons who cannot speak has the potential to improve autonomy and quality of life. An approach that decodes words and sentences directly from the cerebral cortical activity of such patients may represent an advancement over existing methods for assisted communication. Methods We implanted a subdural, high-density, multielectrode array over the area of the sensorimotor cortex that controls speech in a person with anarthria (the loss of the ability to articulate speech) and spastic quadriparesis caused by a brain-stem stroke. Over the course of 48 sessions, we recorded 22 hours of cortical activity while the participant attempted to say individual words from a vocabulary set of 50 words. We used deep-learning algorithms to create computational models for the detection and classification of words from patterns in the recorded cortical activity. We applied these computational models, as well as a natural-language model that yielded next-word probabilities given the preceding words in a sequence, to decode full sentences as the participant attempted to say them. Results We decoded sentences from the participant's cortical activity in real time at a median rate of 15.2 words per minute, with a median word error rate of 25.6%. In post hoc analyses, we detected 98% of the attempts by the participant to produce individual words, and we classified words with 47.1% accuracy using cortical signals that were stable throughout the 81-week study period. Conclusions In a person with anarthria and spastic quadriparesis caused by a brain-stem stroke, words and sentences were decoded directly from cortical activity during attempted speech with the use of deep-learning models and a natural-language model. (Funded by Facebook and others; ClinicalTrials.gov number, NCT03698149.) Detecting Speech in a Paralyzed Person with Anarthria In a person with anarthria and spastic quadriparesis caused by a brain-stem stroke, signals from the cortical speech area were processed with signal analysis and a natural-language model. It was possible to decode words and sentences with high accuracy when the patient attempted to produce language.
引用
收藏
页码:217 / 227
页数:11
相关论文
共 39 条
[1]   Speech synthesis from ECoG using densely connected 3D convolutional neural networks [J].
Angrick, Miguel ;
Herff, Christian ;
Mugler, Emily ;
Tate, Matthew C. ;
Slutzky, Marc W. ;
Krusienski, Dean J. ;
Schultz, Tanja .
JOURNAL OF NEURAL ENGINEERING, 2019, 16 (03)
[2]  
[Anonymous], 2014, WORKSH TRACK P 2 INT
[3]   Speech synthesis from neural decoding of spoken sentences [J].
Anumanchipalli, Gopala K. ;
Chartier, Josh ;
Chang, Edward F. .
NATURE, 2019, 568 (7753) :493-+
[4]   AAC for adults with acquired neurological conditions: A review [J].
Beukelman, David R. ;
Fager, Susan ;
Ball, Laura ;
Dietz, Aimee .
AUGMENTATIVE AND ALTERNATIVE COMMUNICATION, 2007, 23 (03) :230-242
[5]   Functional organization of human sensorimotor cortex for speech articulation [J].
Bouchard, Kristofer E. ;
Mesgarani, Nima ;
Johnson, Keith ;
Chang, Edward F. .
NATURE, 2013, 495 (7441) :327-332
[6]   Brain-Computer Interfaces for Augmentative and Alternative Communication: A Tutorial [J].
Brumberg, Jonathan S. ;
Pitt, Kevin M. ;
Mantie-Kozlowski, Alana ;
Burnison, Jeremy D. .
AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2018, 27 (01) :1-12
[7]   Classification of intended phoneme production from chronic intracortical microelectrode recordings in speech-motor cortex [J].
Brumberg, Jonathan S. ;
Wright, E. Joe ;
Andreasen, Dinal S. ;
Guenther, Frank H. ;
Kennedy, Philip R. .
FRONTIERS IN NEUROSCIENCE, 2011, 5 :1-12
[8]  
Chao Zenas C, 2010, Front Neuroeng, V3, P3, DOI 10.3389/fneng.2010.00003
[9]   Encoding of Articulatory Kinematic Trajectories in Human Speech Sensorimotor Cortex [J].
Chartier, Josh ;
Anumanchipalli, Gopala K. ;
Johnson, Keith ;
Chang, Edward F. .
NEURON, 2018, 98 (05) :1042-+
[10]   An empirical study of smoothing techniques for language modeling [J].
Chen, SF ;
Goodman, J .
COMPUTER SPEECH AND LANGUAGE, 1999, 13 (04) :359-394