Decoding Imagined and Spoken Phrases From Non-invasive Neural (MEG) Signals

被引:74
作者
Dash, Debadatta [1 ,2 ]
Ferrari, Paul [3 ,4 ]
Wang, Jun [2 ,5 ]
机构
[1] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
[2] Univ Texas Austin, Med Sch, Dept Neurol, Austin, TX 78712 USA
[3] Childrens Med Ctr, MEG Lab, Austin, TX USA
[4] Univ Texas Austin, Dept Psychol, Austin, TX 78712 USA
[5] Univ Texas Austin, Dept Commun Sci & Disorders, Austin, TX 78712 USA
基金
美国国家卫生研究院;
关键词
MEG; speech; brain-computer interface; wavelet; convolutional neural network; neural technology; SPEECH; BRAIN; FMRI; MAGNETOENCEPHALOGRAPHY; CLASSIFICATION; IDENTIFICATION; NETWORKS; MODELS;
D O I
10.3389/fnins.2020.00290
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Speech production is a hierarchical mechanism involving the synchronization of the brain and the oral articulators, where the intention of linguistic concepts is transformed into meaningful sounds. Individuals with locked-in syndrome (fully paralyzed but aware) lose their motor ability completely including articulation and even eyeball movement. The neural pathway may be the only option to resume a certain level of communication for these patients. Current brain-computer interfaces (BCIs) use patients' visual and attentional correlates to build communication, resulting in a slow communication rate (a few words per minute). Direct decoding of imagined speech from the neural signals (and then driving a speech synthesizer) has the potential for a higher communication rate. In this study, we investigated the decoding of five imagined and spoken phrases from single-trial, non-invasive magnetoencephalography (MEG) signals collected from eight adult subjects. Two machine learning algorithms were used. One was an artificial neural network (ANN) with statistical features as the baseline approach. The other was convolutional neural networks (CNNs) applied on the spatial, spectral and temporal features extracted from the MEG signals. Experimental results indicated the possibility to decode imagined and spoken phrases directly from neuromagnetic signals. CNNs were found to be highly effective with an average decoding accuracy of up to 93% for the imagined and 96% for the spoken phrases.
引用
收藏
页数:15
相关论文
共 79 条
[31]   EEG classification of imagined syllable rhythm using Hilbert spectrum methods [J].
Deng, Siyi ;
Srinivasan, Ramesh ;
Lappas, Tom ;
D'Zmura, Michael .
JOURNAL OF NEURAL ENGINEERING, 2010, 7 (04)
[32]   Who Is Saying "What"? Brain-Based Decoding of Human Voice and Speech [J].
Formisano, Elia ;
De Martino, Federico ;
Bonte, Milene ;
Goebel, Rainer .
SCIENCE, 2008, 322 (5903) :970-973
[33]   Repetition and the brain: neural models of stimulus-specific effects [J].
Grill-Spector, K ;
Henson, R ;
Martin, A .
TRENDS IN COGNITIVE SCIENCES, 2006, 10 (01) :14-23
[34]   A Wireless Brain-Machine Interface for Real-Time Speech Synthesis [J].
Guenther, Frank H. ;
Brumberg, Jonathan S. ;
Wright, E. Joseph ;
Nieto-Castanon, Alfonso ;
Tourville, Jason A. ;
Panko, Mikhail ;
Law, Robert ;
Siebert, Steven A. ;
Bartels, Jess L. ;
Andreasen, Dinal S. ;
Ehirim, Princewill ;
Mao, Hui ;
Kennedy, Philip R. .
PLOS ONE, 2009, 4 (12)
[35]   Comparing Features for Classification of MEG Responses to Motor Imagery [J].
Halme, Hanna-Leena ;
Parkkonen, Lauri .
PLOS ONE, 2016, 11 (12)
[36]   Deep Learning Approach for Automatic Classification of Ocular and Cardiac Artifacts in MEG Data [J].
Hasasneh, Ahmad ;
Kampel, Nikolas ;
Sripad, Praveen ;
Shah, N. Jon ;
Dammers, Juergen .
JOURNAL OF ENGINEERING, 2018, 2018
[37]   Word-Based Classification of Imagined Speech Using EEG [J].
Hashim, Noramiza ;
Ali, Aziah ;
Mohd-Isa, Wan-Noorshahida .
COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 :195-204
[38]   Comparing the Performance of Popular MEG/EEG Artifact Correction Methods in an Evoked-Response Study [J].
Haumann, Niels Trusbak ;
Parkkonen, Lauri ;
Kliuchko, Marina ;
Vuust, Peter ;
Brattico, Elvira .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
[39]   Brain-to-text: decoding spoken phrases from phone representations in the brain [J].
Herff, Christian ;
Heger, Dominic ;
de Pesters, Adriana ;
Telaar, Dominic ;
Brunner, Peter ;
Schalk, Gerwin ;
Schultz, Tanja .
FRONTIERS IN NEUROSCIENCE, 2015, 9
[40]  
Huang ZB, 2019, 2019 WORLD ROBOT CONFERENCE SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA 2019), P354, DOI [10.1109/wrc-sara.2019.8931958, 10.1109/WRC-SARA.2019.8931958]