Biosignal-Based Spoken Communication: A Survey

被引:127
作者
Schultz, Tanja [1 ]
Wand, Michael [2 ]
Hueber, Thomas [3 ]
Krusienski, Dean J. [4 ]
Herff, Christian [1 ]
Brumberg, Jonathan S. [5 ]
机构
[1] Univ Bremen, Cognit Syst Lab, Fac Comp Sci & Math, D-28359 Bremen, Germany
[2] Ist Dalle Molle Intelligenza Artificiale, Swiss AI Lab, CH-6928 Manno, Switzerland
[3] Grenoble Alpes Univ, CNRS, GIPSA Lab, F-38402 Grenoble, France
[4] Old Dominion Univ, Biomed Engn Inst, ASPEN Lab, Norfolk, VA 23529 USA
[5] Univ Kansas, Speech Language Hearing Dept, Speech & Appl Neurosci Lab, Lawrence, KS 66045 USA
基金
美国国家科学基金会; 欧盟地平线“2020”; 美国国家卫生研究院;
关键词
Biosignals; spoken communication; multimodal technologies; speech recognition and synthesis; speech rehabilitation; electromyography; ultrasound; functional near-infrared spectroscopy; electroencephalography; electrocorticography; SPEECH RECOGNITION; CLASSIFICATION; RECORDINGS; VOICE; TIME; ARTICULOGRAPHY; CONVERSION; INTERFACES; MOVEMENTS; SIGNALS;
D O I
10.1109/TASLP.2017.2752365
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech is a complex process involving a wide range of biosignals, including but not limited to acoustics. These biosignals-stemming from the articulators, the articulator muscle activities, the neural pathways, and the brain itself-can be used to circumvent limitations of conventional speech processing in particular, and to gain insights into the process of speech production in general. Research on biosignal-based speech processing is a wide and very active field at the intersection of various disciplines, ranging from engineering, computer science, electronics and machine learning to medicine, neuroscience, physiology, and psychology. Consequently, a variety of methods and approaches have been used to investigate the common goal of creating biosignal-based speech processing devices for communication applications in everyday situations and for speech rehabilitation, as well as gaining a deeper understanding of spoken communication. This paper gives an overview of the various modalities, research approaches, and objectives for biosignal-based spoken communication.
引用
收藏
页码:2257 / 2271
页数:15
相关论文
共 160 条
[1]   Learning Dynamic Stream Weights For Coupled-HMM-Based Audio-Visual Speech Recognition [J].
Abdelaziz, Ahmed Hussen ;
Zeiler, Steffen ;
Kolossa, Dorothea .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (05) :863-876
[2]   Inner Speech: Development, Cognitive Functions, Phenomenology, and Neurobiology [J].
Alderson-Day, Ben ;
Fernyhough, Charles .
PSYCHOLOGICAL BULLETIN, 2015, 141 (05) :931-965
[3]  
[Anonymous], 1993, Discrete-Time Processing of Speech Signals
[4]  
[Anonymous], 2005, Electric fields of the brain: The neurophysics of eeg
[5]  
[Anonymous], 2004, ELECTROMYOGRAPHY PHY
[6]  
[Anonymous], 2016, FDN AUGMENTED COGNIT
[7]  
[Anonymous], 2015, 2015 INT JOINT C NEU
[8]  
[Anonymous], 2005, THESIS MIT CAMBRIDGE, DOI DOI 10.3115/1613984.1614005
[9]  
[Anonymous], 2011, P INT C FLOR IT 27 3
[10]  
[Anonymous], 2009, P INTERSPEECH