Domain-Adversarial Training for Session Independent EMG-based Speech Recognition

被引:20
作者
Wand, Michael [1 ,2 ]
Schultz, Tanja [3 ]
Schmidhuber, Jurgen [1 ,2 ]
机构
[1] USI, Ist Dalle Molle Studi Intelligenza Artificiale ID, Manno Lugano, Switzerland
[2] SUPSI, Manno Lugano, Switzerland
[3] Univ Bremen, Bremen, Germany
来源
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018年
基金
欧盟地平线“2020”;
关键词
Silent Speech interface; Neural Networks; EMG-based Speech Recognition; Domain Adaptation; COMMUNICATION;
D O I
10.21437/Interspeech.2018-2318
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present our research on continuous speech recognition based on Surface Electromyography (EMG), where speech information is captured by electrodes attached to the speaker's face. This method allows speech processing without requiring that an acoustic signal is present; however, reattachment of the EMG electrodes causes subtle changes in the recorded signal, which degrades the recognition accuracy and thus poses a major challenge for practical application of the system. Based on the growing body of recent work in domain-adversarial training of neural networks, we present a system which adapts the neural network frontend of our recognizer to data from a new recording session, without requiring supervised enrollment.
引用
收藏
页码:3167 / 3171
页数:5
相关论文
共 37 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2016, P ICML
[3]  
[Anonymous], PLOS COMPUTATIONAL B
[4]  
[Anonymous], SPEECH COMMUNICATION
[5]  
[Anonymous], P INTERSPEECH
[6]  
[Anonymous], P INTERSPEECH
[7]  
[Anonymous], P NIPS
[8]  
[Anonymous], 2014, CORR
[9]  
[Anonymous], 2016, P ACCV
[10]  
[Anonymous], 2016, ARXIV160806019