SESSION-INDEPENDENT EMG-BASED SPEECH RECOGNITION

被引:0
|
作者
Wand, Michael [1 ]
Schultz, Tanja [1 ]
机构
[1] Karlsruhe Inst Technol, Cognit Syst Lab, D-76131 Karlsruhe, Germany
来源
关键词
Electromyography; Silent Speech Interfaces; EMG-based Speech Recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper reports on our recent research in speech recognition by surface electromyography (EMG), which is the technology of recording the electric activation potentials of the human articulatory muscles by surface electrodes in order to recognize speech. This method can be used to create Silent Speech Interfaces, since the EMG signal is available even when no audible signal is transmitted or captured. Several past studies have shown that EMG signals may vary greatly between different recording sessions, even of one and the same speaker. This paper shows that session-independent training methods may be used to obtain robust EMG-based speech recognizers which cope well with unseen recording sessions as well as with speaking mode variations. Our best session-independent recognition system, trained on 280 utterances of 7 different sessions, achieves an average 21.93% Word Error Rate (WER) on a testing vocabulary of 108 words. The overall best session-adaptive recognition system, based on a session-independent system and adapted towards the test session with 40 adaptation sentences, achieves an average WER of 15.66%, which is a relative improvement of 21% compared to the baseline average WER of 19.96% of a session-dependent recognition system trained only on a single session of 40 sentences.
引用
收藏
页码:295 / 300
页数:6
相关论文
共 50 条
  • [1] Domain-Adversarial Training for Session Independent EMG-based Speech Recognition
    Wand, Michael
    Schultz, Tanja
    Schmidhuber, Jurgen
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3167 - 3171
  • [2] ANALYSIS OF PHONE CONFUSION IN EMG-BASED SPEECH RECOGNITION
    Wand, Michael
    Schultz, Tanja
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 757 - 760
  • [3] Modeling coarticulation in EMG-based continuous speech recognition
    Schultz, Tanja
    Wand, Michael
    SPEECH COMMUNICATION, 2010, 52 (04) : 341 - 353
  • [4] Impact of Different Feedback Mechanisms in EMG-based Speech Recognition
    Herff, Christian
    Janke, Matthias
    Wand, Michael
    Schultz, Tanja
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2224 - 2227
  • [5] A Spectral Mapping Method for EMG-based Recognition of Silent Speech
    Janke, Matthias
    Wand, Michael
    Schultz, Tanja
    BIO-INSPIRED HUMAN- MACHINE INTERFACES AND HEALTHCARE APPLICATIONS, 2010, : 22 - 31
  • [6] Tackling Speaking Mode Varieties in EMG-Based Speech Recognition
    Wand, Michael
    Janke, Matthias
    Schultz, Tanja
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2014, 61 (10) : 2515 - 2526
  • [7] EMG-based speech recognition using dimensionality reduction methods
    Anat Ratnovsky
    Sarit Malayev
    Shahar Ratnovsky
    Sara Naftali
    Neta Rabin
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 597 - 607
  • [8] Multi-stream HMM for EMG-based speech recognition
    Manabe, H
    Zhang, Z
    PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2004, 26 : 4389 - 4392
  • [9] EMG-based speech recognition using dimensionality reduction methods
    Ratnovsky, Anat
    Malayev, Sarit
    Ratnovsky, Shahar
    Naftali, Sara
    Rabin, Neta
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (1) : 597 - 607
  • [10] Impact of Different Speaking Modes on EMG-based Speech Recognition
    Wand, Michael
    Jou, Szu-Chen Stan
    Toth, Arthur R.
    Schultz, Tanja
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 640 - +