A Facial Electromyography Activity Detection Method in Silent Speech Recognition

被引:4
作者
Cai, Huihui [1 ,2 ]
Zhang, Yakun [2 ,3 ]
Xie, Liang [2 ,3 ]
Yan, Huijiong [2 ]
Qin, Wei [2 ,3 ]
Yan, Ye [2 ,3 ]
Yin, Erwei [1 ,2 ,3 ]
Xu, Minpeng [1 ]
Ming, Dong [1 ]
机构
[1] Tianjin Univ, Sch Acad Med Engn & Translat Med, Tianjin, Peoples R China
[2] Tianjin Artif Intelligence Innoat Ctr TAIIC, Tianjin, Peoples R China
[3] Acad Mil Sci AMS, Def Innovat Inst, Beijing, Peoples R China
来源
2021 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS) | 2021年
关键词
Silent speech recognition; Spectral subtraction; Backtracking; Activity detection; CNN-BiGRU; EMG;
D O I
10.1109/HPBDIS53214.2021.9658469
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Silent speech recognition (SSR) is a new application of human-computer interaction based on electromyography (EMG), which solves the limitation of acoustic signal dependence. In a low signal-to-noise ratio (SNR) environment, traditional methods cannot accurately segment the EMG active signal. This paper proposes an energy detection method based on spectral subtraction backtracking for detecting EMG active signals to assist silent speech recognition. The experiments are mainly based on energy detection. In addition to the energy detection, the spectral subtraction method is also used to improve the SNR and the accuracy of the endpoint information. Then, the experiments propose a backtracking method to make up for the deficiency of spectral subtraction. Finally, this paper adopts an end-to-end network model, which takes the pre-trained model of CNN as the front end, and the bidirectional gate recurrent unit (Bi-GRU) as the back end for classification. Experimental results show that the proposed activity detection method in this paper is more accurate than others in the low SNR environment.
引用
收藏
页码:246 / 251
页数:6
相关论文
共 18 条
  • [1] Bengacemi H, 2017, INT CONF SYST CONTRO, P409, DOI 10.1109/ICoSC.2017.7958651
  • [2] A new detection method for EMG activity monitoring
    Bengacemi, Hichem
    Abed-Meraim, Karim
    Buttelli, Olivier
    Ouldali, Abdelaziz
    Mesloub, Ammar
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2020, 58 (02) : 319 - 334
  • [3] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    BOLL, SF
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
  • [4] [成娟 Cheng Juan], 2016, [电子学报, Acta Electronica Sinica], V44, P479
  • [5] Improved Speech Reconstruction from Silent Video
    Ephrat, Ariel
    Halperin, Tavi
    Peleg, Shmuel
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 455 - 462
  • [6] Gong Y, 2016, Robust Automatic Speech Recognition
  • [7] EEG-EMG, MEG-EMG and EMG-EMG frequency analysis: physiological principles and clinical applications
    Grosse, P
    Cassidy, MJ
    Brown, P
    [J]. CLINICAL NEUROPHYSIOLOGY, 2002, 113 (10) : 1523 - 1531
  • [8] Syllable-Based Speech Recognition Using EMG
    Lopez-Larraz, Eduardo
    Mozos, Oscar M.
    Antelis, Javier M.
    Minguez, Javier
    [J]. 2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 4699 - 4702
  • [9] Meltzner GS, 2011, IEEE ENG MED BIO, P4848, DOI 10.1109/IEMBS.2011.6091201
  • [10] Nassimi Sami, 2014, International Journal of Advanced Research in Artificial Intelligence, V3, P1