A Facial Electromyography Activity Detection Method in Silent Speech Recognition

被引：4

作者：

Cai, Huihui ^{[1
,2
]}

Zhang, Yakun ^{[2
,3
]}

Xie, Liang ^{[2
,3
]}

Yan, Huijiong ^{[2
]}

Qin, Wei ^{[2
,3
]}

Yan, Ye ^{[2
,3
]}

Yin, Erwei ^{[1
,2
,3
]}

Xu, Minpeng ^{[1
]}

Ming, Dong ^{[1
]}

机构：

[1] Tianjin Univ, Sch Acad Med Engn & Translat Med, Tianjin, Peoples R China

[2] Tianjin Artif Intelligence Innoat Ctr TAIIC, Tianjin, Peoples R China

[3] Acad Mil Sci AMS, Def Innovat Inst, Beijing, Peoples R China

来源：

2021 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS) | 2021年

关键词：

Silent speech recognition; Spectral subtraction; Backtracking; Activity detection; CNN-BiGRU; EMG;

D O I：

10.1109/HPBDIS53214.2021.9658469

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Silent speech recognition (SSR) is a new application of human-computer interaction based on electromyography (EMG), which solves the limitation of acoustic signal dependence. In a low signal-to-noise ratio (SNR) environment, traditional methods cannot accurately segment the EMG active signal. This paper proposes an energy detection method based on spectral subtraction backtracking for detecting EMG active signals to assist silent speech recognition. The experiments are mainly based on energy detection. In addition to the energy detection, the spectral subtraction method is also used to improve the SNR and the accuracy of the endpoint information. Then, the experiments propose a backtracking method to make up for the deficiency of spectral subtraction. Finally, this paper adopts an end-to-end network model, which takes the pre-trained model of CNN as the front end, and the bidirectional gate recurrent unit (Bi-GRU) as the back end for classification. Experimental results show that the proposed activity detection method in this paper is more accurate than others in the low SNR environment.

引用

页码：246 / 251

页数：6

共 18 条

[1]

Bengacemi H, 2017, INT CONF SYST CONTRO, P409, DOI 10.1109/ICoSC.2017.7958651

[2] A new detection method for EMG activity monitoring [J].

Bengacemi, Hichem ;

Abed-Meraim, Karim ;

Buttelli, Olivier ;

Ouldali, Abdelaziz ;

Mesloub, Ammar .

MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2020, 58 (02) :319-334

[3] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].

BOLL, SF .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120

[4]

[成娟 Cheng Juan], 2016, [电子学报, Acta Electronica Sinica], V44, P479

[5] Improved Speech Reconstruction from Silent Video [J].

Ephrat, Ariel ;

Halperin, Tavi ;

Peleg, Shmuel .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :455-462

[6] EEG-EMG, MEG-EMG and EMG-EMG frequency analysis: physiological principles and clinical applications [J].

Grosse, P ;

Cassidy, MJ ;

Brown, P .

CLINICAL NEUROPHYSIOLOGY, 2002, 113 (10) :1523-1531

[7]

Li J., 2016, Robust Automatic Speech Recognition

[8] Syllable-Based Speech Recognition Using EMG [J].

Lopez-Larraz, Eduardo ;

Mozos, Oscar M. ;

Antelis, Javier M. ;

Minguez, Javier .

2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, :4699-4702

[9]

Meltzner GS, 2011, IEEE ENG MED BIO, P4848, DOI 10.1109/IEMBS.2011.6091201

[10]

Nassimi Sami, 2014, International Journal of Advanced Research in Artificial Intelligence, V3, P1

← 1 2 →