Surface Electromyography-Based Recognition, Synthesis, and Perception of Prosodic Subvocal Speech

被引:15
|
作者
Vojtech, Jennifer M. [1 ,2 ]
Chan, Michael D. [1 ]
Shiwani, Bhawna [1 ]
Roy, Serge H. [1 ]
Heaton, James T. [3 ]
Meltzner, Geoffrey S. [4 ]
Contessa, Paola [1 ]
De Luca, Gianluca [1 ]
Patel, Rupal [4 ,5 ]
Kline, Joshua C. [1 ]
机构
[1] Delsys Altec Inc, Natick, MA 01760 USA
[2] Boston Univ, Boston, MA 02215 USA
[3] Massachusetts Gen Hosp, Dept Surg, Boston, MA USA
[4] VocaliD Inc, Belmont, MA USA
[5] Northeastern Univ, Boston, MA 02115 USA
来源
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH | 2021年 / 64卷 / 06期
基金
美国国家卫生研究院;
关键词
QUALITY-OF-LIFE; CLEAR SPEECH; VOCAL FATIGUE; INTELLIGIBILITY; LARYNGEAL; MOVEMENT; SPEAKING; MUSCLES; ACCEPTABILITY; ELECTROLARYNX;
D O I
10.1044/2021_JSLHR-20-00257
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Purpose: This study aimed to evaluate a novel communication system designed to translate surface electromyographic (sEMG) signals from articulatory muscles into speech using a personalized, digital voice. The system was evaluated for word recognition, prosodic classification, and listener perception of synthesized speech. Method: sEMG signals were recorded from the face and neck as speakers with (n = 4) and without (n = 4) laryngectomy subvocally recited (silently mouthed) a speech corpus comprising 750 phrases (150 phrases with variable phraselevel stress). Corpus tokens were then translated into speech via personalized voice synthesis (n = 8 synthetic voices) and compared against phrases produced by each speaker when using their typical mode of communication (n = 4 natural voices, n = 4 electrolaryngeal [EL] voices). Naive listeners (n = 12) evaluated synthetic, natural, and EL speech for acceptability and intelligibility in a visual sort-and-rate task, as well as phrasal stress discriminability via a classification mechanism. Results: Recorded sEMG signals were processed to translate sEMG muscle activity into lexical content and categorize variations in phrase-level stress, achieving a mean accuracy of 96.3% (SD = 3.10%) and 91.2% (SD = 4.46%), respectively. Synthetic speech was significantly higher in acceptability and intelligibility than EL speech, also leading to greater phrasal stress classification accuracy, whereas natural speech was rated as the most acceptable and intelligible, with the greatest phrasal stress classification accuracy. Conclusion: This proof-of-concept study establishes the feasibility of using subvocal sEMG-based alternative communication not only for lexical recognition but also for prosodic communication in healthy individuals, as well as those living with vocal impairments and residual articulatory function.
引用
收藏
页码:2134 / 2153
页数:20
相关论文
共 50 条
  • [1] Surface Electromyography-Based Recognition of Electronic Taste Sensations
    Ullah, Asif
    Zhang, Fengqi
    Song, Zhendong
    Wang, You
    Zhao, Shuo
    Riaz, Waqar
    Li, Guang
    BIOSENSORS-BASEL, 2024, 14 (08):
  • [2] Surface Electromyography-Based Action Recognition and Manipulator Control
    Cao, Tianao
    Liu, Dan
    Wang, Qisong
    Bai, Ou
    Sun, Jinwei
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [3] SurfMyoAiR: A Surface Electromyography-Based Framework for Airwriting Recognition
    Tripathi, Ayush
    Prathosh, A. P.
    Muthukrishnan, Suriya Prakash
    Kumar, Lalan
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [4] A Dataset for Electromyography-based Dactylology Recognition
    Satoshi Kawamoto, Andre Luiz
    Bertolini, Diego
    Barreto, Maisa
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 2376 - 2381
  • [5] Optimization of an Electromyography-Based Activity Recognition System
    Carrino, Francesco
    Ridi, Antonio
    Caon, Maurizio
    Abou Khaled, Omar
    Mugellini, Elena
    2013 ISSNIP BIOSIGNALS AND BIOROBOTICS CONFERENCE (BRC), 2013, : 100 - 105
  • [6] Silent Speech Recognition Based on Surface Electromyography
    Ma, Siyuan
    Jin, Dantong
    Zhang, Ming
    Zhang, Bixuan
    Wang, You
    Li, Guang
    Yang, Meng
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 4497 - 4501
  • [7] From Forearm to Wrist: Deep Learning for Surface Electromyography-Based Gesture Recognition
    He, Jiayuan
    Niu, Xinyue
    Zhao, Penghui
    Lin, Chuang
    Jiang, Ning
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2024, 32 : 102 - 111
  • [8] A Hierarchical View Pooling Network for Multichannel Surface Electromyography-Based Gesture Recognition
    Wei, Wentao
    Hong, Hong
    Wu, Xiaoli
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [9] Surface Electromyography-based Human Motion Pattern Recognition Using Convolutional Neural Networks
    Liu Y.
    Lu Y.
    Ma X.
    Song Q.
    Binggong Xuebao/Acta Armamentarii, 2024, 45 (07): : 2144 - 2158
  • [10] Development of surface electromyography-based motion intention recognition for human-machine interface
    Xu, Tinghan
    Qi, Shichen
    Liang, Yuanhao
    Hu, Yong
    2024 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS, CIVEMSA 2024, 2024,