Optical Microphone-Based Speech Reconstruction System With Deep Learning for Individuals With Hearing Loss

Cited by: 0
Authors
Lin, Yu-Min [1 ]
Han, Ji-Yan [1 ]
Lin, Cheng-Hung [2 ]
Lai, Ying-Hui [3 ,4 ]
Affiliations
[1] Natl Yang Ming Chiao Tung Univ, Dept Biomed Engn, Taipei, Taiwan
[2] Natl Taiwan Normal Univ, Dept Elect Engn, Taipei, Taiwan
[3] Natl Yang Ming Chiao Tung Univ, Dept Biomed Engn, Taipei 112304, Taiwan
[4] Natl Yang Ming Chiao Tung Univ, Med Device Innovat & Translat Ctr, Taipei 112304, Taiwan
Keywords
Deep learning; Lasers; Doppler effect; laser Doppler vibrometer; speech enhancement; MEAN-SQUARE ERROR; NEURAL-NETWORKS; ENHANCEMENT; NOISE; INTELLIGIBILITY; AMPLIFICATION; OSCILLATIONS; RECOGNITION; PERCEPTION; FEATURES
DOI
10.1109/TBME.2023.3285437
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Objective: Although many speech enhancement (SE) algorithms have been proposed to promote speech perception in hearing-impaired patients, the conventional SE approaches that perform well under quiet and/or stationary noises fail under nonstationary noises and/or when the speaker is at a considerable distance. Therefore, the objective of this study is to overcome the limitations of the conventional speech enhancement approaches. Method: This study proposes a speaker-closed deep learning-based SE method together with an optical microphone to acquire and enhance the speech of a target speaker. Results: The objective evaluation scores achieved by the proposed method outperformed the baseline methods by a margin of 0.21-0.27 and 0.34-0.64 in speech quality (HASQI) and speech comprehension/intelligibility (HASPI), respectively, for seven typical hearing loss types. Conclusion: The results suggest that the proposed method can enhance speech perception by cutting off noise from speech signals and mitigating interference caused by distance. Significance: The results of this study show a potential way that can help improve the listening experience in enhancing speech quality and speech comprehension/intelligibility for hearing-impaired people.
Pages: 3330-3341
Page count: 12