A model of speech recognition for hearing-impaired listeners based on deep learning

被引:8
|
作者
Rossbach, Jana [1 ]
Kollmeier, Birger [2 ]
Meyer, Bernd T. [1 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Commun Acoust & Cluster Excellence Hearing4all, D-26111 Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Med Phys & Cluster Excellence Hearing4all, D-26111 Oldenburg, Germany
关键词
INTELLIGIBILITY INDEX; RECEPTION THRESHOLD; FLUCTUATING NOISE; PREDICTION; ENVELOPE; PERCEPTION; MODULATION; ALGORITHM; MASKING;
D O I
10.1121/10.0009411
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic speech recognition (ASR) has made major progress based on deep machine learning, which motivated the use of deep neural networks (DNNs) as perception models and specifically to predict human speech recognition (HSR). This study investigates if a modeling approach based on a DNN that serves as phoneme classifier [Spille, Ewert, Kollmeier, and Meyer (2018). Comput. Speech Lang. 48, 51-66] can predict HSR for subjects with different degrees of hearing loss when listening to speech embedded in different complex noises. The eight noise signals range from simple stationary noise to a single competing talker and are added to matrix sentences, which are presented to 20 hearing-impaired (HI) listeners (categorized into three groups with different types of age-related hearing loss) to measure their speech recognition threshold (SRT), i.e., the signal-to-noise ratio with 50% word recognition rate. These are compared to responses obtained from the ASR-based model using degraded feature representations that take into account the individual hearing loss of the participants captured by a pure-tone audiogram. Additionally, SRTs obtained from eight normal-hearing (NH) listeners are analyzed. For NH subjects and three groups of HI listeners, the average SRT prediction error is below 2 dB, which is lower than the errors of the baseline models. (C) 2022 Authos(s).
引用
收藏
页码:1417 / 1427
页数:11
相关论文
共 50 条
  • [21] An effectively causal deep learning algorithm to increase intelligibility in untrained noises for hearing-impaired listeners
    Healy, Eric W.
    Tan, Ke
    Johnson, Eric M.
    Wang, DeLiang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 149 (06) : 3943 - 3953
  • [22] Predicting Speech Intelligibility by Individual Hearing-Impaired Listeners: The Path Forward Conclusion
    Grant, Ken W.
    Bernstein, Joshua G. W.
    Summers, Van
    JOURNAL OF THE AMERICAN ACADEMY OF AUDIOLOGY, 2013, 24 (04) : 329 - 336
  • [23] Characterizing the Speech Reception Threshold in hearing-impaired listeners in relation to masker type and masker level
    Rhebergen, Koenraad S.
    Pool, Ruben E.
    Dreschler, Wouter A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (03) : 1491 - 1505
  • [24] Binaural temporal fine structure sensitivity, cognitive function, and spatial speech recognition of hearing-impaired listeners (L)
    Neher, Tobias
    Lunner, Thomas
    Hopkins, Kathryn
    Moore, Brian C. J.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (04) : 2561 - 2564
  • [25] Identification of the Spectrotemporal Modulations That Support Speech Intelligibility in Hearing-Impaired and Normal-Hearing Listeners
    Venezia, Jonathan H.
    Martin, Allison-Graham
    Hickok, Gregory
    Richards, Virginia M.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2019, 62 (04): : 1051 - 1067
  • [26] Can basic auditory and cognitive measures predict hearing-impaired listeners' localization and spatial speech recognition abilities?
    Neher, Tobias
    Laugesen, Soren
    Jensen, Niels Sogaard
    Kragelund, Louise
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (03) : 1542 - 1558
  • [27] Effects of reverberation and noise on speech intelligibility in normal-hearing and aided hearing-impaired listeners
    Xia, Jing
    Xu, Buye
    Pentony, Shareka
    Xu, Jingjing
    Swaminathan, Jayaganesh
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (03) : 1523 - 1533
  • [28] Modulation detection by normal and hearing-impaired listeners
    Lacher-Fougère, S
    Demany, L
    AUDIOLOGY, 1998, 37 (02): : 109 - 121
  • [29] Hearing Sensitivity to Gliding Rippled Spectra in Hearing-Impaired Listeners
    Nechaev, Dmitry
    Milekhina, Olga
    Tomozova, Marina
    Supin, Alexander
    AUDIOLOGY RESEARCH, 2024, 14 (06) : 928 - 938
  • [30] Behavioral measures of cochlear compression and temporal resolution as predictors of speech masking release in hearing-impaired listeners
    Gregan, Melanie J.
    Nelson, Peggy B.
    Oxenham, Andrew J.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (04) : 2895 - 2912