A model of speech recognition for hearing-impaired listeners based on deep learning

Cited by: 8
Authors
Rossbach, Jana [1]
Kollmeier, Birger [2]
Meyer, Bernd T. [1]
Affiliations
[1] Carl von Ossietzky Univ Oldenburg, Commun Acoust & Cluster Excellence Hearing4all, D-26111 Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Med Phys & Cluster Excellence Hearing4all, D-26111 Oldenburg, Germany
Keywords
INTELLIGIBILITY INDEX; RECEPTION THRESHOLD; FLUCTUATING NOISE; PREDICTION; ENVELOPE; PERCEPTION; MODULATION; ALGORITHM; MASKING;
DOI
10.1121/10.0009411
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
Automatic speech recognition (ASR) has made major progress based on deep machine learning, which motivated the use of deep neural networks (DNNs) as perception models and specifically to predict human speech recognition (HSR). This study investigates whether a modeling approach based on a DNN that serves as a phoneme classifier [Spille, Ewert, Kollmeier, and Meyer (2018). Comput. Speech Lang. 48, 51-66] can predict HSR for subjects with different degrees of hearing loss when listening to speech embedded in different complex noises. The eight noise signals range from simple stationary noise to a single competing talker and are added to matrix sentences, which are presented to 20 hearing-impaired (HI) listeners (categorized into three groups with different types of age-related hearing loss) to measure their speech recognition threshold (SRT), i.e., the signal-to-noise ratio at which the word recognition rate is 50%. These are compared to responses obtained from the ASR-based model using degraded feature representations that take into account the individual hearing loss of the participants captured by a pure-tone audiogram. Additionally, SRTs obtained from eight normal-hearing (NH) listeners are analyzed. For NH subjects and three groups of HI listeners, the average SRT prediction error is below 2 dB, which is lower than the errors of the baseline models. © 2022 Author(s).
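The SRT definition in the abstract (the signal-to-noise ratio at which the word recognition rate is 50%) can be illustrated with a minimal sketch. The abstract does not say how the 50% point or the prediction error is computed, so the linear interpolation, the mean absolute error, the function names, and all numbers below are assumptions made for illustration only, not the authors' procedure or data.

```python
from typing import Sequence


def estimate_srt(snrs_db: Sequence[float], rates: Sequence[float],
                 target: float = 0.5) -> float:
    """Interpolate the SNR (dB) at which the recognition rate reaches `target`.

    Assumption: plain linear interpolation between measured points; the
    study may use an adaptive procedure or a fitted psychometric function.
    """
    points = sorted(zip(snrs_db, rates))  # order by SNR
    for (s0, r0), (s1, r1) in zip(points, points[1:]):
        if r0 <= target <= r1 and r1 != r0:
            return s0 + (target - r0) * (s1 - s0) / (r1 - r0)
    raise ValueError("target rate is not bracketed by the data")


def mean_abs_error(predicted: Sequence[float], measured: Sequence[float]) -> float:
    """Mean absolute difference (dB) between predicted and measured SRTs.

    Assumption: the abstract only says "average SRT prediction error";
    the exact error measure is not given there.
    """
    return sum(abs(p - m) for p, m in zip(predicted, measured)) / len(measured)


# Hypothetical word recognition rates at four SNRs (not data from the study).
snrs = [-12.0, -8.0, -4.0, 0.0]
rates = [0.10, 0.30, 0.70, 0.95]
srt = estimate_srt(snrs, rates)

# Hypothetical model vs. listener SRTs for two noise conditions.
error = mean_abs_error([-6.0, -3.0], [-7.0, -2.5])
```

With these made-up values, the 50% point lies between the -8 dB and -4 dB measurements, and the error is the average of the two per-condition deviations.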
Pages: 1417-1427
Page count: 11