Spectrotemporal Modulation Sensitivity as a Predictor of Speech Intelligibility for Hearing-Impaired Listeners

被引:84
|
作者
Bernstein, Joshua G. W. [1 ]
Mehraei, Golbarg [2 ]
Shamma, Shihab [3 ]
Gallun, Frederick J. [4 ]
Theodoroff, Sarah M. [4 ]
Leek, Marjorie R. [4 ]
机构
[1] Walter Reed Natl Mil Med Ctr, Sci & Clin Studies Sect, Audiol & Speech Ctr, Bethesda, MD 20889 USA
[2] Harvard MIT Speech & Hearing Biosci & Technol Pro, Cambridge, MA USA
[3] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
[4] Portland VA Med Ctr, VA RR&D Natl Ctr Rehabil Auditory Res, Portland, OR USA
关键词
Fine structure; frequency selectivity; hearing loss; model; modulation; sensorineural; spectral; speech intelligibility; temporal; AUDITORY FILTER SHAPES; FREQUENCY-SELECTIVITY; CARRIER FREQUENCY; NOISE; RECOGNITION; RECEPTION; PERCEPTION; REGIONS; DISCRIMINATION; REPRESENTATION;
D O I
10.3766/jaaa.24.4.5
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Background: A model that can accurately predict speech intelligibility for a given hearing-impaired (HI) listener would be an important tool for hearing-aid fitting or hearing-aid algorithm development. Existing speech-intelligibility models do not incorporate variability in suprathreshold deficits that are not well predicted by classical audiometric measures. One possible approach to the incorporation of such deficits is to base intelligibility predictions on sensitivity to simultaneously spectrally and temporally modulated signals. Purpose: The likelihood of success of this approach was evaluated by comparing estimates of spectrotemporal modulation (STM) sensitivity to speech intelligibility and to psychoacoustic estimates of frequency selectivity and temporal fine-structure (TFS) sensitivity across a group of HI listeners. Research Design: The minimum modulation depth required to detect STM applied to an 86 dB SPL four-octave noise carrier was measured for combinations of temporal modulation rate (4, 12, or 32 Hz) and spectral modulation density (0.5, 1, 2, or 4 cycles/octave). STM sensitivity estimates for individual HI listeners were compared to estimates of frequency selectivity (measured using the notched-noise method at 500, 1000, 2000, and 4000 Hz), TFS processing ability (2 Hz frequency-modulation detection thresholds for 500, 1000, 2000, and 4000 Hz carriers) and sentence intelligibility in noise (at a 0 dB signal-to-noise ratio) that were measured for the same listeners in a separate study. Study Sample: Eight normal-hearing (NH) listeners and 12 listeners with a diagnosis of bilateral sensorineural hearing loss participated. Data Collection and Analysis: STM sensitivity was compared between NH and HI listener groups using a repeated-measures analysis of variance. A stepwise regression analysis compared STM sensitivity for individual HI listeners to audiometric thresholds, age, and measures of frequency selectivity and TFS processing ability. A second stepwise regression analysis compared speech intelligibility to STM sensitivity and the audiogrann-based Speech Intelligibility Index. Results: STM detection thresholds were elevated for the HI listeners, but only for low rates and high densities. STM sensitivity for individual HI listeners was well predicted by a combination of estimates of frequency selectivity at 4000 Hz and TFS sensitivity at 500 Hz but was unrelated to audiometric thresholds. STM sensitivity accounted for an additional 40% of the variance in speech intelligibility beyond the 40% accounted for by the audibility-based Speech Intelligibility Index. Conclusions: Impaired STM sensitivity likely results from a combination of a reduced ability to resolve spectral peaks and a reduced ability to use TFS information to follow spectral-peak movements. Combining STM sensitivity estimates with audiometric threshold measures for individual HI listeners provided a more accurate prediction of speech intelligibility than audiometric measures alone. These results suggest a significant likelihood of success for an STM-based model of speech intelligibility for HI listeners.
引用
收藏
页码:293 / 306
页数:14
相关论文
共 50 条
  • [41] Measuring the Effects of Reverberation and Noise on Sentence Intelligibility for Hearing-Impaired Listeners
    George, Erwin L. J.
    Goverts, S. Theo
    Festen, Joost M.
    Houtgast, Tammo
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2010, 53 (06): : 1429 - 1439
  • [42] Effects of Noise Reduction on AM Perception for Hearing-Impaired Listeners
    Ives, D. Timothy
    Kalluri, Sridhar
    Strelcyk, Olaf
    Sheft, Stanley
    Miermont, Franck
    Coez, Arnaud
    Bizaguet, Eric
    Lorenzi, Christian
    JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2014, 15 (05): : 839 - 848
  • [43] A model of speech recognition for hearing-impaired listeners based on deep learning
    Rossbach, Jana
    Kollmeier, Birger
    Meyer, Bernd T.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (03) : 1417 - 1427
  • [44] Intelligibility of interrupted sentences at subsegmental levels in young normal-hearing and elderly hearing-impaired listeners
    Lee, Jae Hee
    Kewley-Port, Diane
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (02) : 1153 - 1163
  • [45] Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners
    Tu, Zehai
    Ma, Ning
    Barker, Jon
    INTERSPEECH 2022, 2022, : 3488 - 3492
  • [46] An algorithm to improve speech recognition in noise for hearing-impaired listeners
    Healy, Eric W.
    Yoho, Sarah E.
    Wang, Yuxuan
    Wang, DeLiang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (04) : 3029 - 3038
  • [47] Temporal Fine-Structure Coding and Lateralized Speech Perception in Normal-Hearing and Hearing-Impaired Listeners
    Locsei, Gusztav
    Pedersen, Julie H.
    Laugesen, Soren
    Santurette, Sebastien
    Dau, Torsten
    MacDonald, Ewen N.
    TRENDS IN HEARING, 2016, 20
  • [48] Modeling speech intelligibility in quiet and noise in listeners with normal and impaired hearing
    Rhebergen, Koenraad S.
    Lyzenga, Johannes
    Dreschler, Wouter A.
    Festen, Joost M.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (03) : 1570 - 1583
  • [49] Intelligibility of synthesized voice messages in commercial truck cab noise for normal-hearing and hearing-impaired listeners
    Morrison H.B.
    Casali J.G.
    International Journal of Speech Technology, 1997, 2 (1) : 33 - 44
  • [50] The binaural intelligibility level difference in hearing-impaired listeners: The role of supra-threshold deficits
    Goverts, S. Theo
    Houtgast, Tammo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (05) : 3073 - 3084