SPEECH FOUNDATION MODELS ON INTELLIGIBILITY PREDICTION FOR HEARING-IMPAIRED LISTENERS

Authors
Cuervo, Santiago [1]
Marxer, Ricard [1]
Affiliations
[1] Aix Marseille Univ, Univ Toulon, CNRS, LIS, Marseille, France
Keywords
Foundation models; speech perception; intelligibility prediction; hearing aids
DOI
10.1109/ICASSP48485.2024.10447907
Abstract
Speech foundation models (SFMs) have been benchmarked on many speech processing tasks, often achieving state-of-the-art performance with minimal adaptation. However, the SFM paradigm has been significantly less explored for applications of interest to the speech perception community. In this paper, we present a systematic evaluation of 10 SFMs on one such application: speech intelligibility prediction. We focus on the non-intrusive setup of the Clarity Prediction Challenge 2 (CPC2), where the task is to predict the percentage of words correctly perceived by hearing-impaired listeners from speech-in-noise recordings. To approach the problem, we propose a simple method that learns a lightweight specialized prediction head on top of frozen SFMs. Our results reveal statistically significant differences in performance across SFMs. Our method resulted in the winning submission to CPC2, demonstrating its promise for speech perception applications.
Pages: 1421-1425
Page count: 5