SPEECH FOUNDATION MODELS ON INTELLIGIBILITY PREDICTION FOR HEARING-IMPAIRED LISTENERS

被引:0
|
作者
Cuervo, Santiago [1 ]
Marxer, Ricard [1 ]
机构
[1] Aix Marseille Univ, Univ Toulon, CNRS, LIS, Marseille, France
关键词
Foundation models; speech perception; intelligibility prediction; hearing aids;
D O I
10.1109/ICASSP48485.2024.10447907
中图分类号
学科分类号
摘要
Speech foundation models (SFMs) have been benchmarked on many speech processing tasks, often achieving state-of-the-art performance with minimal adaptation. However, the SFM paradigm has been significantly less explored for applications of interest to the speech perception community. In this paper we present a systematic evaluation of 10 SFMs on one such application: Speech intelligibility prediction. We focus on the non-intrusive setup of the Clarity Prediction Challenge 2 (CPC2), where the task is to predict the percentage of words correctly perceived by hearing-impaired listeners from speech-in-noise recordings. We propose a simple method that learns a lightweight specialized prediction head on top of frozen SFMs to approach the problem. Our results reveal statistically significant differences in performance across SFMs. Our method resulted in the winning submission in the CPC2, demonstrating its promise for speech perception applications.
引用
收藏
页码:1421 / 1425
页数:5
相关论文
共 50 条
  • [1] Speech Intelligibility Prediction for Hearing-Impaired Listeners with the LEAP Model
    Rossbach, Jana
    Huber, Rainer
    Roettges, Saskia
    Hauth, Christopher F.
    Biberger, Thomas
    Brand, Thomas
    Meyer, Bernd T.
    Rennies, Jan
    INTERSPEECH 2022, 2022, : 3498 - 3502
  • [2] Speech intelligibility prediction in hearing-impaired listeners for steady and fluctuating noise
    Holube, I
    Wesselkamp, M
    Dreschler, WA
    Kollmeier, B
    MODELING SENSORINEURAL HEARING LOSS, 1997, : 447 - 459
  • [3] PREDICTION OF SPEECH-INTELLIGIBILITY FOR NORMAL-HEARING AND COCHLEARLY HEARING-IMPAIRED LISTENERS
    LUDVIGSEN, C
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 82 (04): : 1162 - 1171
  • [4] INTELLIGIBILITY OF SYNTHETIC SPEECH FOR NORMAL-HEARING AND HEARING-IMPAIRED LISTENERS
    KANGAS, KA
    ALLEN, GD
    JOURNAL OF SPEECH AND HEARING DISORDERS, 1990, 55 (04): : 751 - 755
  • [5] Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners
    Beutelmann, Rainer
    Brand, Thomas
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01): : 331 - 342
  • [6] BINAURAL SPEECH-INTELLIGIBILITY IN NOISE FOR HEARING-IMPAIRED LISTENERS
    BRONKHORST, AW
    PLOMP, R
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 86 (04): : 1374 - 1383
  • [7] Effects of reverberation on speech intelligibility in noise for hearing-impaired listeners
    Cueille, Raphael
    Lavandier, Mathieu
    Grimault, Nicolas
    ROYAL SOCIETY OPEN SCIENCE, 2022, 9 (08):
  • [8] SYLLABIC COMPRESSION AND SPEECH-INTELLIGIBILITY IN HEARING-IMPAIRED LISTENERS
    VERSCHUURE, J
    DRESCHLER, WA
    DEHAAN, EH
    VANCAPPELLEN, M
    HAMMERSCHLAG, R
    MARE, MJ
    MAAS, AJJ
    HIJMANS, AC
    SCANDINAVIAN AUDIOLOGY, 1993, 22 : 92 - 100
  • [9] Speech intelligibility prediction in hearing-impaired listeners based on a psychoacoustically motivated perception model
    Holube, I
    Kollmeier, B
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (03): : 1703 - 1716
  • [10] EFFECTS OF ACOUSTIC REFLEX ON SPEECH-INTELLIGIBILITY OF HEARING-IMPAIRED LISTENERS
    MARTIN, RL
    ASP, C
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 58 : S71 - S71