Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners

被引:25
|
作者
Monaghan, Jessica J. M. [1 ,3 ]
Goehring, Tobias [1 ]
Yang, Xin [1 ]
Bolner, Federico [2 ,4 ]
Wang, Shangqiguo [1 ]
Wright, Matthew C. M. [1 ]
Bleeck, Stefan [1 ]
机构
[1] Univ Southampton, Inst Sound & Vibrat Res, Southampton, Hants, England
[2] Katholieke Univ Leuven, ExpORL, Leuven, Belgium
[3] Australian Hearing Hub, Sydney, NSW, Australia
[4] Cochlear Technol Ctr, Mechelen, Belgium
来源
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2017年 / 141卷 / 03期
基金
英国工程与自然科学研究理事会;
关键词
NOISE-REDUCTION ALGORITHM; ENHANCEMENT ALGORITHMS; SIZE; RECOGNITION; INFORMATION; SUPPRESSION; PERCEPTION; PREDICTION; LEVEL; MODEL;
D O I
10.1121/1.4977197
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Machine-learning based approaches to speech enhancement have recently shown great promise for improving speech intelligibility for hearing-impaired listeners. Here, the performance of three machine-learning algorithms and one classical algorithm, Wiener filtering, was compared. Two algorithms based on neural networks were examined, one using a previously reported feature set and one using a feature set derived from an auditory model. The third machine-learning approach was a dictionary-based sparse-coding algorithm. Speech intelligibility and quality scores were obtained for participants with mild-to-moderate hearing impairments listening to sentences in speech-shaped noise and multi-talker babble following processing with the algorithms. Intelligibility and quality scores were significantly improved by each of the three machine-learning approaches, but not by the classical approach. The largest improvements for both speech intelligibility and quality were found by implementing a neural network using the feature set based on auditory modeling. Furthermore, neural network based techniques appeared more promising than dictionary-based, sparse coding in terms of performance and ease of implementation. (C) 2017 Acoustical Society of America.
引用
收藏
页码:1985 / 1998
页数:14
相关论文
共 50 条
  • [1] Predicting speech intelligibility in hearing-impaired listeners using a physiologically inspired auditory model
    Zaar, Johannes
    Carney, Laurel H.
    HEARING RESEARCH, 2022, 426
  • [2] SPECTRAL ENHANCEMENT TO IMPROVE THE INTELLIGIBILITY OF SPEECH IN NOISE FOR HEARING-IMPAIRED LISTENERS
    SIMPSON, AM
    MOORE, BCJ
    GLASBERG, BR
    ACTA OTO-LARYNGOLOGICA, 1990, : 101 - 107
  • [3] Auditory and auditory-visual intelligibility of speech in fluctuating maskers for normal-hearing and hearing-impaired listeners
    Bernstein, Joshua G. W.
    Grant, Ken W.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (05): : 3358 - 3372
  • [4] INTELLIGIBILITY OF SYNTHETIC SPEECH FOR NORMAL-HEARING AND HEARING-IMPAIRED LISTENERS
    KANGAS, KA
    ALLEN, GD
    JOURNAL OF SPEECH AND HEARING DISORDERS, 1990, 55 (04): : 751 - 755
  • [5] BINAURAL SPEECH-INTELLIGIBILITY IN NOISE FOR HEARING-IMPAIRED LISTENERS
    BRONKHORST, AW
    PLOMP, R
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 86 (04): : 1374 - 1383
  • [6] Effects of reverberation on speech intelligibility in noise for hearing-impaired listeners
    Cueille, Raphael
    Lavandier, Mathieu
    Grimault, Nicolas
    ROYAL SOCIETY OPEN SCIENCE, 2022, 9 (08):
  • [7] SYLLABIC COMPRESSION AND SPEECH-INTELLIGIBILITY IN HEARING-IMPAIRED LISTENERS
    VERSCHUURE, J
    DRESCHLER, WA
    DEHAAN, EH
    VANCAPPELLEN, M
    HAMMERSCHLAG, R
    MARE, MJ
    MAAS, AJJ
    HIJMANS, AC
    SCANDINAVIAN AUDIOLOGY, 1993, 22 : 92 - 100
  • [8] Speech Intelligibility Prediction for Hearing-Impaired Listeners with the LEAP Model
    Rossbach, Jana
    Huber, Rainer
    Roettges, Saskia
    Hauth, Christopher F.
    Biberger, Thomas
    Brand, Thomas
    Meyer, Bernd T.
    Rennies, Jan
    INTERSPEECH 2022, 2022, : 3498 - 3502
  • [9] SPEECH FOUNDATION MODELS ON INTELLIGIBILITY PREDICTION FOR HEARING-IMPAIRED LISTENERS
    Cuervo, Santiago
    Marxer, Ricard
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1421 - 1425
  • [10] EVALUATION OF 2 SPEECH ENHANCEMENT TECHNIQUES TO IMPROVE INTELLIGIBILITY FOR HEARING-IMPAIRED ADULTS
    MONTGOMERY, AA
    EDGE, RA
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1988, 31 (03): : 386 - 393