Objective Prediction of Hearing Aid Benefit Across Listener Groups Using Machine Learning: Speech Recognition Performance With Binaural Noise-Reduction Algorithms

被引:25
作者
Schaedler, Marc R. [1 ]
Warzybok, Anna [1 ]
Kollmeier, Birger [1 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Med Phys & Cluster Excellence Hearing4all, Kupkersweg 74, D-26129 Oldenburg, Germany
关键词
speech recognition; speech perception modeling; hearing impairment; hearing aid; INTELLIGIBILITY PREDICTION; IMPAIRED LISTENERS; ENVELOPE-POWER; PERCEPTION; MODEL; COHERENCE; SYSTEM; RATIO;
D O I
10.1177/2331216518768954
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
The simulation framework for auditory discrimination experiments (FADE) was adopted and validated to predict the individual speech-in-noise recognition performance of listeners with normal and impaired hearing with and without a given hearing-aid algorithm. FADE uses a simple automatic speech recognizer (ASR) to estimate the lowest achievable speech reception thresholds (SRTs) from simulated speech recognition experiments in an objective way, independent from any empirical reference data. Empirical data from the literature were used to evaluate the model in terms of predicted SRTs and benefits in SRT with the German matrix sentence recognition test when using eight single-and multichannel binaural noise-reduction algorithms. To allow individual predictions of SRTs in binaural conditions, the model was extended with a simple better ear approach and individualized by taking audiograms into account. In a realistic binaural cafeteria condition, FADE explained about 90% of the variance of the empirical SRTs for a group of normal-hearing listeners and predicted the corresponding benefits with a root-mean-square prediction error of 0.6 dB. This highlights the potential of the approach for the objective assessment of benefits in SRT without prior knowledge about the empirical data. The predictions for the group of listeners with impaired hearing explained 75% of the empirical variance, while the individual predictions explained less than 25%. Possibly, additional individual factors should be considered for more accurate predictions with impaired hearing. A competing talker condition clearly showed one limitation of current ASR technology, as the empirical performance with SRTs lower than -20 dB could not be predicted.
引用
收藏
页数:21
相关论文
共 45 条
[1]  
[Anonymous], 2001, PERCEPTUAL EVALUATIO, P862
[2]  
[Anonymous], 2001, J Am Acad Audiol, DOI [DOI 10.1055/S-0041-1741117, DOI 10.1055/s-0041-1741117]
[3]  
ANSI, 1997, METH CALC SPEECH INT, pS35
[4]   Comparing Binaural Pre-processing Strategies I: Instrumental Evaluation [J].
Baumgaertel, Regina M. ;
Krawczyk-Becker, Martin ;
Marquardt, Daniel ;
Voelker, Christoph ;
Hu, Hongmei ;
Herzke, Tobias ;
Coleman, Graham ;
Adiloglu, Kamil ;
Ernst, Stephan M. A. ;
Gerkmann, Timo ;
Doclo, Simon ;
Kollmeier, Birger ;
Hohmann, Volker ;
Dietz, Mathias .
TRENDS IN HEARING, 2015, 19
[5]   Revision, extension, and evaluation of a binaural speech intelligibility model [J].
Beutelmann, Rainer ;
Brand, Thomas ;
Kollmeier, Birger .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (04) :2479-2497
[6]   The role of short-time intensity and envelope power for speech intelligibility and psychoacoustic masking [J].
Biberger, Thomas ;
Ewert, Stephan D. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (02) :1098-1111
[7]   Standard Audiograms for the IEC 60118-15 Measurement Procedure [J].
Bisgaard, Nikolai ;
Vlaming, Marcel S. M. G. ;
Dahlquist, Martin .
TRENDS IN AMPLIFICATION, 2010, 14 (02) :113-120
[8]   A glimpsing model of speech perception in noise [J].
Cooke, M .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (03) :1562-1573
[9]   A quantitative model of the ''effective'' signal processing in the auditory system .2. Simulations and measurements [J].
Dau, T ;
Puschel, D ;
Kohlrausch, A .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 99 (06) :3623-3631
[10]  
Elko G. W., 1995, 1995 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics (Cat. No.95TH8144), P169, DOI 10.1109/ASPAA.1995.482983