Extending the Hearing-Aid Speech Perception Index (HASPI): Keywords, sentences, and context

被引:2
作者
Kates, James M. [1 ]
机构
[1] Univ Colorado, Dept Speech Language & Hearing Sci, Boulder, CO 80309 USA
基金
美国国家卫生研究院;
关键词
AUDITORY FILTER NONLINEARITY; WORKING-MEMORY; WORD RECOGNITION; INTELLIGIBILITY; NOISE; ENVELOPE; MODULATION; PHONEME; PREDICTION; MASKING;
D O I
10.1121/10.0017546
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The Hearing-Aid Speech Perception Index version 2 (HASPI v2) is a speech intelligibility metric derived by fitting subject responses scored as the proportion of complete sentences correct. This paper presents an extension of HASPI v2, denoted by HASPI w2, which predicts proportion keywords correct for the same datasets used to derive HASPI v2. The results show that the accuracy of HASPI w2 is nearly identical to that of HASPI v2. The values produced by HASPI w2 and HASPI v2 also allow the comparison of proportion words correct and sentences correct for the same stimuli. Using simulation values for speech in additive noise, a model of context effects for words combined into sentences is developed and accounts for the loss of intelligibility inherent in the impaired auditory periphery. In addition, HASPI w2 and HASPI v2 have a small bias term at poor signal-to-noise ratios; the model for context effects shows that the residual bias is reduced in converting from proportion keywords to sentences correct but is greatly magnified when considering the reverse transformation.
引用
收藏
页码:1662 / 1673
页数:12
相关论文
共 67 条
[11]   Evaluation of context effects in sentence recognition [J].
Bronkhorst, AW ;
Brand, T ;
Wagener, K .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (06) :2874-2886
[12]   THE NATIONAL-ACOUSTIC-LABORATORIES (NAL) NEW PROCEDURE FOR SELECTING THE GAIN AND FREQUENCY-RESPONSE OF A HEARING-AID [J].
BYRNE, D ;
DILLON, H .
EAR AND HEARING, 1986, 7 (04) :257-265
[13]   Speech recognition of hearing-impaired listeners: Predictions from audibility and the limited role of high-frequency amplification [J].
Ching, TYC ;
Dillon, H ;
Byrne, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 103 (02) :1128-1140
[14]   Prediction of speech intelligibility based on an auditory preprocessing model [J].
Christiansen, Claus ;
Pedersen, Michael Syskind ;
Dau, Torsten .
SPEECH COMMUNICATION, 2010, 52 (7-8) :678-692
[15]  
Cooke M., 1993, MODELING AUDITORY
[16]   Modeling auditory processing of amplitude modulation .1. Detection and masking with narrow-band carriers [J].
Dau, T ;
Kollmeier, B ;
Kohlrausch, A .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 102 (05) :2892-2905
[18]   A LEISURELY LOOK AT THE BOOTSTRAP, THE JACKKNIFE, AND CROSS-VALIDATION [J].
EFRON, B ;
GONG, G .
AMERICAN STATISTICIAN, 1983, 37 (01) :36-48
[19]   A spectro-temporal modulation index (STMI) for assessment of speech intelligibility [J].
Elhilali, M ;
Chi, T ;
Shamma, SA .
SPEECH COMMUNICATION, 2003, 41 (2-3) :331-348
[20]   Spectro-temporal processing in the envelope-frequency domain [J].
Ewert, SD ;
Verhey, JL ;
Dau, T .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 112 (06) :2921-2931