Band importance for speech-in-speech recognition in the presence of extended high-frequency cues

被引:0
作者
Ananthanarayana, Rohit M. [1 ]
Buss, Emily [2 ]
Monson, Brian B. [1 ,3 ]
机构
[1] Univ Illinois, Dept Speech & Hearing Sci, Champaign, IL 61820 USA
[2] Univ N Carolina, Dept Otolaryngol, HNS, Chapel Hill, NC 27599 USA
[3] Univ Illinois, Carle Illinois Coll Med, Dept Biomed & Translat Sci, Champaign, IL USA
基金
美国国家卫生研究院;
关键词
AUDITORY FILTER SHAPES; HORIZONTAL DIRECTIVITY; INTELLIGIBILITY; PERCEPTION; LISTENERS; ENERGY; NOISE; DERIVATION; SENTENCES; VOWELS;
D O I
10.1121/10.0028269
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Band importance functions for speech-in-noise recognition, typically determined in the presence of steady background noise, indicate a negligible role for extended high frequencies (EHFs; 8-20 kHz). However, recent findings indicate that EHF cues support speech recognition in multi-talker environments, particularly when the masker has reduced EHF levels relative to the target. This scenario can occur in natural auditory scenes when the target talker is facing the listener, but the maskers are not. In this study, we measured the importance of five bands from 40 to 20 000 Hz for speech-in-speech recognition by notch-filtering the bands individually. Stimuli consisted of a female target talker recorded from 0 degrees and a spatially co-located two-talker female masker recorded either from 0 degrees or 56.25 degrees, simulating a masker either facing the listener or facing away, respectively. Results indicated peak band importance in the 0.4-1.3 kHz band and a negligible effect of removing the EHF band in the facing-masker condition. However, in the non-facing condition, the peak was broader and EHF importance was higher and comparable to that of the 3.3-8.3 kHz band in the facing-masker condition. These findings suggest that EHFs contain important cues for speech recognition in listening conditions with mismatched talker head orientations.
引用
收藏
页码:1202 / 1213
页数:12
相关论文
共 61 条