Acoustic and perceptual effects of amplitude and frequency compression on high-frequency speech

被引:26
作者
Alexander, Joshua M. [1 ]
Rallapalli, Varsha [1 ]
机构
[1] Purdue Univ, Dept Speech Language & Hearing Sci, W Lafayette, IN 47907 USA
关键词
DYNAMIC-RANGE COMPRESSION; S-VERTICAL-BAR; TEMPORAL-ENVELOPE; NORMAL-HEARING; RELEASE TIME; MULTICHANNEL COMPRESSION; RECOGNITION; CHILDREN; MODULATION; ADULTS;
D O I
10.1121/1.4997938
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study investigated how six different amplification methods influence acoustic properties, and subsequently perception, of high-frequency cues in fricatives that have been processed with conventional full bandwidth amplification or nonlinear frequency compression (NFC)-12 conditions total. Amplification methods included linear gain, fast/slow-acting wide dynamic range compression crossed with fixed/individualized compression parameters, and a method with adaptive time constants. Twenty-one hearing-impaired listeners identified seven fricatives in nonsense syllables produced by female talkers. For NFC stimuli, frequency-compressed filters that precisely aligned 1/3-octave bands between input and output were used to quantify effective compression ratio, audibility, and temporal envelope modulation relative to the input. Results indicated significant relationships between these acoustic properties, each of which contributed significantly to fricative recognition across the entire corpus of stimuli. Recognition was significantly better for NFC stimuli compared with full bandwidth stimuli, regardless of the amplification method, which had complementary effects on audibility and envelope modulation. Furthermore, while there were significant differences in recognition across the amplification methods, they were not consistent across phonemes. Therefore, neither recognition nor acoustic data overwhelmingly suggest that one amplification method should be used over another for transmission of high-frequency cues in isolated syllables. Longer duration stimuli and more realistic listening conditions should be examined. (C) 2017 Acoustical Society of America.
引用
收藏
页码:908 / 923
页数:16
相关论文
共 65 条
[1]   Nonlinear frequency compression: Influence of start frequency and input bandwidth on consonant and vowel recognition [J].
Alexander, Joshua M. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (02) :938-957
[2]   Effects of WDRC Release Time and Number of Channels on Output SNR and Speech Recognition [J].
Alexander, Joshua M. ;
Masterson, Katie .
EAR AND HEARING, 2015, 36 (02) :E35-E49
[3]  
Alexander JM, 2014, EAR HEARING, V35, P519, DOI 10.1097/AUD.0000000000000040
[4]  
Alexander Joshua M., 2013, Seminars in Hearing, V34, P86, DOI 10.1055/s-0033-1341346
[5]  
ANSI, 2003, S3222003 ANSI S3222003 ANSI
[6]  
ANSI, 2007, ANSI S3.5-1997 [R2007]
[7]  
ANSI, 2004, S1112004 ANSI
[8]   Relative importance of temporal information in various frequency regions for consonant identification in quiet and in noise [J].
Apoux, F ;
Bacon, SP .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (03) :1671-1680
[9]   Importance of temporal-envelope speech cues in different spectral regions [J].
Ardoint, Marine ;
Agus, Trevor ;
Sheft, Stanley ;
Lorenzi, Christian .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (02) :EL115-EL121
[10]  
BACON SP, 1985, AUDIOLOGY, V24, P117