Speech perception based on spectral peaks versus spectral shape

被引:19
作者
Hillenbrand, James M. [1 ]
Houde, Robert A.
Gayvert, Robert T.
机构
[1] Western Michigan Univ, Dept Speech Pathol & Audiol, Kalamazoo, MI 49008 USA
[2] Ctr Commun Res, Rochester, NY 14623 USA
[3] Gayvet Consulting, Fairport, NY 14450 USA
关键词
D O I
10.1121/1.2188369
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study was designed to measure the relative contributions to speech intelligibility of spectral envelope peaks (including, but not limited to formants) versus the detailed shape of the spectral envelope. The problem was addressed by asking listeners to identify sentences and nonsense syllables that were generated by two structurally identical source-filter synthesizers, one of which constructs the filter function based on the detailed spectral envelope shape while the other constructs the filter function using a purposely coarse estimate that is based entirely on the distribution of peaks in the envelope. Viewed in the broadest terms the results showed that nearly as much speech information is conveyed by the peaks-only method as by the detail-preserving method. Just as clearly, however, every test showed some measurable advantage for spectral detail, although the differences were not large in absolute terms. (c) 2006 Acoustical Society of America.
引用
收藏
页码:4041 / 4054
页数:14
相关论文
共 27 条
[1]   MODELING THE PERCEPTION OF CONCURRENT VOWELS - VOWELS WITH THE SAME FUNDAMENTAL-FREQUENCY [J].
ASSMANN, PF ;
SUMMERFIELD, Q .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 85 (01) :327-338
[2]  
Bench J, 1979, Br J Audiol, V13, P108, DOI 10.3109/03005367909078884
[3]  
BLADON A, 1982, REPRESENTATION SPEEC, P95
[4]   MODELING THE JUDGMENT OF VOWEL QUALITY DIFFERENCES [J].
BLADON, RAW ;
LINDBLOM, B .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1981, 69 (05) :1414-1422
[5]  
CARLSON R, 1979, 34 SPEECH TRANSM LAB, P73
[6]  
CARLSON R, 1979, 341979 STLQPSR ROYAL, P84
[7]  
Garofolo JS, 1993, TIMIT Acoustic-Phonetic Continuous Speech Corpus
[8]  
HARRIS KS, 1958, LANG SPEECH, V1, P1
[9]   ACOUSTIC CHARACTERISTICS OF AMERICAN ENGLISH VOWELS [J].
HILLENBRAND, J ;
GETTY, LA ;
CLARK, MJ ;
WHEELER, K .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 97 (05) :3099-3111
[10]   Creating filters with arbitrary response characteristics for use in hearing and speech research [J].
Hillenbrand, J ;
Houde, RA .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1996, 39 (02) :390-395