Comparing the information conveyed by envelope modulation for speech intelligibility, speech quality, and music quality

被引:15
作者
Kates, James M. [1 ]
Arehart, Kathryn H. [1 ]
机构
[1] Univ Colorado, Dept Speech Language & Hearing Sci, Boulder, CO 80309 USA
关键词
SENSORINEURAL HEARING-LOSS; TEMPORAL FINE-STRUCTURE; SINUSOIDAL REPRESENTATION; TRANSMISSION INDEX; MUTUAL INFORMATION; NOISE-REDUCTION; AID; RECOGNITION; PERCEPTION; COMPRESSION;
D O I
10.1121/1.4931899
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper uses mutual information to quantify the relationship between envelope modulation fidelity and perceptual responses. Data from several previous experiments that measured speech intelligibility, speech quality, and music quality are evaluated for normal-hearing and hearing-impaired listeners. A model of the auditory periphery is used to generate envelope signals, and envelope modulation fidelity is calculated using the normalized cross-covariance of the degraded signal envelope with that of a reference signal. Two procedures are used to describe the envelope modulation: (1) modulation within each auditory frequency band and (2) spectro-temporal processing that analyzes the modulation of spectral ripple components fit to successive short-time spectra. The results indicate that low modulation rates provide the highest information for intelligibility, while high modulation rates provide the highest information for speech and music quality. The low-to-mid auditory frequencies are most important for intelligibility, while mid frequencies are most important for speech quality and high frequencies are most important for music quality. Differences between the spectral ripple components used for the spectro-temporal analysis were not significant in five of the six experimental conditions evaluated. The results indicate that different modulation-rate and auditory-frequency weights may be appropriate for indices designed to predict different types of perceptual relationships. (C) 2015 Acoustical Society of America.
引用
收藏
页码:2470 / 2482
页数:13
相关论文
共 60 条
[1]  
Anderson M., 2010, THESIS
[2]  
Arehart K.H, 2013, P MTGS AC POMA, V19
[3]   Working Memory, Age, and Hearing Loss: Susceptibility to Hearing Aid Distortion [J].
Arehart, Kathryn H. ;
Souza, Pamela ;
Baca, Rosalinda ;
Kates, James M. .
EAR AND HEARING, 2013, 34 (03) :251-260
[4]   Effects of noise, nonlinear processing, and linear filtering on perceived music quality [J].
Arehart, Kathryn H. ;
Kates, James M. ;
Anderson, Melinda C. .
INTERNATIONAL JOURNAL OF AUDIOLOGY, 2011, 50 (03) :177-190
[5]   Effects of Noise, Nonlinear Processing, and Linear Filtering on Perceived Speech Quality [J].
Arehart, Kathryn H. ;
Kates, James M. ;
Anderson, Melinda C. .
EAR AND HEARING, 2010, 31 (03) :420-436
[6]   Speech recognition in normal hearing and sensorineural hearing loss as a function of the number of spectral channels [J].
Baskent, Deniz .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05) :2908-2925
[7]   On the importance of the Pearson correlation coefficient in noise reduction [J].
Benesty, Jacob ;
Chen, Jingdong ;
Huang, Yiteng .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (04) :757-765
[8]   THE NATIONAL-ACOUSTIC-LABORATORIES (NAL) NEW PROCEDURE FOR SELECTING THE GAIN AND FREQUENCY-RESPONSE OF A HEARING-AID [J].
BYRNE, D ;
DILLON, H .
EAR AND HEARING, 1986, 7 (04) :257-265
[9]   The role of auditory spectro-temporal modulation filtering and the decision metric for speech intelligibility prediction [J].
Chabot-Leclerc, Alexandre ;
Jorgensen, Soren ;
Dau, Torsten .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (06) :3502-3512
[10]   Spectro-temporal modulation transfer functions and speech intelligibility [J].
Chi, TS ;
Gao, YJ ;
Guyton, MC ;
Ru, PW ;
Shamma, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (05) :2719-2732