Relative Contributions of Amplitude and Phase to the Intelligibility Advantage of Ideal Binary Masked Sentences

Cited by: 0
Authors
Wang, Lei [1]
Zhu, Shufeng [1]
Chen, Diliang [1]
Feng, Yong [1]
Chen, Fei [1]
Affiliation
[1] Southern Univ Sci & Technol, Dept Elect & Elect Engn, Shenzhen, Peoples R China
Source
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016
Funding
National Natural Science Foundation of China;
Keywords
Speech intelligibility; ideal binary masking; amplitude and phase; SPEECH; MODULATION; ENVELOPE; CUES;
DOI
10.21437/Interspeech.2016-18
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Many studies have shown that ideal binary masking (IdBM) improves the intelligibility of speech corrupted by interfering maskers. Given that amplitude and phase are two important acoustic cues for speech perception, the present work investigated the relative contributions of these two cues to the intelligibility advantage of IdBM-processed sentences. Three types of Mandarin IdBM-processed stimuli (i.e., amplitude-only, phase-only, and amplitude-and-phase) were generated and played to normal-hearing listeners for recognition. Experimental results showed that either the amplitude-only or the phase-only cue significantly improved the intelligibility of IdBM-processed sentences relative to noise-masked sentences. A masker-dependent advantage of amplitude over phase was observed when accounting for their relative contributions to the intelligibility advantage of IdBM-processed sentences. Under steady-state speech-spectrum shaped noise, both amplitude-only and phase-only IdBM-processed sentences contained intelligibility information close to that contained in amplitude-and-phase IdBM-processed sentences. In contrast, under a competing babble masker, amplitude-only IdBM-processed sentences were more intelligible than phase-only IdBM-processed sentences, and neither could account for the intelligibility advantage of amplitude-and-phase IdBM-processed sentences.
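The three stimulus types named in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure, which the abstract does not describe. The sketch assumes that the target and masker are already available as complex time-frequency units (for example, from a short-time Fourier transform), that the mask keeps units whose local SNR exceeds a threshold `lc_db`, that "amplitude-only" means discarding phase (zero phase here), and that "phase-only" means setting the magnitude to one. The function names, the `cue` parameter, and the 0 dB default are hypothetical.

```python
import cmath
import math


def ideal_binary_mask(target_tf, masker_tf, lc_db=0.0):
    """Return a 0/1 mask: 1 where the local SNR in dB exceeds lc_db.

    target_tf and masker_tf are equally sized lists of rows of complex
    time-frequency units. The threshold lc_db is an assumed parameter.
    """
    mask = []
    for t_row, m_row in zip(target_tf, masker_tf):
        row = []
        for t, m in zip(t_row, m_row):
            t_pow = abs(t) ** 2
            m_pow = abs(m) ** 2
            if t_pow == 0.0:
                keep = False          # no target energy: discard the unit
            elif m_pow == 0.0:
                keep = True           # target only: always keep the unit
            else:
                keep = 10.0 * math.log10(t_pow / m_pow) > lc_db
            row.append(1 if keep else 0)
        mask.append(row)
    return mask


def apply_mask(mixture_tf, mask, cue="both"):
    """Apply the mask to the mixture, retaining the chosen cue.

    cue="both"      keeps amplitude and phase of retained units
    cue="amplitude" keeps the magnitude and sets the phase to zero (assumption)
    cue="phase"     keeps the phase and sets the magnitude to one (assumption)
    """
    out = []
    for x_row, k_row in zip(mixture_tf, mask):
        row = []
        for x, keep in zip(x_row, k_row):
            if not keep:
                row.append(0j)
            elif cue == "both":
                row.append(x)
            elif cue == "amplitude":
                row.append(complex(abs(x), 0.0))
            elif cue == "phase":
                row.append(cmath.rect(1.0, cmath.phase(x)))
            else:
                raise ValueError("cue must be 'both', 'amplitude', or 'phase'")
        out.append(row)
    return out


# Toy 2x2 example with made-up values, for illustration only.
target = [[1 + 0j, 0.1j], [0.5 + 0j, 2j]]
masker = [[0.5 + 0j, 1 + 0j], [1 + 0j, 0.5j]]
mixture = [[t + m for t, m in zip(tr, mr)] for tr, mr in zip(target, masker)]

mask = ideal_binary_mask(target, masker)
amplitude_only = apply_mask(mixture, mask, cue="amplitude")
phase_only = apply_mask(mixture, mask, cue="phase")
```

In a full pipeline the masked units would be converted back to a waveform by an inverse transform; that step, and the choice of analysis window and threshold, are left out because the abstract does not specify them.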
Pages: 136-139
Page count: 4
Related papers
21 in total
[1]   Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation [J].
Brungart, Douglas S. ;
Chang, Peter S. ;
Simpson, Brian D. ;
Wang, DeLiang .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (06) :4007-4018
[2]   Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise [J].
Cao, Shuyang ;
Li, Liang ;
Wu, Xihong .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (04) :2227-2236
[3]  
Chen F, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P3404
[4]   Effect of temporal modulation rate on the intelligibility of phase-based speech [J].
Chen, Fei ;
Guan, Tian .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (06) :EL520-EL526
[5]   Zerocrossing-based nonuniform sampling to deliver low-frequency fine structure cue for cochlear implant [J].
Chen, Fei ;
Zhang, Yuan-Ting .
DIGITAL SIGNAL PROCESSING, 2011, 21 (03) :427-432
[6]   A glimpsing model of speech perception in noise [J].
Cooke, M .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (03) :1562-1573
[7]   Speech intelligibility as a function of the number of channels of stimulation for signal processors using sine-wave and noise-band outputs [J].
Dorman, MF ;
Loizou, PC ;
Rainey, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 102 (04) :2403-2411
[8]   The ability of listeners to use recovered envelope cues from speech fine structure [J].
Gilbert, G ;
Lorenzi, C .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (04) :2438-2444
[9]   On the significance of phase in the short term Fourier spectrum for speech intelligibility [J].
Kazama, Michiko ;
Gotoh, Satoru ;
Tohyama, Mikio ;
Houtgast, Tammo .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (03) :1432-1439
[10]   Effect of spectral resolution on the intelligibility of ideal binary masked speech [J].
Li, Ning ;
Loizou, Philipos C. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (04) :EL59-EL64