Isolating the energetic com ponent of speech-on-speech masking with ideal time-frequency segregation

被引：312

作者：

Brungart, Douglas S.

Chang, Peter S.

Simpson, Brian D.

Wang, DeLiang

机构：

[1] USAF, Res Lab, Human Effectiveness Directorate, Wright Patterson AFB, OH 45433 USA

[2] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA

[3] Ohio State Univ, Ctr Cognit Sci, Columbus, OH 43210 USA

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2006年 / 120卷 / 06期

关键词：

D O I：

10.1121/1.2363929

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

When a target speech signal is obscured by an interfering speech wave form, comprehension of the target message depends both on the successful detection of the energy from the target speech wave form and on the successful extraction and recognition of the spectro-temporal energy pattern of the target out of a background of acoustically similar masker sounds. This study attempted to isolate the effects that energetic masking, defined as the loss of detectable target information due to the spectral overlap of the target and masking signals, has on multitalker speech perception. This was achieved through the use of ideal time-frequency binary masks that retained those spectro-temporal regions of the acoustic mixture that were dominated by the target speech but eliminated those regions that were dominated by the interfering speech. The results suggest that energetic masking plays a relatively small role in the overall masking that occurs when speech is masked by interfering speech but a much more significant role when speech is masked by interfering noise. (c) 2006 Acoustical Society of America.

引用

页码：4007 / 4018

页数：12

共 33 条

[1]

Albert S. Bregman, 1990, AUDITORY SCENE ANAL, P411, DOI [DOI 10.1121/1.408434, DOI 10.7551/MITPRESS/1486.001.0001]

[2]

[Anonymous], 2004, SPEECH PROCESSING AU

[3] The effect of spatial separation on informational and energetic masking of speech [J].