Isolating the energetic com ponent of speech-on-speech masking with ideal time-frequency segregation

被引：308

作者：

Brungart, Douglas S.

Chang, Peter S.

Simpson, Brian D.

Wang, DeLiang

机构：

[1] USAF, Res Lab, Human Effectiveness Directorate, Wright Patterson AFB, OH 45433 USA

[2] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA

[3] Ohio State Univ, Ctr Cognit Sci, Columbus, OH 43210 USA

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2006年 / 120卷 / 06期

关键词：

D O I：

10.1121/1.2363929

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

When a target speech signal is obscured by an interfering speech wave form, comprehension of the target message depends both on the successful detection of the energy from the target speech wave form and on the successful extraction and recognition of the spectro-temporal energy pattern of the target out of a background of acoustically similar masker sounds. This study attempted to isolate the effects that energetic masking, defined as the loss of detectable target information due to the spectral overlap of the target and masking signals, has on multitalker speech perception. This was achieved through the use of ideal time-frequency binary masks that retained those spectro-temporal regions of the acoustic mixture that were dominated by the target speech but eliminated those regions that were dominated by the interfering speech. The results suggest that energetic masking plays a relatively small role in the overall masking that occurs when speech is masked by interfering speech but a much more significant role when speech is masked by interfering noise. (c) 2006 Acoustical Society of America.

引用

页码：4007 / 4018

页数：12

共 50 条

[21] On the integration of time-frequency masking speech separation and recognition in underdetermined environments
Jafari, Ingrid
Haque, Serajul
Togneri, Roberto
Nordholm, Sven
2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1613 - 1617
[22] Perceptual effects of noise reduction by time-frequency masking of noisy speech
Brons, Inge
Houben, Rolph
Dreschler, Wouter A.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (04): : 2690 - 2699
[23] Review of Time-Frequency Masking Approach for Improving Speech Intelligibility in Noise
Kim, Gibak
IETE TECHNICAL REVIEW, 2022, 39 (03) : 623 - 634
[24] Blind speech source separation via nonlinear time-frequency masking
Xu, Shun
Chen, Shaorong
Liu, Yulin
Shengxue Xuebao/Acta Acustica, 2007, 32 (04): : 375 - 381
[25] Blind speech source separation via nonlinear time-frequency masking
XU Shun CHEN Shaorong LIU Yulin (DSP Lab.
ChineseJournalofAcoustics, 2008, (03) : 203 - 214
[26] Accurate Labeling of Time-Frequency Units in Monaural Voiced Speech Segregation
Shamlou, Sanam Imani
Geravanchizadeh, Masoud
2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 902 - 906
[27] Independent Component Analysis and Time-Frequency Masking for Speech Recognition in Multitalker Conditions
Dorothea Kolossa
Ramon Fernandez Astudillo
Eugen Hoffmann
Reinhold Orglmeister
EURASIP Journal on Audio, Speech, and Music Processing, 2010
[28] The Effect of Partial Time-Frequency Masking of the Direct Sound on the Perception of Reverberant Speech
Madmoni, Lior
Tibor, Shir
Nelken, Israel
Rafaely, Boaz
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2037 - 2047
[29] The Application of Time-Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise
Borrie, Stephanie A.
Yoho, Sarah E.
Healy, Eric W.
Barrett, Tyson S.
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2023, 66 (05): : 1853 - 1866
[30] Independent Component Analysis and Time-Frequency Masking for Speech Recognition in Multitalker Conditions
Kolossa, Dorothea
Astudillo, Ramon Fernandez
Hoffmann, Eugen
Orglmeister, Reinhold
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,

← 1 2 3 4 5 →