Isolating the energetic com ponent of speech-on-speech masking with ideal time-frequency segregation

被引:308
|
作者
Brungart, Douglas S.
Chang, Peter S.
Simpson, Brian D.
Wang, DeLiang
机构
[1] USAF, Res Lab, Human Effectiveness Directorate, Wright Patterson AFB, OH 45433 USA
[2] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[3] Ohio State Univ, Ctr Cognit Sci, Columbus, OH 43210 USA
来源
关键词
D O I
10.1121/1.2363929
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
When a target speech signal is obscured by an interfering speech wave form, comprehension of the target message depends both on the successful detection of the energy from the target speech wave form and on the successful extraction and recognition of the spectro-temporal energy pattern of the target out of a background of acoustically similar masker sounds. This study attempted to isolate the effects that energetic masking, defined as the loss of detectable target information due to the spectral overlap of the target and masking signals, has on multitalker speech perception. This was achieved through the use of ideal time-frequency binary masks that retained those spectro-temporal regions of the acoustic mixture that were dominated by the target speech but eliminated those regions that were dominated by the interfering speech. The results suggest that energetic masking plays a relatively small role in the overall masking that occurs when speech is masked by interfering speech but a much more significant role when speech is masked by interfering noise. (c) 2006 Acoustical Society of America.
引用
收藏
页码:4007 / 4018
页数:12
相关论文
共 50 条
  • [21] On the integration of time-frequency masking speech separation and recognition in underdetermined environments
    Jafari, Ingrid
    Haque, Serajul
    Togneri, Roberto
    Nordholm, Sven
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1613 - 1617
  • [22] Perceptual effects of noise reduction by time-frequency masking of noisy speech
    Brons, Inge
    Houben, Rolph
    Dreschler, Wouter A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (04): : 2690 - 2699
  • [23] Review of Time-Frequency Masking Approach for Improving Speech Intelligibility in Noise
    Kim, Gibak
    IETE TECHNICAL REVIEW, 2022, 39 (03) : 623 - 634
  • [24] Blind speech source separation via nonlinear time-frequency masking
    Xu, Shun
    Chen, Shaorong
    Liu, Yulin
    Shengxue Xuebao/Acta Acustica, 2007, 32 (04): : 375 - 381
  • [25] Blind speech source separation via nonlinear time-frequency masking
    XU Shun CHEN Shaorong LIU Yulin (DSP Lab.
    ChineseJournalofAcoustics, 2008, (03) : 203 - 214
  • [26] Accurate Labeling of Time-Frequency Units in Monaural Voiced Speech Segregation
    Shamlou, Sanam Imani
    Geravanchizadeh, Masoud
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 902 - 906
  • [27] Independent Component Analysis and Time-Frequency Masking for Speech Recognition in Multitalker Conditions
    Dorothea Kolossa
    Ramon Fernandez Astudillo
    Eugen Hoffmann
    Reinhold Orglmeister
    EURASIP Journal on Audio, Speech, and Music Processing, 2010
  • [28] The Effect of Partial Time-Frequency Masking of the Direct Sound on the Perception of Reverberant Speech
    Madmoni, Lior
    Tibor, Shir
    Nelken, Israel
    Rafaely, Boaz
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2037 - 2047
  • [29] The Application of Time-Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise
    Borrie, Stephanie A.
    Yoho, Sarah E.
    Healy, Eric W.
    Barrett, Tyson S.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2023, 66 (05): : 1853 - 1866
  • [30] Independent Component Analysis and Time-Frequency Masking for Speech Recognition in Multitalker Conditions
    Kolossa, Dorothea
    Astudillo, Ramon Fernandez
    Hoffmann, Eugen
    Orglmeister, Reinhold
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,