Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation

Cited by: 308
Authors
Brungart, Douglas S.
Chang, Peter S.
Simpson, Brian D.
Wang, DeLiang
Affiliations
[1] USAF, Res Lab, Human Effectiveness Directorate, Wright Patterson AFB, OH 45433 USA
[2] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[3] Ohio State Univ, Ctr Cognit Sci, Columbus, OH 43210 USA
Source
Journal of the Acoustical Society of America, 2006, 120 (06)
DOI
10.1121/1.2363929
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
When a target speech signal is obscured by an interfering speech waveform, comprehension of the target message depends both on the successful detection of the energy from the target speech waveform and on the successful extraction and recognition of the spectro-temporal energy pattern of the target out of a background of acoustically similar masker sounds. This study attempted to isolate the effects that energetic masking, defined as the loss of detectable target information due to the spectral overlap of the target and masking signals, has on multitalker speech perception. This was achieved through the use of ideal time-frequency binary masks that retained those spectro-temporal regions of the acoustic mixture that were dominated by the target speech but eliminated those regions that were dominated by the interfering speech. The results suggest that energetic masking plays a relatively small role in the overall masking that occurs when speech is masked by interfering speech but a much more significant role when speech is masked by interfering noise. © 2006 Acoustical Society of America.
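The masking rule described in the abstract (keep the time-frequency regions dominated by the target, discard those dominated by the interferer) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the per-unit energies of the target and masker are already available separately, and the function names, the `lc_db` threshold (a local SNR criterion, set here to 0 dB), and the toy numbers are all hypothetical.

```python
import math

def ideal_binary_mask(target_energy, masker_energy, lc_db=0.0):
    """Return a 0/1 mask with the same shape as the inputs.

    target_energy, masker_energy: lists of rows (time frames); each row is a
    list of non-negative energies, one per frequency channel, computed from
    the target and masker signals separately before mixing.
    lc_db: hypothetical local SNR criterion in dB; a unit is kept only when
    its local SNR is strictly greater than this value.
    """
    floor = 1e-12  # small constant that avoids log(0) in silent units
    mask = []
    for t_row, m_row in zip(target_energy, masker_energy):
        row = []
        for t, m in zip(t_row, m_row):
            local_snr_db = 10.0 * math.log10((t + floor) / (m + floor))
            row.append(1 if local_snr_db > lc_db else 0)
        mask.append(row)
    return mask

def apply_mask(mixture_energy, mask):
    """Zero out the mixture units that the mask marks as masker-dominated."""
    return [[e * k for e, k in zip(e_row, k_row)]
            for e_row, k_row in zip(mixture_energy, mask)]

# Toy example: 2 time frames x 3 frequency channels (made-up energies).
target = [[4.0, 0.1, 1.0], [0.0, 2.0, 0.5]]
masker = [[1.0, 1.0, 1.0], [0.3, 0.2, 5.0]]
# Assumption for illustration only: mixture energy is approximated as the
# sum of the two energies, ignoring cross terms between the signals.
mixture = [[t + m for t, m in zip(t_row, m_row)]
           for t_row, m_row in zip(target, masker)]

mask = ideal_binary_mask(target, masker)   # [[1, 0, 0], [0, 1, 0]]
retained = apply_mask(mixture, mask)
```

The mask is called "ideal" because it needs the target and masker before mixing, which is possible in a controlled experiment but not in ordinary listening. With the strict comparison used here, a unit where target and masker energies are equal is discarded.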
Pages: 4007-4018
Page count: 12
Related papers
50 in total
  • [1] Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    Brungart, Douglas S.
    Chang, Peter S.
    Simpson, Brian D.
    Wang, DeLiang
    Journal of the Acoustical Society of America, 2006, 120 (06): 4007 - 4018
  • [2] Determining the energetic and informational components of speech-on-speech masking
    Kidd, Gerald, Jr.
    Mason, Christine R.
    Swaminathan, Jayaganesh
    Roverud, Elin
    Clayton, Kameron K.
    Best, Virginia
    Journal of the Acoustical Society of America, 2016, 140 (01): 132 - 144
  • [3] Speech intelligibility in background noise with ideal binary time-frequency masking
    Wang, DeLiang
    Kjems, Ulrik
    Pedersen, Michael S.
    Boldt, Jesper B.
    Lunner, Thomas
    Journal of the Acoustical Society of America, 2009, 125 (04): 2336 - 2347
  • [4] On time-frequency masking in voiced speech
    Skoglund, J
    Kleijn, WB
    IEEE Transactions on Speech and Audio Processing, 2000, 8 (04): 361 - 369
  • [5] Energetic and Informational Components of Speech-on-Speech Masking in Binaural Speech Intelligibility and Perceived Listening Effort
    Rennies, Jan
    Best, Virginia
    Roverud, Elin
    Kidd, Gerald, Jr.
    Trends in Hearing, 2019, 23
  • [6] The importance of processing resolution in "ideal time-frequency segregation" of masked speech and the implications for predicting speech intelligibility
    Conroy, Christopher
    Best, Virginia
    Jennings, Todd R.
    Kidd, Gerald, Jr.
    Journal of the Acoustical Society of America, 2020, 147 (03): 1648 - 1660
  • [7] Determining the energetic and informational components of speech-on-speech masking in listeners with sensorineural hearing loss
    Kidd, Gerald, Jr.
    Mason, Christine R.
    Best, Virginia
    Roverud, Elin
    Swaminathan, Jayaganesh
    Jennings, Todd
    Clayton, Kameron
    Journal of the Acoustical Society of America, 2019, 145 (01): 440 - 457
  • [8] Strength of target source segregation cues affects the outcome of speech-on-speech masking experiments
    Roverud, Elin
    Villard, Sarah
    Kidd, Gerald, Jr.
    Journal of the Acoustical Society of America, 2023, 153 (05): 2780 - 2788
  • [9] Segmentation on time-frequency domain for speech segregation
    Lim, Sung-Kil
    Lee, Hyon-Soo
    2006 International Symposium on Intelligent Signal Processing and Communications, Vols 1 and 2, 2006: 433 - +
  • [10] Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
    Luo, Yi
    Mesgarani, Nima
    IEEE-ACM Transactions on Audio Speech and Language Processing, 2019, 27 (08): 1256 - 1266