Determining the energetic and informational components of speech-on-speech masking

被引:90
作者
Kidd, Gerald, Jr. [1 ,2 ]
Mason, Christine R. [1 ,2 ]
Swaminathan, Jayaganesh [1 ,2 ]
Roverud, Elin [1 ,2 ]
Clayton, Kameron K. [1 ,2 ,3 ]
Best, Virginia [1 ,2 ]
机构
[1] Boston Univ, Dept Speech Language & Hearing Sci, 635 Commonwealth Ave, Boston, MA 02215 USA
[2] Boston Univ, Hearing Res Ctr, 635 Commonwealth Ave, Boston, MA 02215 USA
[3] Harvard Med Sch, Program Speech & Hearing Biosci & Technol, Div Med Sci, 260 Longwood Ave, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
TIME-FREQUENCY SEGREGATION; COCKTAIL-PARTY PROBLEM; SPATIAL RELEASE; MULTICOMPONENT MASKERS; SIMULTANEOUS TALKERS; RECEPTION THRESHOLD; INTERFERING-SPEECH; FLUCTUATING NOISE; COMPETING SPEECH; REVERSED SPEECH;
D O I
10.1121/1.4954748
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Identification of target speech was studied under masked conditions consisting of two or four independent speech maskers. In the reference conditions, the maskers were colocated with the target, the masker talkers were the same sex as the target, and the masker speech was intelligible. The comparison conditions, intended to provide release from masking, included different-sex target and masker talkers, time-reversal of the masker speech, and spatial separation of the maskers from the target. Significant release from masking was found for all comparison conditions. To determine whether these reductions in masking could be attributed to differences in energetic masking, ideal time-frequency segregation (ITFS) processing was applied so that the time-frequency units where the masker energy dominated the target energy were removed. The remaining target-dominated "glimpses" were reassembled as the stimulus. Speech reception thresholds measured using these resynthesized ITFS-processed stimuli were the same for the reference and comparison conditions supporting the conclusion that the amount of energetic masking across conditions was the same. These results indicated that the large release from masking found under all comparison conditions was due primarily to a reduction in informational masking. Furthermore, the large individual differences observed generally were correlated across the three masking release conditions. (C) 2016 Author(s). All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:132 / 144
页数:13
相关论文
共 66 条
[1]   Determination of the potential benefit of time-frequency gain manipulation [J].
Anzalone, Michael C. ;
Calandruccio, Lauren ;
Doherty, Karen A. ;
Carney, Laurel H. .
EAR AND HEARING, 2006, 27 (05) :480-492
[2]   Evidence for spatial tuning in informational masking using the probe-signal method [J].
Arbogast, TL ;
Kidd, G .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (04) :1803-1810
[3]   Visually-guided attention enhances target identification in a complex auditory scene [J].
Best, Virginia ;
Ozmeral, Erol J. ;
Shinn-Cunningham, Barbara G. .
JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2007, 8 (02) :294-304
[4]   An Energetic Limit on Spatial Release from Masking [J].
Best, Virginia ;
Thompson, Eric R. ;
Mason, Christine R. ;
Kidd, Gerald, Jr. .
JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2013, 14 (04) :603-610
[5]   The influence of non-spatial factors on measures of spatial release from masking [J].
Best, Virginia ;
Marrone, Nicole ;
Mason, Christine R. ;
Kidd, Gerald, Jr. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (04) :3103-3110
[6]   A speech corpus for multitalker communications research [J].
Bolia, RS ;
Nelson, WT ;
Ericson, MA ;
Simpson, BD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 107 (02) :1065-1066
[7]   The cocktail-party problem revisited: early processing and selection of multi-talker speech [J].
Bronkhorst, Adelbert W. .
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2015, 77 (05) :1465-1487
[8]  
Bronkhorst AW, 2000, ACUSTICA, V86, P117
[9]   Linguistic contributions to speech-on-speech masking for native and non-native listeners: Language familiarity and semantic content [J].
Brouwer, Susanne ;
Van Engen, Kristin J. ;
Calandruccio, Lauren ;
Bradlow, Ann R. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (02) :1449-1464
[10]   Isolating the energetic com ponent of speech-on-speech masking with ideal time-frequency segregation [J].
Brungart, Douglas S. ;
Chang, Peter S. ;
Simpson, Brian D. ;
Wang, DeLiang .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (06) :4007-4018