Contributions of talker characteristics and spatial location to auditory streaming

Cited by: 37
Authors
Allen, Kachina [1]
Carlile, Simon [1]
Alais, David [2]
Affiliations
[1] Univ Sydney, Dept Physiol, Sydney, NSW 2006, Australia
[2] Univ Sydney, Sch Psychol, Sydney, NSW 2006, Australia
Funding
Australian Research Council
DOI
10.1121/1.2831774
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
To examine whether auditory streaming contributes to unmasking, intelligibility of target sentences against two competing talkers was measured using the coordinate response measure (CRM) [Bolia et al., J. Acoust. Soc. Am. 107, 1065-1066 (2000)] corpus. In the control condition, the speech reception threshold (50% correct) was measured when the target and two maskers were collocated straight ahead. Separating maskers from the target by ±30° resulted in spatial release from masking of 12 dB. CRM sentences involve an identifier in the first part and two target words in the second part. In experimental conditions, masking talkers started spatially separated at ±30° but became collocated with the target before the scoring words. In one experiment, one target and two different maskers were randomly selected from a mixed-sex corpus. Significant unmasking of 4 dB remained despite the absence of persistent location cues. When same-sex talkers were used as maskers and target, unmasking was reduced. These data suggest that initial separation may permit confident identification and streaming of the target and masker speech where significant differences between target and masker voice characteristics exist, but where target and masker characteristics are similar, listeners must rely more heavily on continuing spatial cues. © 2008 Acoustical Society of America.
Pages: 1562-1570
Page count: 9
Related papers
49 entries in total
[1]   The ventriloquist effect results from near-optimal bimodal integration [J].
Alais, D ;
Burr, D .
CURRENT BIOLOGY, 2004, 14 (03) :257-262
[2]   Synchronizing to real events: Subjective audiovisual alignment scales with perceived auditory depth and speed of sound [J].
Alais, D ;
Carlile, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (06) :2244-2247
[3]  
Albert S. Bregman, 1990, AUDITORY SCENE ANAL, P411, DOI 10.7551/MITPRESS/1486.001.0001; 10.1121/1.408434
[4]  
[Anonymous], 1997, Bootstrap methods and their application
[5]   Evidence for spatial tuning in informational masking using the probe-signal method [J].
Arbogast, TL ;
Kidd, G .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (04) :1803-1810
[6]   EFFECTS OF PHONETIC CONTEXT ON AUDIOVISUAL INTELLIGIBILITY OF FRENCH [J].
BENOIT, C ;
MOHAMADI, T ;
KANDEL, S .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1994, 37 (05) :1195-1203
[7]   A speech corpus for multitalker communications research [J].
Bolia, RS ;
Nelson, WT ;
Ericson, MA ;
Simpson, BD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 107 (02) :1065-1066
[8]   MATHEMATICAL TREATMENT OF CONTEXT EFFECTS IN PHONEME AND WORD RECOGNITION [J].
BOOTHROYD, A ;
NITTROUER, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 84 (01) :101-114
[9]   THE EFFECT OF HEAD-INDUCED INTERAURAL TIME AND LEVEL DIFFERENCES ON SPEECH-INTELLIGIBILITY IN NOISE [J].
BRONKHORST, AW ;
PLOMP, R .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 83 (04) :1508-1516
[10]   Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation [J].
Brungart, Douglas S. ;
Chang, Peter S. ;
Simpson, Brian D. ;
Wang, DeLiang .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (06) :4007-4018