Effects of spatial configuration and fundamental frequency on speech intelligibility in multiple-talker conditions in the ipsilateral horizontal plane and median plane

被引:0
作者
Yao, Dingding [1 ,2 ]
Zhao, Jiale [1 ,2 ]
Wang, Linyi [1 ,2 ]
Shang, Zengqiang [1 ,2 ]
Gu, Jianjun [1 ,2 ]
Wang, Yunan [2 ,3 ]
Jia, Maoshen [4 ]
Li, Junfeng [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Beihang Univ, Dept Elect & Informat Engn, Beijing 100191, Peoples R China
[4] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
基金
中国国家自然科学基金;
关键词
INFORMATIONAL MASKING; CANCELLATION THEORY; RELEASE; NOISE; SEGREGATION; PERCEPTION; SEPARATION; HEARING; HEAD; EQUALIZATION;
D O I
10.1121/10.0025857
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Spatial separation and fundamental frequency (F0) separation are effective cues for improving the intelligibility of target speech in multi-talker scenarios. Previous studies predominantly focused on spatial configurations within the frontal hemifield, overlooking the ipsilateral side and the entire median plane, where localization confusion often occurs. This study investigated the impact of spatial and F0 separation on intelligibility under the above-mentioned underexplored spatial configurations. The speech reception thresholds were measured through three experiments for scenarios involving two to four talkers, either in the ipsilateral horizontal plane or in the entire median plane, utilizing monotonized speech with varying F0s as stimuli. The results revealed that spatial separation in symmetrical positions (front-back symmetry in the ipsilateral horizontal plane or front-back, up-down symmetry in the median plane) contributes positively to intelligibility. Both target direction and relative target-masker separation influence the masking release attributed to spatial separation. As the number of talkers exceeds two, the masking release from spatial separation diminishes. Nevertheless, F0 separation remains as a remarkably effective cue and could even facilitate spatial separation in improving intelligibility. Further analysis indicated that current intelligibility models encounter difficulties in accurately predicting intelligibility in scenarios explored in this study. (c) 2024 Acoustical Society of America.
引用
收藏
页码:2934 / 2947
页数:14
相关论文
共 53 条
  • [1] Assmann P.F., 1999, P 14 INT C PHONETIC, V1, P179
  • [2] Spatial Unmasking Effect on Speech Reception Threshold in the Median Plane
    Berwick, Nathan
    Lee, Hyunkook
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (15):
  • [3] Blauert J., 1996, Spatial Hearing: The Psychophysics of Human Sound Localization
  • [4] Boersma P., 2019, Praat, a system for doing phonetics by computer, DOI DOI 10.1097/AUD.0B013-31821473F7
  • [5] Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests
    Brand, T
    Kollmeier, B
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (06) : 2801 - 2810
  • [6] INTONATION AND THE PERCEPTUAL SEPARATION OF SIMULTANEOUS VOICES
    BROKX, JPL
    NOOTEBOOM, SG
    [J]. JOURNAL OF PHONETICS, 1982, 10 (01) : 23 - 36
  • [7] Informational and energetic masking effects in the perception of multiple simultaneous talkers
    Brungart, DS
    Simpson, BD
    Ericson, MA
    Scott, KR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (05) : 2527 - 2538
  • [8] Informational and energetic masking effects in the perception of two simultaneous talkers
    Brungart, DS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 109 (03) : 1101 - 1109
  • [9] Informational masking of speech produced by speech-like sounds without linguistic content
    Chen, Jing
    Li, Huahui
    Li, Liang
    Wu, Xihong
    Moore, Brian C. J.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (04) : 2914 - 2926
  • [10] Colburn H.S., 1996, AUDITORY COMPUTATION, P332, DOI 10.1007/978-1-4612-4070-98