Predicting the intelligibility of Mandarin Chinese with manipulated and intact tonal information for normal-hearing listeners

Cited by: 0
Authors
Xu, Chenyang [1, 5]
Moore, Brian C. J. [2]
Diao, Mingfang [3, 4]
Li, Xiaodong [1, 5]
Zheng, Chengshi [1, 5]
Affiliations
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] Univ Cambridge, Dept Psychol, Cambridge Hearing Grp, Downing St, Cambridge CB2 3EB, England
[3] Peoples Liberat Army Gen Hosp, Med Ctr 6, Dept Endoscop Ear Surg, Sr Dept Otorhinolaryngol Head & Neck Surg, Beijing 100048, Peoples R China
[4] Natl Clin Med Res Ctr Otolaryngol Dis, Beijing 100048, Peoples R China
[5] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
Keywords
FINE-STRUCTURE INFORMATION; AUDITORY FILTER SHAPES; SPEECH-INTELLIGIBILITY; SPEAKING CHILDREN; TEMPORAL ENVELOPE; LEXICAL-TONE; PERCEPTION; RECOGNITION; SENTENCES; NOISE;
DOI
10.1121/10.0034233
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Objective indices for predicting speech intelligibility offer a quick and convenient alternative to behavioral measures of speech intelligibility. However, most such indices are designed for a specific language, such as English, and they do not take adequate account of tonal information in speech when applied to languages like Mandarin Chinese (hereafter called Mandarin) for which the patterns of fundamental frequency (F0) variation play an important role in distinguishing speech sounds with similar phonetic content. To address this, two experiments with normal-hearing listeners were conducted examining: (1) The impact of manipulations of tonal information on the intelligibility of Mandarin sentences presented in speech-shaped noise (SSN) at several signal-to-noise ratios (SNRs); (2) The intelligibility of Mandarin sentences with intact tonal information presented in SSN, pink noise, and babble at several SNRs. The outcomes were not correctly predicted by the Hearing Aid Speech Perception Index (HASPI-V1). A new intelligibility metric was developed that used one acoustic feature from HASPI-V1 plus Hilbert time envelope and temporal fine structure information from multiple frequency bands. For the new metric, the Pearson correlation between obtained and predicted intelligibility was 0.923 and the root mean square error was 0.119. The new metric provides a potential tool for evaluating Mandarin intelligibility.
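The two figures of merit quoted in the abstract, the Pearson correlation and the root mean square error between obtained and predicted intelligibility, can be illustrated with a minimal sketch. It assumes that intelligibility is expressed as a proportion between 0 and 1 (the abstract does not state the scale) and it is not the authors' evaluation code. The function names `pearson_r` and `rmse` and the example scores are hypothetical.

```python
import math


def pearson_r(observed, predicted):
    """Pearson correlation between two equal-length sequences of scores."""
    n = len(observed)
    if n != len(predicted) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    # Sums of cross-products and squared deviations from the means.
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_o == 0 or var_p == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_o * var_p)


def rmse(observed, predicted):
    """Root mean square error between observed and predicted scores."""
    n = len(observed)
    if n != len(predicted) or n == 0:
        raise ValueError("need two equal-length, non-empty sequences")
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


if __name__ == "__main__":
    # Hypothetical proportion-correct scores, NOT data from the paper.
    obtained = [0.10, 0.35, 0.60, 0.85, 0.95]
    predicted = [0.15, 0.30, 0.65, 0.80, 0.90]
    print("r =", round(pearson_r(obtained, predicted), 3))
    print("RMSE =", round(rmse(obtained, predicted), 3))
```

A high correlation shows only that predictions rise and fall with the obtained scores, whereas the RMSE also penalizes a constant offset, which is one reason both values are usually reported together.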
Pages: 3088-3101
Page count: 14