Modeling Noise Influence to Speech Intelligibility Non-intrusively by Reduced Speech Dynamic Range

Cited by: 1
Authors
Chen, Fei [1]
Affiliations
[1] Southern Univ Sci & Technol, Dept Elect & Elect Engn, Shenzhen, Peoples R China
Source
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016
Funding
National Natural Science Foundation of China;
Keywords
Speech intelligibility; intelligibility prediction; speech dynamic range; REVERBERANT; LISTENERS; HEARING; INDEX;
DOI
10.21437/Interspeech.2016-9
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
The influence of noise on a speech waveform can be characterized by a reduced speech dynamic range (rDR). This motivated the present work to propose an rDR-based intelligibility measure (denoted rDRm) that predicts speech intelligibility in noise non-intrusively (i.e., without requiring a clean reference speech signal) and is computed using only the dynamic range extracted from the noise-corrupted speech. The rDRm indices were evaluated against intelligibility scores obtained from normal-hearing listeners presented with sentences corrupted by four types of maskers in a total of 22 conditions. A high correlation (r = 0.93) was obtained between rDRm values and listeners' sentence recognition scores, and this correlation was comparable to those computed with existing intrusive and non-intrusive intelligibility measures. This suggests that the dynamic range of the speech signal may work as a simple but efficient predictor of speech intelligibility in noise, one whose computation does not need access to the clean reference speech signal.
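The abstract does not give the exact definition of rDRm, so the following Python sketch is only an illustration of the underlying idea: additive noise fills in the low-level portions of a signal and so narrows its dynamic range, which can be measured from the noisy signal alone. The function names, the 20 ms frame length, the 5th/95th percentile levels, and the toy amplitude-modulated test signal are all assumptions made for this example, not the paper's procedure.

```python
import numpy as np


def envelope_dynamic_range_db(signal, fs, frame_ms=20.0, low_pct=5.0, high_pct=95.0):
    """Dynamic range (dB) of the short-time RMS level of `signal`.

    Assumed definition for illustration only: the spread between a high and a
    low percentile of the frame-level RMS in dB. No clean reference is used.
    """
    frame_len = max(1, int(fs * frame_ms / 1000.0))
    n_frames = len(signal) // frame_len
    if n_frames < 2:
        raise ValueError("signal is too short for the chosen frame length")
    x = np.asarray(signal, dtype=float)[: n_frames * frame_len]
    frames = x.reshape(n_frames, frame_len)
    rms = np.sqrt(np.mean(frames ** 2, axis=1))
    level_db = 20.0 * np.log10(rms + 1e-12)  # small offset avoids log(0)
    low, high = np.percentile(level_db, [low_pct, high_pct])
    return float(high - low)


def make_toy_signals(fs=16000, duration_s=2.0, snr_db=0.0, seed=0):
    """Return (clean, noisy): a 4 Hz amplitude-modulated noise carrier, used as
    a hypothetical stand-in for speech, and the same signal plus white noise
    scaled to the requested SNR."""
    rng = np.random.default_rng(seed)
    t = np.arange(int(fs * duration_s)) / fs
    modulator = 0.5 * (1.0 + np.sin(2.0 * np.pi * 4.0 * t))  # values in [0, 1]
    clean = modulator ** 2 * rng.standard_normal(t.size)
    noise = rng.standard_normal(t.size)
    gain = np.sqrt(np.mean(clean ** 2) / (np.mean(noise ** 2) * 10.0 ** (snr_db / 10.0)))
    return clean, clean + gain * noise


if __name__ == "__main__":
    fs = 16000
    for snr_db in (10.0, 0.0, -10.0):
        clean, noisy = make_toy_signals(fs=fs, snr_db=snr_db)
        print(snr_db,
              round(envelope_dynamic_range_db(clean, fs), 1),
              round(envelope_dynamic_range_db(noisy, fs), 1))
```

On this toy signal the measured dynamic range of the noisy mixture shrinks as the SNR drops, which is the qualitative effect the abstract relies on. Mapping such a value to an intelligibility index, and validating it against listener scores, is outside what this sketch attempts.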
Pages: 1359-1362
Number of pages: 4