Refinement and validation of the binaural short time objective intelligibility measure for spatially diverse conditions

被引:36
作者
Andersen, Asger Heidemann [1 ,2 ]
de Haan, Jan Mark [1 ]
Tan, Zheng-Hua [2 ]
Jensen, Jesper [1 ,2 ]
机构
[1] Oticon AS, Kongebakken 9, Smorum, Denmark
[2] Aalborg Univ, Dept Elect Syst, Fredrik Bajers Vej 7B, Aalborg, Denmark
关键词
Speech intelligibility prediction; Binaural hearing; Speech enhancement; PREDICTING SPEECH-INTELLIGIBILITY; EQUALIZATION-CANCELLATION MODEL; RECEPTION THRESHOLD; TRANSMISSION INDEX; INTERAURAL TIME; NORMAL-HEARING; NOISE; REVERBERATION; MASKING; QUALITY;
D O I
10.1016/j.specom.2018.06.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech intelligibility prediction methods have recently gained popularity in the speech processing community as supplements to time consuming and costly listening experiments. Such methods can be used to objectively quantify and compare the advantage of different speech enhancement algorithms, in a way that correlates well with actual speech intelligibility. One such method is the short-time objective intelligibility (STOI) measure. In a recent publication, we proposed a binaural version of the STOI measure, based on a modified version of the equalization cancellation (EC) model. This measure was shown to retain many of the advantageous properties of the STOI measure, while at the same time being able to predict intelligibility correctly in conditions involving both binaural advantage and non-linear signal processing. The biggest prediction errors were found for conditions involving multiple spatially distributed interferers. In this paper, we report results for a new listening experiment including different mixtures of isotropic and point source noise. This exposes that the binaural STOI measure has a tendency to overestimate the intelligibility in conditions with spatially distributed interferes at low signal to noise ratios (SNRs). This condition-dependent error can make it difficult to compare intelligibility across different acoustical conditions. We investigate the cause of this upward bias, and propose a correction which alleviates the problem. The modified method is evaluated with five datasets of measured intelligibility, spanning a wide range of realistic acoustic conditions. Within the tested conditions, the modified method yields very accurate predictions, and entirely alleviates the aforementioned tendency to overestimate intelligibility in conditions with spatially distributed interferers.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 60 条
  • [1] The CIPICHRTF database
    Algazi, VR
    Duda, RO
    Thompson, DM
    Avendano, C
    [J]. PROCEEDINGS OF THE 2001 IEEE WORKSHOP ON THE APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2001, : 99 - 102
  • [2] Allen JB, 2005, AUDITORY SIGNAL PROCESSINGP: PHYSIOLOGY, PSYCHOACOUSTICS, AND MODELS, P314
  • [3] American National Standards Institute, 1997, ANSI/ASA S3.5-1997 (R2020)
  • [4] Andersen AH, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P2563
  • [5] Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech
    Andersen, Asger Heidemann
    de Haan, Jan Mark
    Tan, Zheng-Hua
    Jensen, Jesper
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 1908 - 1920
  • [6] Andersen AH, 2016, INT CONF ACOUST SPEE, P4995, DOI 10.1109/ICASSP.2016.7472628
  • [7] [Anonymous], 2013, INTRO PSYCHOL HEARIN
  • [8] Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners
    Beutelmann, Rainer
    Brand, Thomas
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01) : 331 - 342
  • [9] Revision, extension, and evaluation of a binaural speech intelligibility model
    Beutelmann, Rainer
    Brand, Thomas
    Kollmeier, Birger
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (04) : 2479 - 2497
  • [10] Boldt Jesper B., 2009, 2009 17th European Signal Processing Conference (EUSIPCO 2009), P1849