On the use of Band Importance Weighting in the Short-Time Objective Intelligibility Measure

被引:0
作者
Andersen, Asger Heidemann [1 ,2 ]
de Haan, Jan Mark [2 ]
Tan, Zheng-Hua [1 ]
Jensen, Jesper [1 ,2 ]
机构
[1] Aalborg Univ, Dept Elect Syst, DK-9220 Aalborg, Denmark
[2] Oticon AS, DK-2765 Smorum, Denmark
来源
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年
关键词
band importance function; speech intelligibility prediction; enhanced speech; speech in noise; SPEECH RECEPTION THRESHOLD; FREQUENCY-IMPORTANCE; NORMAL-HEARING; NOISE; PREDICTION; INDEX;
D O I
10.21437/Interspeech.2017-1043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech intelligibility prediction methods are popular tools within the speech processing community for objective evaluation of speech intelligibility of e.g. enhanced speech. The Short-Time Objective Intelligibility (STOI) measure has become highly used due to its simplicity and high prediction accuracy. In this paper we investigate the use of Band Importance Functions (BIFs) in the STOI measure, i.e. of unequally weighting the contribution of speech information from each frequency band. We do so by fitting BIFs to several datasets of measured intelligibility, and cross evaluating the prediction performance. Our findings indicate that it is possible to improve prediction performance in specific situations. However, it has not been possible to find BIFs which systematically improve prediction performance beyond the data used for fitting. In other words, we find no evidence that the performance of the STOI measure can be improved considerably by extending it with a non-uniform BIF.
引用
收藏
页码:2963 / 2967
页数:5
相关论文
共 22 条
  • [1] Andersen AH, 2017, INT CONF ACOUST SPEE, P5085, DOI 10.1109/ICASSP.2017.7953125
  • [2] Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech
    Andersen, Asger Heidemann
    de Haan, Jan Mark
    Tan, Zheng-Hua
    Jensen, Jesper
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 1908 - 1920
  • [3] [Anonymous], 1997, NON TRADITIONAL REF
  • [4] Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners
    Beutelmann, Rainer
    Brand, Thomas
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01) : 331 - 342
  • [5] Revision, extension, and evaluation of a binaural speech intelligibility model
    Beutelmann, Rainer
    Brand, Thomas
    Kollmeier, Birger
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (04) : 2479 - 2497
  • [6] DARPA, TIMIT ACOUSTIC PHONE
  • [7] Objective Quality and Intelligibility Prediction for Users of Assistive Listening Devices
    Falk, Tiago H.
    Parsa, Vijay
    Santos, Joao F.
    Arehart, Kathryn
    Hazrati, Oldooz
    Huber, Rainer
    Kates, James M.
    Scollie, Susan
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (02) : 114 - 124
  • [8] FACTORS GOVERNING THE INTELLIGIBILITY OF SPEECH SOUNDS
    FRENCH, NR
    STEINBERG, JC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1947, 19 (01) : 90 - 119
  • [9] An Algorithm for Predicting the Intelligibility of Speech Masked by Modulated Noise Maskers
    Jensen, Jesper
    Taal, Cees H.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2009 - 2022
  • [10] Speech Intelligibility Evaluation for Mobile Phones
    Jorgensen, Soren
    Cubick, Jens
    Dau, Torsten
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2015, 101 (05) : 1016 - 1025