SPEECH INTELLIGIBILITY ENHANCEMENT BY EQUALIZATION FOR IN-CAR APPLICATIONS

被引:0
作者
Gentet, Enguerrand [1 ,2 ]
David, Bertrand [1 ]
Denjean, Sebastien [2 ]
Richard, Gael [1 ]
Roussarie, Vincent [2 ]
机构
[1] Inst Polytech Paris, Telecom Paris, LTCI, Paris, France
[2] Grp PSA, Chemin Gisy, F-78943 Velizy Villacoublay, France
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
关键词
near-end listening enhancement; speech intelligibility index; sentence recognition in noise; NOISE; ENVIRONMENT; THRESHOLD; INDEX;
D O I
10.1109/icassp40776.2020.9053537
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a speech intelligibility enhancement method for typical in-car applications in noisy environments. While traditional speech enhancement algorithms aim at increasing the Signal to Noise Ratio (SNR), the goal here is to increase intelligibility by applying dedicated voice transformation techniques without changing the original SNR. The proposed method consists in an adaptive equalizer which reallocates the energy of frequency bands to maximize the Speech Intelligibility Index (SII) under the constraint of a fixed perceived loudness. The validation of the algorithm is carried out by means of a perceptual test derived from the Hearing in Noise Test (HINT) using four typical in-car noises of different driving conditions. The results obtained demonstrate the merit of the algorithm for low-frequency noises, that correspond to usual driving conditions, but also show the limit of the algorithm on noises with a spectrum more spread out induced by rain.
引用
收藏
页码:6934 / 6938
页数:5
相关论文
共 23 条
  • [1] [Anonymous], 1997, S351997 ANSI, V19, P90
  • [2] Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests
    Brand, T
    Kollmeier, B
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (06) : 2801 - 2810
  • [3] The nonlinear knapsack problem - algorithms and applications
    Bretthauer, KM
    Shetty, B
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2002, 138 (03) : 459 - 472
  • [4] Brouckxon H., 2008, INTERSPEECH
  • [5] Cooke M., 2013, P INTERSPEECH, P3552
  • [6] Analysis of speech-based speech transmission index methods with implications for nonlinear operations
    Goldsworthy, RL
    Greenberg, JE
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (06) : 3679 - 3689
  • [7] Coherence and the speech intelligibility index
    Kates, JM
    Arehart, KH
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (04) : 2224 - 2237
  • [8] Improving Speech Intelligibility in Noise Using Environment-Optimized Algorithms
    Kim, Gibak
    Loizou, Philipos C.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (08): : 2080 - 2090
  • [9] McLoughlin IV, 1997, DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, P591, DOI 10.1109/ICDSP.1997.628419
  • [10] Speech intelligibility improvement in car noise environment by voice transformation
    Nathwani, Karan
    Richard, Gael
    David, Bertrand
    Prablanc, Pierre
    Roussarie, Vincent
    [J]. SPEECH COMMUNICATION, 2017, 91 : 17 - 27