Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering

被引:12
作者
Dong, Huan-Yu [1 ]
Lee, Chang-Myung [1 ,2 ]
机构
[1] Univ Ulsan, Dept Mech & Automot Engn, 93 Daehak Ro, Ulsan 44610, South Korea
[2] Russian Acad Sci, Siberian Brunch, Lavrentyev Inst Hydrodynam, 15 Lavrentyev Ave, Novosibirsk 630090, Russia
基金
俄罗斯科学基金会;
关键词
Speech intelligibility; Speech enhancement; Inverse filtering; Auditory model; Dereverberation; PERCEPTUAL-DISTORTION MEASURE; ROOM ACOUSTICS; SOUND REPRODUCTION; ADDITIVE NOISE; EQUALIZATION; RECOGNITION; SYSTEMS; REGULARIZATION; MULTICHANNEL; DATABASE;
D O I
10.1186/s13636-018-0126-8
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The speech intelligibility of indoor public address systems is degraded by reverberation and background noise. This paper proposes a preprocessing method that combines speech enhancement and inverse filtering to improve the speech intelligibility in such environments. An energy redistribution speech enhancement method was modified for use in reverberation conditions, and an auditory-model-based fast inverse filter was designed to achieve better dereverberation performance. An experiment was performed in various noisy, reverberant environments, and the test results verified the stability and effectiveness of the proposed method. In addition, a listening test was carried out to compare the performance of different algorithms subjectively. The objective and subjective evaluation results reveal that the speech intelligibility is significantly improved by the proposed method.
引用
收藏
页数:13
相关论文
共 41 条
[1]  
[Anonymous], SPEECH COMMUNICATION, V45, P101
[2]  
[Anonymous], DISS ABSTR INT
[3]  
[Anonymous], P SELSE JAN
[4]  
[Anonymous], 2011, 6026816 IEC
[5]   A multichannel and multiple position adaptive room response equalizer in warped domain: Real-time implementation and performance evaluation [J].
Cecchi, S. ;
Romoli, L. ;
Carini, A. ;
Piazza, F. .
APPLIED ACOUSTICS, 2014, 82 :28-37
[6]  
Crespo Joao B., 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P910, DOI 10.1109/ICASSP.2014.6853729
[7]   Multizone Speech Reinforcement [J].
Crespo, Joao B. ;
Hendriks, Richard C. .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (01) :54-66
[8]  
ELLIOTT SJ, 1989, J AUDIO ENG SOC, V37, P899
[9]  
Flanagan JL, 2013, SPEECH ANAL SYNTHESI, V3, P150
[10]  
Fuster L, 2012, EUR SIGNAL PR CONF, P1344