Speech Enhancement Algorithm Based on Sound Source Localization and Scene Matching for Binaural Digital Hearing Aids

被引:9
作者
Li, Ruwei [1 ]
Pan, Dongmei [1 ]
Zhang, Shuang [1 ]
机构
[1] Beijing Univ Technol, Coll Informat & Commun, Beijing 100124, Peoples R China
基金
中国国家自然科学基金;
关键词
Speech enhancement; Binaural sound source localization; Head-related transfer function (HRTF); Scene matching; RECOGNITION;
D O I
10.1007/s40846-018-0412-z
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
At present, the speech enhancement algorithm in binaural digital hearing aids is mainly based on adaptive beamforming algorithm. This algorithm strongly depends on the environment. And the enhancement performance is not satisfactory, which makes it difficult for hearing loss people to get high intelligibility and comfort speech. To solve this problem, a binaural speech enhancement algorithm based on sound source localization and scene matching is proposed in this paper. First, the spatial information of sound source is extracted by the sound source localization algorithm based on head-related transfer function and Gaussian mixture model classifier with Gammatone filter decomposition. Second, noise in different directions from the speaker is removed by spatial filter. Third, the type of noise in the same direction from the speaker is recognized by the scene recognition algorithm based on multi-feature and weighted minimum distance classifier. Finally, according to the type of noise, the optimal speech enhancement method is chosen to remove this noise. Experiment results show that the proposed algorithm has better robustness, better speech enhancement performance and lower complexity than the contrast algorithm.
引用
收藏
页码:403 / 417
页数:15
相关论文
共 27 条
[1]  
Ayllón D, 2016, INT CONF ACOUST SPEE, P6515, DOI 10.1109/ICASSP.2016.7472932
[2]   Reduced-Bandwidth and Distributed MWF-Based Noise Reduction Algorithms for Binaural Hearing Aids [J].
Doclo, Simon ;
Moonen, Marc ;
Van den Bogaert, Tim ;
Wouters, Jan .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (01) :38-51
[3]  
Fang Y, 2015, 2015 8TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), P1261, DOI 10.1109/CISP.2015.7408075
[4]  
Farmani M, 2015, INT CONF ACOUST SPEE, P16, DOI 10.1109/ICASSP.2015.7177923
[5]   Acoustic Recognition of Multiple Bird Species Based on Penalized Maximum Likelihood [J].
Jancovic, Peter ;
Kokuer, Munevver .
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (10) :1585-1589
[6]  
Kabiri C, 2013, INT WIREL COMMUN, P785, DOI 10.1109/IWCMC.2013.6583657
[7]   Robotic Binaural Localization and Separation of Multiple Simultaneous Sound Sources [J].
Keyrouz, Fakheredine .
2017 11TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2017, :188-195
[8]   Advanced Binaural Sound Localization in 3-D for Humanoid Robots [J].
Keyrouz, Fakheredine .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2014, 63 (09) :2098-2107
[9]   Adaptive environment classification system for hearing aids [J].
Lamarche, Luc ;
Giguere, Christian ;
Gueaieb, Wail ;
Aboulnasr, Tyseer ;
Othman, Hisham .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (05) :3124-3135
[10]   Automatic Scene Recognition for Digital Camera by Semantic Features [J].
Li, Jiming ;
Qian, Yunta .
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1 AND 2, 2008, :327-332