Robotic Binaural Localization and Separation of Multiple Simultaneous Sound Sources

被引:3
作者
Keyrouz, Fakheredine [1 ]
机构
[1] Notre Dame Univ, Zouk Mosbeh, Lebanon
来源
2017 11TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC) | 2017年
关键词
Robotic hearing; sound localization; sound separation; competitive learning; Bayesian fusion; HRTFs; BLIND SOURCE SEPARATION; CONVOLUTIVE MIXTURES; MAP;
D O I
10.1109/ICSC.2017.18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of intelligent localization and separation of multiple concurrent sound sources using only two microphones, and without usage of a visual channel, is a challenging one. It has been solved with little success today. Most of the approaches that have been presented on this subject use complex microphone arrays to localize sound in three dimensions. Psycho-physical listening tests show that humans can locate up to six sound sources simultaneously. This localization occurs not only in azimuth but also in elevation and to a certain extent in distance. In addition, humans are able to identify these sources and to separate and process the information that they provide, including the tone, pitch, and semantics. They do not use any sensor array, but only two ears. In this study, we tackle the challenging task of robotic sound source localization and separation of more than two dynamic sources using only two microphones inserted into the ear canals of two artificial ears of a humanoid head. Exploiting specific properties of the sound signals and using self-splitting competitive learning combined with Bayesian fusion, we achieved promising results compared to state-of-the-art techniques in terms of localization accuracy and separation efficiency.
引用
收藏
页码:188 / 195
页数:8
相关论文
共 20 条
[1]  
[Anonymous], 2011 2 C CIRC POW CO
[2]   EFFECT OF MULTIPLE SPEECH-LIKE MASKERS ON BINAURAL SPEECH RECOGNITION IN NORMAL AND IMPAIRED HEARING [J].
BRONKHORST, AW ;
PLOMP, R .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 92 (06) :3132-3139
[3]   A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics [J].
Buchner, H ;
Aichner, R ;
Kellermann, W .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (01) :120-134
[4]  
Carlile S, 2016, TRENDS HEAR, V20, DOI [10.1177/2331216516614254, 10.1177/2331216516644254]
[5]  
Chen K, 2016, INT CONF ACOUST SPEE, P316, DOI 10.1109/ICASSP.2016.7471688
[6]  
Clayton K., 2016, PLOS ONE, V11, P12
[7]  
Imran M, 2016, INT BHURBAN C APPL S, P586, DOI 10.1109/IBCAST.2016.7429936
[8]  
Keyrouz F, 2015, 2015 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), P89, DOI 10.1109/MFI.2015.7295751
[9]   Advanced Binaural Sound Localization in 3-D for Humanoid Robots [J].
Keyrouz, Fakheredine .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2014, 63 (09) :2098-2107
[10]  
Liu J, 2007, LECT NOTES COMPUT SC, V4426, P704