Improved sound source localization in horizontal plane for binaural robot audition

Cited by: 24
Authors
Kim, Ui-Hyun [1]
Nakadai, Kazuhiro [2]
Okuno, Hiroshi G. [1]
Affiliations
[1] Kyoto Univ, Grad Sch Informat, Dept Intelligence Sci & Technol, Kyoto, Japan
[2] Honda Res Inst Japan Co Ltd, Wako, Saitama, Japan
Funding
Japan Society for the Promotion of Science (JSPS)
Keywords
Intelligent robot audition; Human-robot interaction; Sound source localization; Front-back disambiguation; Front-back confusion; Time delay; Resolution
DOI
10.1007/s10489-014-0544-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An improved sound source localization (SSL) method has been developed that is based on the generalized cross-correlation (GCC) method weighted by the phase transform (PHAT) for use with binaural robots equipped with two microphones inside artificial pinnae. The conventional SSL method based on the GCC-PHAT method has two main problems when used on a binaural robot platform: 1) diffraction of sound waves with multipath interference caused by the contours of the robot head, which affects localization accuracy, and 2) front-back ambiguity, which limits the localization range to half the horizontal space. The diffraction problem was overcome by incorporating a new time delay factor into the GCC-PHAT method under the assumption of a spherical robot head. The ambiguity problem was overcome by utilizing the amplification effect of the pinnae for localization over the entire azimuth. Experiments conducted using two dummy heads equipped with small or large pinnae showed that localization errors were reduced by 8.91° (3.21° vs. 12.12°) on average with the new time delay factor compared with the conventional GCC-PHAT method and that the success rate for front-back disambiguation using the pinnae amplification effect was 29.76% (93.46% vs. 72.02%) better on average over the entire azimuth than with a conventional head-related transfer function (HRTF)-based method.
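For orientation, the conventional GCC-PHAT baseline named in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it does not include the paper's new time delay factor for a spherical head or the pinnae-based front-back disambiguation, neither of which is specified in this record. The function names, the naive DFT (used only to keep the example dependency-free; a real system would use an FFT library), the free-field far-field delay model, and the values for sampling rate, microphone spacing, and speed of sound are all assumptions made for the example.

```python
import cmath
import math
import random


def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform; adequate for short demo frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def gcc_phat_delay(left, right, max_lag):
    """Return the lag in samples by which `right` trails `left` (GCC-PHAT peak)."""
    n = len(left) + len(right)  # zero-pad so the circular correlation does not wrap
    spec_l = dft(list(left) + [0.0] * (n - len(left)))
    spec_r = dft(list(right) + [0.0] * (n - len(right)))
    whitened = []
    for a, b in zip(spec_l, spec_r):
        cross = b * a.conjugate()          # cross-power spectrum
        mag = abs(cross)
        # PHAT weighting: keep only the phase of each frequency bin
        whitened.append(cross / mag if mag > 1e-12 else 0j)
    cc = [v.real for v in dft(whitened, inverse=True)]
    # Search only physically plausible lags; negative lags index from the end
    return max(range(-max_lag, max_lag + 1), key=lambda lag: cc[lag % n])


def azimuth_from_delay(lag, fs, mic_distance, c=343.0):
    """Free-field, far-field model (assumed here): delay = d * sin(theta) / c."""
    s = c * (lag / fs) / mic_distance
    s = max(-1.0, min(1.0, s))  # clamp against rounding before asin
    return math.degrees(math.asin(s))


# Synthetic check: the same noise burst reaches the right channel 3 samples later.
rng = random.Random(0)
burst = [rng.uniform(-1.0, 1.0) for _ in range(24)]
left = [0.0] * 4 + burst + [0.0] * 4
right = [0.0] * 7 + burst + [0.0] * 1
lag = gcc_phat_delay(left, right, max_lag=4)
azimuth = azimuth_from_delay(lag, fs=16000, mic_distance=0.18)  # hypothetical setup
print(lag, round(azimuth, 1))
```

Because `asin` only returns angles between -90° and 90°, a source at azimuth θ and one at 180° − θ produce the same delay in this model, which is the front-back ambiguity the abstract describes. The free-field delay model also ignores diffraction around the head, which is the accuracy problem the abstract attributes to the conventional method.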
Pages: 63-74
Page count: 12
Related papers
50 in total
  • [41] BINAURAL AUDITION IN NON-STATIONARY DIFFUSE SOUND FIELDS
    DANILENK.L
    KYBERNETIK, 1969, 6 (02): : 50 - &
  • [42] Effects of reverberation on sound source localization using binaural spectral cues
    Benton, S
    Spanias, A
    Proceedings of the 23rd IASTED International Conference on Modelling, Identification, and Control, 2004, : 547 - 552
  • [43] Toward learning robust contrastive embeddings for binaural sound source localization
    Tang, Duowei
    Taseska, Maja
    van Waterschoot, Toon
    FRONTIERS IN NEUROINFORMATICS, 2022, 16
  • [44] A BINAURAL SOUND SOURCE LOCALIZATION METHOD USING AUDITIVE CUES AND VISION
    Youssef, Karim
    Argentieri, Sylvain
    Zarader, Jean-Luc
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 217 - 220
  • [45] 2D SOUND-SOURCE LOCALIZATION ON THE BINAURAL MANIFOLD
    Deleforge, Antoine
    Horaud, Radu
    2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2012,
  • [46] Sound Source Localization Method Based on Binaural Model Paper Title
    Liu Guanqun
    Zhang Rubo
    Wu Junwei
    2013 CHINESE AUTOMATION CONGRESS (CAC), 2013, : 538 - 541
  • [47] Multiple Sound Source Position Estimation by Drone Audition Based on Data Association Between Sound Source Localization and Identification
    Wakabayashi, Mizuho
    Okuno, Hiroshi G.
    Kumon, Makoto
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) : 782 - 789
  • [48] Sound localization under conditions of covered ears on the horizontal plane
    Takimoto, Madoka
    Nishino, Takanori
    Itou, Katunobu
    Takeda, Kazuya
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (05) : 335 - 342
  • [49] IMPROVED BINAURAL SOUND REPRODUCTION
    WALLACE, RL
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 57 : S26 - S26
  • [50] Bio-inspired Sound Source Localization Compensated for Sound Diffraction by Binaural Head and Torso
    Shimoyama, Ryuichi
    2012 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND CYBERNETICS (CYBERNETICSCOM), 2012, : 79 - 82