Improved sound source localization in horizontal plane for binaural robot audition

被引:24
|
作者
Kim, Ui-Hyun [1 ]
Nakadai, Kazuhiro [2 ]
Okuno, Hiroshi G. [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Dept Intelligence Sci & Technol, Kyoto, Japan
[2] Honda Res Inst Japan Co Ltd, Wako, Saitama, Japan
基金
日本学术振兴会;
关键词
Intelligent robot audition; Human-robot interaction; Sound source localization; Front-back disambiguation; FRONT-BACK CONFUSION; TIME-DELAY; RESOLUTION;
D O I
10.1007/s10489-014-0544-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An improved sound source localization (SSL) method has been developed that is based on the generalized cross-correlation (GCC) method weighted by the phase transform (PHAT) for use with binaural robots equipped with two microphones inside artificial pinnae. The conventional SSL method based on the GCC-PHAT method has two main problems when used on a binaural robot platform: 1) diffraction of sound waves with multipath interference caused by the contours of the robot head, which affects localization accuracy, and 2) front-back ambiguity, which limits the localization range to half the horizontal space. The diffraction problem was overcome by incorporating a new time delay factor into the GCC-PHAT method under the assumption of a spherical robot head. The ambiguity problem was overcome by utilizing the amplification effect of the pinnae for localization over the entire azimuth. Experiments conducted using two dummy heads equipped with small or large pinnae showed that localization errors were reduced by 8.91A degrees (3.21A degrees vs. 12.12A degrees) on average with the new time delay factor compared with the conventional GCC-PHAT method and that the success rate for front-back disambiguation using the pinnae amplification effect was 29.76 % (93.46 % vs. 72.02 %) better on average over the entire azimuth than with a conventional head related transfer function (HRTF)-based method.
引用
收藏
页码:63 / 74
页数:12
相关论文
共 50 条
  • [21] Binaural Sound Source Localization in Real and Virtual Rooms
    Rychtarikova, Monika
    Van den Bogaert, Tim
    Vermeir, Gerrit
    Wouters, Jan
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2009, 57 (04): : 205 - 220
  • [22] Binaural sound source localization in real and virtual rooms
    Laboratory of Acoustics and Thermal Physics, Katholieke Universiteit Leuven, 3001 Heverlee, Belgium
    不详
    不详
    AES J Audio Eng Soc, 2009, 4 (205-220):
  • [23] The effect of aging on horizontal plane sound localization
    Abel, SM
    Giguère, C
    Consoli, A
    Papsin, BC
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (02): : 743 - 752
  • [24] Sound localization on a horizontal surface: virtual and real sound source localization
    Jonathan Lam
    Bill Kapralos
    Kamen Kanev
    Karen Collins
    Andrew Hogue
    Michael Jenkin
    Virtual Reality, 2015, 19 : 213 - 222
  • [25] Sound localization on a horizontal surface: virtual and real sound source localization
    Lam, Jonathan
    Kapralos, Bill
    Kanev, Kamen
    Collins, Karen
    Hogue, Andrew
    Jenkin, Michael
    VIRTUAL REALITY, 2015, 19 (3-4) : 213 - 222
  • [26] ROBUST FULL-SPHERE BINAURAL SOUND SOURCE LOCALIZATION
    Hammond, Benjamin R.
    Jackson, Philip J. B.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 86 - 90
  • [27] Binaural ambiguity amplifies visual bias in sound source localization
    Zhou, Yi
    Balderas, Leslie
    Venskytis, Emily Jo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 144 (06): : 3118 - 3123
  • [28] Binaural Sound Source Distance Estimation and Localization for a Moving Listener
    Krause, Daniel Aleksander
    Garcia-Barrios, Guillermo
    Politis, Archontis
    Mesaros, Annamaria
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 996 - 1011
  • [29] Binaural Sound Source Localization Based on Convolutional Neural Network
    Zhou, Lin
    Ma, Kangyu
    Wang, Lijie
    Chen, Ying
    Tang, Yibin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 60 (02): : 545 - 557
  • [30] VARIATIONAL EM FOR BINAURAL SOUND-SOURCE SEPARATION AND LOCALIZATION
    Deleforge, Antoine
    Forbes, Florence
    Horaud, Radu
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 76 - 80