Improved sound source localization in horizontal plane for binaural robot audition

Cited by: 24
Authors
Kim, Ui-Hyun [1]
Nakadai, Kazuhiro [2]
Okuno, Hiroshi G. [1]
Affiliations
[1] Kyoto Univ, Grad Sch Informat, Dept Intelligence Sci & Technol, Kyoto, Japan
[2] Honda Res Inst Japan Co Ltd, Wako, Saitama, Japan
Funding
Japan Society for the Promotion of Science (JSPS)
Keywords
Intelligent robot audition; Human-robot interaction; Sound source localization; Front-back disambiguation; FRONT-BACK CONFUSION; TIME-DELAY; RESOLUTION;
DOI
10.1007/s10489-014-0544-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
An improved sound source localization (SSL) method has been developed that is based on the generalized cross-correlation (GCC) method weighted by the phase transform (PHAT) for use with binaural robots equipped with two microphones inside artificial pinnae. The conventional SSL method based on the GCC-PHAT method has two main problems when used on a binaural robot platform: 1) diffraction of sound waves with multipath interference caused by the contours of the robot head, which affects localization accuracy, and 2) front-back ambiguity, which limits the localization range to half the horizontal space. The diffraction problem was overcome by incorporating a new time delay factor into the GCC-PHAT method under the assumption of a spherical robot head. The ambiguity problem was overcome by utilizing the amplification effect of the pinnae for localization over the entire azimuth. Experiments conducted using two dummy heads equipped with small or large pinnae showed that localization errors were reduced by 8.91° (3.21° vs. 12.12°) on average with the new time delay factor compared with the conventional GCC-PHAT method and that the success rate for front-back disambiguation using the pinnae amplification effect was 29.76% (93.46% vs. 72.02%) better on average over the entire azimuth than with a conventional head related transfer function (HRTF)-based method.
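For orientation, the conventional GCC-PHAT baseline that the abstract compares against can be sketched as follows. This is a minimal illustration only, not the authors' method: it does not include the paper's new time delay factor or the pinnae-based front-back disambiguation. The function names `gcc_phat` and `tdoa_to_azimuth`, the microphone spacing of 0.18 m, the sound speed of 343 m/s, and the 16 kHz sampling rate are all assumptions chosen for the example.

```python
import numpy as np

def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay (seconds) of `sig` relative to `ref` with GCC-PHAT."""
    n = len(sig) + len(ref)               # zero-pad to avoid circular wrap-around
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)
    cross = SIG * np.conj(REF)            # cross-power spectrum
    cross /= np.abs(cross) + 1e-12        # PHAT weighting: keep phase, drop magnitude
    cc = np.fft.irfft(cross, n=n)         # generalized cross-correlation
    max_shift = n // 2
    if max_tau is not None:
        # Limit the search to physically possible delays (at least one sample).
        max_shift = max(1, min(int(fs * max_tau), max_shift))
    # Reorder so that index 0 corresponds to a lag of -max_shift samples.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / fs

def tdoa_to_azimuth(tau, mic_distance, c=343.0):
    """Free-field, far-field conversion of a delay to an azimuth in degrees.

    Assumption: two microphones in open space (no head in between).
    arcsin only returns angles in [-90, 90], so front and back sources
    with the same delay cannot be told apart.
    """
    ratio = np.clip(c * tau / mic_distance, -1.0, 1.0)
    return float(np.degrees(np.arcsin(ratio)))

# Hypothetical demo: white noise that reaches one microphone 5 samples late.
fs = 16000                                # assumed sampling rate (Hz)
d = 0.18                                  # assumed microphone spacing (m)
rng = np.random.default_rng(0)
noise = rng.standard_normal(1024)
ref = np.concatenate((noise, np.zeros(5)))
sig = np.concatenate((np.zeros(5), noise))

tau = gcc_phat(sig, ref, fs, max_tau=d / 343.0)
print("estimated delay (samples):", tau * fs)
print("estimated azimuth (deg):", tdoa_to_azimuth(tau, d))
```

The free-field conversion in `tdoa_to_azimuth` ignores diffraction around the head, and its `arcsin` range covers only half of the horizontal plane. These are the two limitations of the conventional approach that the abstract names.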
Pages: 63-74
Number of pages: 12
Related papers
50 in total
  • [31] Binaural sound source localization based on weighted template matching
    Liu, Hong
    Sun, Yongheng
    Yang, Ge
    Chen, Yang
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2021, 6 (02) : 214 - 223
  • [32] Real-time binaural azimuthal sound source localization
    Ponca, M
    Scarbata, G
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 350 - 355
  • [33] Drone Audition: Sound Source Localization Using On-Board Microphones
    Manamperi, Wageesha
    Abhayapala, Thushara D.
    Zhang, Jihui
    Samarasinghe, Prasanga N.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 508 - 519
  • [34] Localization of Moving Microphone Arrays from Moving Sound Sources for Robot Audition
    Evers, Christine
    Moore, Alastair H.
    Naylor, Patrick A.
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1008 - 1012
  • [35] NEUROPHYSIOLOGY OF AUDITION IN BATS - DIRECTIONAL LOCALIZATION AND BINAURAL INTERACTION
    GRINNELL, AD
    JOURNAL OF PHYSIOLOGY-LONDON, 1963, 167 (01): : 97 - &
  • [36] High performance sound source separation adaptable to environmental changes for robot audition
    Nakajima, Hirofumi
    Nakadai, Kazuhiro
    Hasegawa, Yuuji
    Tsujino, Hiroshi
    2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, : 2165 - 2171
  • [37] A horizontal sound localization compensation algorithm for rendering binaural signals in noisy environment
    Chen D.
    Yao D.
    Zhao W.
    Zhao X.
    Jiang T.
    Li J.
    Shengxue Xuebao/Acta Acustica, 2024, 49 (03): : 611 - 619
  • [38] Sound Source Localization for Robot Auditory Systems
    Cho, Youngkyu
    Yook, Dongsuk
    Chang, Sukmoon
    Kim, Hyunsoo
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) : 1663 - 1668
  • [39] Sound localization and binaural mechanisms
    Blauert, J
    COMPUTATIONAL MODELS OF AUDITORY FUNCTION, 2001, 312 : 79 - 81
  • [40] The effect of blindness on horizontal plane sound source identification
    Abel, SM
    Figueiredo, JC
    Consoli, A
    Bir, CM
    Papsin, BC
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2002, 41 (05) : 285 - 292