Outdoor Auditory Scene Analysis Using a Moving Microphone Array Embedded in a Quadrocopter

被引:0
作者
Okutani, Keita [1 ]
Yoshida, Takami [1 ]
Nakamura, Keisuke [1 ,2 ]
Nakadai, Kazuhiro [2 ]
机构
[1] Tokyo Inst Technol, Grad Sch Informat Sci & Engn, Meguro Ku, 2-12-1 O Okayama, Tokyo 1528552, Japan
[2] Honda Res Inst Japan Co Ltd, Wako, Saitama 3510114, Japan
来源
2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2012年
关键词
LOCATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses auditory scene analysis, especially, sound source localization using an aerial vehicle with a microphone array in an outdoor environment. Since such a vehicle is able to search sound sources quickly and widely, it is useful to detect outdoor sound sources, for instance, to find distressed people in a disaster situation. In such an environment, noise is quite loud and dynamically-changing, and conventional microphone array techniques studied in the field of indoor robot audition are of less use. We, thus, proposed MUltiple SIgnal Classification based on incremental Generalized EigenValue Decomposition (iGEVD-MUSIC). It can deal with high power noise by introducing a noise correlation matrix and GEVD even when the signal-to-noise ratio is less than 0 dB. In addition, the noise correlation matrix is incrementally estimated to adapt to dynamic changes in noise. We developed a prototype system for auditory scene analysis based on the proposed method using the Parrot AR.Drone with a microphone array and a Kinect device. Experimental results using the prototype system showed that dynamically-changing noise is properly suppressed with the proposed method and multiple human voice sources are able to be localized even when the AR.Drone is moving in an outdoor environment.
引用
收藏
页码:3290 / 3295
页数:6
相关论文
共 17 条
  • [1] Burg J., 1967, SOCIETY
  • [2] de Bree H. E., 2009, SAE TECHNICAL PAPER
  • [3] Golub G. H., 1996, MATRIX COMPUTATIONS
  • [4] Hara I., 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566), P2404
  • [5] Ince G., 2011, P IEEE INT C ROBOTIC, P3517, DOI DOI 10.1109/ICRA.2011.5979578
  • [6] Kaushik Balakrishnan, 2005, 11 AIAA CEAS AER C M, V2997
  • [7] Design and Implementation of Robot Audition System 'HARK' - Open Source Software for Listening to Three Simultaneous Speakers
    Nakadai, Kazuhiro
    Takahashi, Toru
    Okuno, Hiroshi G.
    Nakajima, Hirofumi
    Hasegawa, Yuji
    Tsujino, Hiroshi
    [J]. ADVANCED ROBOTICS, 2010, 24 (5-6) : 739 - 761
  • [8] Nakamura K, 2011, IEEE INT C INT ROBOT, P143, DOI 10.1109/IROS.2011.6048166
  • [9] Intelligent Sound Source Localization for Dynamic Environments
    Nakamura, Keisuke
    Nakadai, Kazuhiro
    Asano, Futoshi
    Hasegawa, Yuji
    Tsujino, Hiroshi
    [J]. 2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 664 - 669
  • [10] PERFORMANCE ANALYSIS OF ROOT-MUSIC
    RAO, BD
    HARI, KVS
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12): : 1939 - 1949