Improvement in Outdoor Sound Source Detection Using a Quadrotor-Embedded Microphone Array

Cited by: 0
Authors
Ohata, Takuma [1]
Nakamura, Keisuke
Mizumoto, Takeshi
Taiki, Tezuka [1]
Nakadai, Kazuhiro [1]
Affiliations
[1] Tokyo Inst Technol, Grad Sch Informat Sci & Engn, Meguro Ku, Tokyo 1528552, Japan
Source
2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014) | 2014
Keywords
robot audition; speech detection; sound source localization; sound source separation
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses sound source detection in an outdoor environment using a quadrotor equipped with a microphone array. Because the previously reported method has a high computational cost, we propose a sound source detection algorithm called MUltiple SIgnal Classification based on incremental Generalized Singular Value Decomposition (iGSVD-MUSIC), which detects the location and temporal activity of a sound source at low computational cost. In addition, to mitigate overestimation of the noise correlation matrix used in iGSVD-MUSIC, we propose Correlation Matrix Scaling (CMS), which realizes soft whitening of the noise. A prototype system based on the proposed methods was evaluated with two types of microphone arrays in an outdoor environment. Experimental results showed that the combination of iGSVD-MUSIC and CMS substantially improves sound source detection performance and achieves real-time processing.
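To make the idea of noise-whitened MUSIC concrete, the following is a minimal sketch and not the paper's implementation. It whitens the observed correlation matrix R with a noise correlation matrix K and then applies ordinary eigendecomposition-based MUSIC; it does not implement the incremental GSVD. The abstract does not give the CMS formula, so the `alpha` blend between a scaled identity and K is a hypothetical stand-in for "soft whitening". The uniform linear array, the function names, and all numeric values are likewise assumptions chosen for illustration.

```python
import numpy as np

def steering_vector(angle_rad, n_mics, spacing_m, freq_hz, c=343.0):
    """Far-field steering vector for a uniform linear array (assumed geometry)."""
    positions = np.arange(n_mics) * spacing_m
    delays = positions * np.sin(angle_rad) / c
    return np.exp(-2j * np.pi * freq_hz * delays)

def whitened_music_spectrum(R, K, steering, n_sources, alpha=1.0):
    """MUSIC pseudo-spectrum after whitening with a softened noise matrix.

    alpha=0 reduces to standard MUSIC (no whitening); alpha=1 whitens fully
    with K. The linear blend is a hypothetical stand-in for soft whitening,
    not the CMS formula from the paper.
    """
    n = R.shape[0]
    mean_power = np.trace(K).real / n
    K_soft = (1.0 - alpha) * mean_power * np.eye(n) + alpha * K
    # Hermitian inverse square root of K_soft via its eigendecomposition.
    w, V = np.linalg.eigh(K_soft)
    K_inv_sqrt = (V / np.sqrt(w)) @ V.conj().T
    # Whitened correlation matrix; eigh returns eigenvalues in ascending order.
    Rw = K_inv_sqrt @ R @ K_inv_sqrt.conj().T
    _, evecs = np.linalg.eigh(Rw)
    noise_subspace = evecs[:, : n - n_sources]
    spectrum = []
    for a in steering:
        aw = K_inv_sqrt @ a  # steering vector in the whitened domain
        num = np.vdot(aw, aw).real
        den = np.linalg.norm(noise_subspace.conj().T @ aw) ** 2
        spectrum.append(num / max(den, 1e-12))
    return np.array(spectrum)

# Synthetic check: one target at +20 degrees, one strong directional noise
# source at -40 degrees (both values are made up for this example).
angles_deg = np.arange(-90, 91)
grid = [steering_vector(np.deg2rad(d), 8, 0.04, 2000.0) for d in angles_deg]
a = steering_vector(np.deg2rad(20), 8, 0.04, 2000.0)
b = steering_vector(np.deg2rad(-40), 8, 0.04, 2000.0)
K = np.eye(8) + 10.0 * np.outer(b, b.conj())   # noise correlation matrix
R = np.outer(a, a.conj()) + K                  # observed correlation matrix
spec = whitened_music_spectrum(R, K, grid, n_sources=1, alpha=1.0)
peak_deg = int(angles_deg[np.argmax(spec)])
print("estimated direction (deg):", peak_deg)
```

With full whitening the directional noise is folded into the whitened noise floor, so the pseudo-spectrum peaks at the target rather than at the louder noise source. Lowering `alpha` weakens the whitening, which is the general trade-off that a soft-whitening scheme is meant to control when K is overestimated.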
Pages: 1902-1907
Number of pages: 6