Three ring microphone array for 3D sound localization and separation for mobile robot audition

被引:18
作者
Tamai, Y [1 ]
Sasaki, Y [1 ]
Kagami, S [1 ]
Mizoguchi, H [1 ]
机构
[1] Tokyo Univ Sci, Dept Mech Engn, Noda, Chiba, Japan
来源
2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4 | 2005年
关键词
sound localization; sound separation; Delay and Sum Beamforming; Frequency Band Selection;
D O I
10.1109/IROS.2005.1545095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a three ring microphone array estimates the horizontal/vertical direction and distance of sound sources and separates multiple sound sources for mobile robot audition. Arrangement of microphones is simulated and an optimized pattern which has three rings is implemented with 32 microphones. Sound localization and separation are achieved by Delay and Sum Beam Forming(DSBF) and Frequency Band Selection(FBS). From on-line experiments results of sound horizontal and verticle localization, we confirmed that one or two sounds sources could be localized with an error of about 5 degrees and 200 to 300mm in the case of the distance of about 1m. The off-line experiments of sound separation were evaluated by power spectrums in each frequency of separated sounds, and we confirmed that an appropriate frequency band could be selected by DSBF and FBS. The system can separate 3 different pressure speech sources without drowning out.
引用
收藏
页码:903 / 908
页数:6
相关论文
共 4 条
[1]  
Sawada T., 2003, P 18 M SPEC INT GROU, P27
[2]  
STURIM DE, 1997, P 1997 INT C AC SPEE
[3]  
TAMAI Y, 2004, P 3 IEEE INT C SENS
[4]   Improvement of robot audition by interfacing sound source separation and automatic speech recognition with missing feature theory [J].
Yamamoto, S ;
Nakadai, K ;
Tsujino, H ;
Yokoyama, T ;
Okuno, HG .
2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, :1517-1523