Sound source tracking with directivity pattern estimation using a 64ch microphone array

被引:4
作者
Nakadai, K [1 ]
Nakajima, H [1 ]
Yamada, K [1 ]
Hasegawa, Y [1 ]
Nakamura, T [1 ]
Tsujino, H [1 ]
机构
[1] HONDA Res Inst Japan Co Ltd, Wako, Saitama 3510114, Japan
来源
2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, Vols 1-4 | 2005年
关键词
microphone array; weighted delay-and-sum beamforming; directivity pattern estimation; sound source localization; and sound source tracking;
D O I
10.1109/IROS.2005.1544981
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In human-robot communication, a robot should distinguish between voices uttered by a human and those played by a loudspeaker such as on a TV or a radio. This paper addresses detection of actual human voices by using a microphone array as an extension of auditory function of the robot to support environmental understanding by the robot. We introduce a 64ch microphone array system in a room and propose a new method based on weighted delay-and-sum beamforming to estimate a directivity pattern of a sound source. The microphone array system localizes a sound source and estimates its directivity pattern. The directivity pattern estimation has two advantages as follows: One is that the system can detect whether the sound source is an actual human voice or not by comparing the estimated directivity pattern with prerecorded directivity patterns. The other is that the heading of the sound source is estimated by detecting the angle with the highest power in the directivity pattern. As a result, we proved the effectiveness of our microphone array through sound source tracking with orientation and detection of actual human voices based on directivity pattern estimation.
引用
收藏
页码:196 / 202
页数:7
相关论文
共 21 条
[1]  
Aarabi P., 2001, Information Fusion, V2, P209, DOI 10.1016/S1566-2535(01)00035-5
[2]  
BISWAS R, IROS 2004, P1544
[3]   Exploration of pressure field around the human head during speech [J].
Dunn, HK ;
Farnsworth, DW .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1939, 10 (03) :184-199
[4]  
FLANAGAN JL, 1991, ACUSTICA, V73, P58
[5]   AN ALTERNATIVE APPROACH TO LINEARLY CONSTRAINED ADAPTIVE BEAMFORMING [J].
GRIFFITHS, LJ ;
JIM, CW .
IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 1982, 30 (01) :27-34
[6]  
Hall ET, 1966, HIDDEN DIMENSION
[7]  
HARA I, 2004, IROS 2004, P240
[8]  
Hershey J, 2000, ADV NEUR IN, V12, P813
[9]   BLIND SEPARATION OF SOURCES .1. AN ADAPTIVE ALGORITHM BASED ON NEUROMIMETIC ARCHITECTURE [J].
JUTTEN, C ;
HERAULT, J .
SIGNAL PROCESSING, 1991, 24 (01) :1-10
[10]   ADAPTIVE MICROPHONE-ARRAY SYSTEM FOR NOISE-REDUCTION [J].
KANEDA, Y ;
OHGA, J .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (06) :1391-1400