Robust talker direction estimation based on weighted CSP analysis and maximum likelihood estimation

Cited by: 18
Authors
Denda, Y [1]
Nishiura, T [1]
Yamashita, Y [1]
Affiliations
[1] Ritsumeikan Univ, Grad Sch Sci & Engn, Kusatsu 5258577, Japan
Keywords
robust talker direction estimation; CSP analysis; CSP coefficient subtraction; ML estimation; microphone array
DOI
10.1093/ietisy/e89-d.3.1050
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
This paper describes a new talker direction estimation method for front-end processing to capture distant-talking speech with a microphone array. The proposed method consists of two algorithms. The first is a TDOA (Time Delay Of Arrival) estimation algorithm based on weighted CSP (Cross-power Spectrum Phase) analysis with an average speech spectrum and CSP coefficient subtraction. The second is a talker direction estimation algorithm based on ML (Maximum Likelihood) estimation over a time sequence of the estimated TDOAs. To evaluate the proposed method, talker direction estimation experiments were carried out in a real office room. The results confirmed that the proposed method outperforms conventional methods in both diffuse- and directional-noise environments.
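For orientation, the TDOA step named in the abstract builds on plain CSP analysis, which is also known as GCC-PHAT. The sketch below shows only that unweighted baseline for one microphone pair, followed by a far-field conversion from delay to angle. It is not the paper's method: the average-speech-spectrum weighting, the CSP coefficient subtraction, and the ML estimation over a TDOA sequence are all omitted. The function names, the 343 m/s sound speed, and the two-microphone far-field geometry are assumptions made for illustration.

```python
import numpy as np

def csp_tdoa(x1, x2, fs, max_delay_s=None):
    """Delay of x2 relative to x1 in seconds, via unweighted CSP (GCC-PHAT)."""
    n = len(x1) + len(x2)                 # zero-pad to avoid circular wrap-around
    X1 = np.fft.rfft(x1, n=n)
    X2 = np.fft.rfft(x2, n=n)
    cross = X2 * np.conj(X1)              # cross-power spectrum
    cross /= np.abs(cross) + 1e-12        # keep phase only (PHAT normalisation)
    csp = np.fft.irfft(cross, n=n)        # CSP coefficients over circular lags
    max_shift = n // 2
    if max_delay_s is not None:
        max_shift = max(1, min(int(fs * max_delay_s), n // 2))
    # Reorder so the array covers lags -max_shift .. +max_shift.
    csp = np.concatenate((csp[-max_shift:], csp[:max_shift + 1]))
    lag = int(np.argmax(csp)) - max_shift
    return lag / fs

def tdoa_to_angle_deg(tau, mic_distance_m, sound_speed=343.0):
    """Far-field direction (degrees from broadside) for a two-microphone pair."""
    s = np.clip(sound_speed * tau / mic_distance_m, -1.0, 1.0)
    return float(np.degrees(np.arcsin(s)))

# Usage: white noise delayed by 5 samples at 16 kHz (synthetic, illustrative data).
rng = np.random.default_rng(0)
x1 = rng.standard_normal(1024)
x2 = np.concatenate((np.zeros(5), x1[:-5]))
tau = csp_tdoa(x1, x2, fs=16000)
angle = tdoa_to_angle_deg(tau, mic_distance_m=0.2)
```

A positive delay here means the sound reached the second microphone later than the first. In a noisy or reverberant room the single highest CSP peak can be unreliable, which is the weakness the weighting, subtraction, and ML steps described in the abstract are intended to address.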
Pages: 1050-1057
Page count: 8