Nonlinear Least Squares Methods for Joint DOA and Pitch Estimation

Cited by: 52
Authors
Jensen, Jesper Rindom [1]
Christensen, Mads Graesboll [1]
Jensen, Soren Holdt [2]
Affiliations
[1] Aalborg Univ, Audio Anal Lab, Dept Architecture Design & Media Technol, DK-9200 Aalborg, Denmark
[2] Aalborg Univ, Dept Elect Syst, DK-9220 Aalborg, Denmark
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013 / Vol. 21 / No. 05
Keywords
Cramer-Rao lower bound; direction-of-arrival estimation; fundamental frequency estimation; joint estimation; nonlinear least squares; FREQUENCY ESTIMATION; TIME-DELAY; FUNDAMENTAL-FREQUENCY; ROBUST; LOCALIZATION; ARRIVAL; ANGLES;
DOI
10.1109/TASL.2013.2239290
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
In this paper, we consider the problem of joint direction-of-arrival (DOA) and fundamental frequency estimation. Joint estimation enables robust estimation of these parameters in multi-source scenarios where separate estimators may fail. First, we derive the exact and asymptotic Cramer-Rao bounds for the joint estimation problem. Then, we propose a nonlinear least squares (NLS) and an approximate NLS (aNLS) estimator for joint DOA and fundamental frequency estimation. The proposed estimators are maximum likelihood estimators when: 1) the noise is white Gaussian, 2) the environment is anechoic, and 3) the source of interest is in the far-field. Otherwise, the methods still approximately yield maximum likelihood estimates. Simulations on synthetic data show that the proposed methods have similar or better performance than state-of-the-art methods for DOA and fundamental frequency estimation. Moreover, simulations on real-life data indicate that the NLS and aNLS methods are applicable even when reverberation is present and the noise is not white Gaussian.
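The record does not reproduce the estimators themselves, so the following is only a minimal sketch of what a joint NLS grid search could look like under the three conditions listed in the abstract (white Gaussian noise, anechoic propagation, far-field source). It additionally assumes a uniform linear array, a complex-valued harmonic signal model, and a known number of harmonics; the function names (`harmonic_matrix`, `projected_energy`, `joint_nls`), the sensor spacing `d`, and all numeric settings are hypothetical and not taken from the paper.

```python
import numpy as np

def harmonic_matrix(f0, theta_deg, L, M, N, fs, d=0.05, c=343.0):
    """Stacked space-time harmonic model for an assumed uniform linear array.

    Rows are ordered sensor by sensor (M blocks of N samples); column l holds
    the l-th harmonic of f0 as seen from direction theta_deg.
    """
    n = np.arange(N)
    m = np.arange(M)
    harmonics = np.arange(1, L + 1)
    # Far-field delay of sensor m relative to sensor 0, in samples.
    delay = fs * m * d * np.sin(np.deg2rad(theta_deg)) / c
    # Effective time index at each (sensor, sample) pair, flattened to a column.
    t = (n[None, :] - delay[:, None]).reshape(-1, 1)
    w0 = 2.0 * np.pi * f0 / fs
    return np.exp(1j * w0 * t * harmonics[None, :])

def projected_energy(x, Z):
    """Energy of x captured by the column space of Z (least-squares fit)."""
    amps, *_ = np.linalg.lstsq(Z, x, rcond=None)
    return float(np.linalg.norm(Z @ amps) ** 2)

def joint_nls(x, L, M, N, fs, f0_grid, theta_grid_deg):
    """Exhaustive search for the (f0, DOA) pair with the smallest LS residual.

    Minimizing the residual is equivalent to maximizing the projected energy.
    """
    best_energy, best_f0, best_theta = -np.inf, None, None
    for f0 in f0_grid:
        for theta in theta_grid_deg:
            Z = harmonic_matrix(f0, theta, L, M, N, fs)
            energy = projected_energy(x, Z)
            if energy > best_energy:
                best_energy, best_f0, best_theta = energy, f0, theta
    return best_f0, best_theta

# Synthetic example (all values are illustrative assumptions).
rng = np.random.default_rng(0)
M, N, L, fs = 4, 60, 3, 8000.0
Z_true = harmonic_matrix(200.0, 20.0, L, M, N, fs)
noise = 0.05 * (rng.standard_normal(M * N) + 1j * rng.standard_normal(M * N))
x = Z_true @ np.ones(L) + noise  # unit-amplitude harmonics plus white noise

f0_hat, theta_hat = joint_nls(
    x, L, M, N, fs,
    f0_grid=np.arange(150.0, 251.0, 5.0),
    theta_grid_deg=np.arange(-90.0, 91.0, 5.0),
)
print(f0_hat, theta_hat)
```

In this sketch the true parameters (200 Hz, 20 degrees) lie on the search grid, so the search is expected to return them at this noise level. An approximate variant in the spirit of the aNLS estimator could replace the least-squares fit with the energy of `Z.conj().T @ x`, which relies on the columns of `Z` being nearly orthogonal for long records; whether that matches the paper's aNLS definition is an assumption.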
Pages: 923-933
Number of pages: 11