Acoustic Source Localization and Tracking of a Time-Varying Number of Speakers

Cited by: 51
Authors
Fallon, Maurice F. [1]
Godsill, Simon J. [2]
Affiliations
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] Univ Cambridge, Dept Engn, Signal Proc & Commun Lab, Cambridge CB2 1PZ, England
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2012 / Vol. 20 / No. 04
Keywords
Acoustic source location; multi-target tracking; particle filtering; sequential estimation; tracking filters;
DOI
10.1109/TASL.2011.2178402
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Particle filter-based acoustic source tracking algorithms track, online and in real time, the position of a sound source (a person speaking in a room) based on the current data from a distributed microphone array as well as previously recorded data. This paper develops a multi-target tracking (MTT) methodology that allows for an unknown and time-varying number of speakers in a fully probabilistic manner and, in doing so, does not resort to independent modules for new target proposal or target number estimation as in previous works. The approach uses the concept of an existence grid to propose possible regions of activity before tracking is carried out with a variable-dimension particle filter, which also explicitly supports the concept of a null particle, containing no target states, when no speakers are active. Examples demonstrate typical tracking performance in a number of different scenarios with simultaneously active speech sources.
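The idea of a variable-dimension particle set with a null particle can be illustrated with a minimal sketch. The representation here is an assumption made for illustration only (a particle as a tuple of 2-D positions, the empty tuple as the null particle, and the hypothetical helper `posterior_target_count`); it is not the paper's implementation. It shows how the number of active speakers can be read directly from the weighted particles rather than from a separate estimation module.

```python
from collections import Counter

# Assumed representation: each particle is a tuple of (x, y) target positions.
# The empty tuple () plays the role of the "null particle" (no active speaker).

def posterior_target_count(particles, weights):
    """Approximate P(number of targets = k) from weighted particles."""
    total = sum(weights)
    probs = Counter()
    for particle, weight in zip(particles, weights):
        # The particle's dimension (number of target states) is its hypothesis
        # about how many speakers are active.
        probs[len(particle)] += weight / total
    return dict(probs)

# Illustrative, made-up particle set: one null particle, two single-target
# particles and one two-target particle.
particles = [
    (),
    ((1.0, 2.0),),
    ((1.1, 2.1),),
    ((1.0, 2.0), (3.0, 0.5)),
]
weights = [0.1, 0.4, 0.3, 0.2]

count_probs = posterior_target_count(particles, weights)
# Most probable number of active speakers under this particle set.
map_count = max(count_probs, key=count_probs.get)
```

With these made-up weights, the single-speaker hypothesis carries most of the probability mass, while the null particle retains some weight for the case that nobody is speaking.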
Pages: 1409-1415
Page count: 7