Performance of speaker localization using microphone array

被引:4
|
作者
Visalakshi, R. [1 ]
Dhanalakshmi, P. [1 ]
Palanivel, S. [1 ]
机构
[1] Annamalai Univ, Dept CSE, Chidambaram, India
关键词
Speaker localization; Microphone array; Time difference of arrival; Gauss-Newton nonlinear least square method; Genetic algorithm and group search optimization algorithm;
D O I
10.1007/s10772-016-9341-9
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speaker localization is a technique to locate and track an active speaker from multiple acoustic sources using microphone array. Microphone array is used to improve the speech quality of recorded speech signal in meeting room and other places. In this work, the time delay estimation between source and each microphone is calculated using a localization method called time differences of arrival (TDOA). TDOA localization consists of two steps namely (a) a time delay estimator and (b) a localization estimator. For time delay estimation, the generalized cross-correlation using phase transform, the generalized cross correlation using maximum likelihood, linear prediction (LP) residual and the Hilbert envelope of the LP residual are chosen for estimating the location of a person. A new speaker localization algorithm known as group search optimization (GSO) algorithm is proposed. The performance of this algorithm is analyzed and compared with Gauss-Newton nonlinear least square method and genetic algorithm. Experimental results show that the proposed GSO method outperforms the other methods in terms of mean square error, root mean square error, mean absolute error, mean absolute percentage error, euclidean distance and mean absolute relative error.
引用
收藏
页码:467 / 483
页数:17
相关论文
共 50 条
  • [1] Speaker localization using microphone array in a reverberant room
    Zou, QY
    Rahardja, S
    Cai, ZB
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 354 - 357
  • [2] Speech recognition in cars by speaker localization using microphone array
    Kondo, Keisuke
    Nagai, Takayuki
    Kaneko, Masahide
    Kurematsu, Akira
    Systems and Computers in Japan, 2003, 34 (08) : 1 - 12
  • [3] Joint Identification and Localization of a Speaker in Adverse Conditions Using a Microphone Array
    Salvati, Daniele
    Drioli, Carlo
    Foresti, Gian Luca
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 21 - 25
  • [4] Robust speech recognition with speaker localization by a microphone array
    Yamada, T
    Nakamura, S
    Shikano, K
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1317 - 1320
  • [5] Microphone Array for Speaker Localization and Identification in Shared Autonomous Vehicles
    Marques, Ivo
    Sousa, Joao
    Sa, Bruno
    Costa, Diogo
    Sousa, Pedro
    Pereira, Samuel
    Santos, Afonso
    Lima, Carlos
    Hammerschmidt, Niklas
    Pinto, Sandro
    Gomes, Tiago
    ELECTRONICS, 2022, 11 (05)
  • [6] Visually Supervised Speaker Detection and Localization via Microphone Array
    Berghi, Davide
    Hilton, Adrian
    Jackson, Philip J. B.
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [7] AUDIO INPUTS FOR ACTIVE SPEAKER DETECTION AND LOCALIZATION VIA MICROPHONE ARRAY
    Berghi, Davide
    Jackson, Philip J. B.
    2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,
  • [8] Speaker tracking and identifying based on indoor localization system and microphone array
    Chen, Xiaojie
    Shi, Yuanchun
    Jiang, Wenfeng
    21ST INTERNATIONAL CONFERENCE ON ADVANCED NETWORKING AND APPLICATIONS WORKSHOPS/SYMPOSIA, VOL 2, PROCEEDINGS, 2007, : 347 - +
  • [9] SELF-CALIBRATION OF FLEXIBLE MICROPHONE ARRAY FOR SPEAKER LOCALIZATION IN MEETING CONVERSATIONS USING EMITTERS
    Nakamura, Keisuke
    Gomez, Randy
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 311 - 315
  • [10] A framework for speaker tracking using microphone array and camera
    Chen, JF
    Jiang, LJ
    Ser, W
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1384 - 1387