Joint Identification and Localization of a Speaker in Adverse Conditions Using a Microphone Array

Cited by: 0
Authors
Salvati, Daniele [1]
Drioli, Carlo [1]
Foresti, Gian Luca [1]
Affiliations
[1] Univ Udine, Dept Math Comp Sci & Phys, Udine, Italy
Source
2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2018
Keywords
Acoustic source localization; speaker identification; beamforming; diagonal unloading; microphone array; SOUND SOURCE; MVDR BEAMFORMER; PERFORMANCE; MWF;
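DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification codes
0808; 0809;
Abstract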
We discuss a joint identification and localization microphone array system based on diagonal unloading (DU) beamforming, which was recently introduced for acoustic source localization. First, we propose a version of the DU beamformer for the signal enhancement problem. Then, we propose an enhanced DU steered response power (SRP), in which the first estimate of the source position is further refined with the information gathered from the speaker recognition module. The enhanced SRP-DU is obtained by weighting the frequency components according to the spectral characteristics of the speaker. The approach does not add significant computational load to the array processing. Experiments conducted in noisy and reverberant conditions show that the DU beamformer provides better speaker recognition performance than the conventional beamformer, since it reduces the deleterious effects of spatially white noise and point-source interference. Simulations also show that speaker identification can improve localization accuracy, which makes the approach of interest for applications and systems that rely on integrated localization and speaker identification.
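The frequency-weighting idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a far-field source, a uniform linear array, and a DU pseudo-power of the form 1 / (a^H (tr(Φ) I − Φ) a), which this record does not state. The function names, the array geometry, and the per-band weights (standing in for a speaker's spectral profile) are all hypothetical.

```python
import numpy as np

def steering_vector(freq_hz, angle_rad, mic_x, c=343.0):
    """Far-field steering vector for a linear array with mic positions mic_x (m)."""
    delays = mic_x * np.sin(angle_rad) / c
    return np.exp(-2j * np.pi * freq_hz * delays)

def du_power(cov, a, eps=1e-12):
    """ASSUMED DU pseudo-power: 1 / (a^H (tr(cov) I - cov) a)."""
    unloaded = np.trace(cov).real * np.eye(cov.shape[0]) - cov
    return 1.0 / (np.real(a.conj() @ unloaded @ a) + eps)

def weighted_srp(covs, freqs, angles, mic_x, weights, power_fn=du_power):
    """Sum per-frequency steered powers, each scaled by a per-frequency weight."""
    srp = np.zeros(len(angles))
    for cov, f, w in zip(covs, freqs, weights):
        p = np.array([power_fn(cov, steering_vector(f, th, mic_x)) for th in angles])
        srp += w * p / p.max()  # normalise each band before weighting
    return srp

# Toy scene (all values hypothetical): 8 mics, 5 cm spacing, source at 20 degrees.
mic_x = np.arange(8) * 0.05
freqs = np.array([500.0, 1000.0, 1500.0, 2000.0])
weights = np.array([0.1, 0.4, 0.4, 0.1])  # stand-in for speaker spectral weights
true_angle = np.deg2rad(20.0)

# Covariance per band: rank-one source term plus spatially white noise.
covs = []
for f in freqs:
    a0 = steering_vector(f, true_angle, mic_x)
    covs.append(np.outer(a0, a0.conj()) + 0.1 * np.eye(len(mic_x)))

angles = np.deg2rad(np.arange(-90, 91))
srp = weighted_srp(covs, freqs, angles, mic_x, weights)
est_deg = float(np.rad2deg(angles[np.argmax(srp)]))
print(f"estimated direction: {est_deg:.1f} degrees")
```

Passing a different `power_fn`, for example one that returns the real part of a^H Φ a, gives a conventional SRP for comparison under the same weights.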
Pages: 21-25
Page count: 5