Design of a Real-Time Automatic Source Monitoring Framework Based on Sound Source Localization

Cited by: 0
Authors
Dey, Spandan [1]
Boppu, Srinivas [1]
Manikandan, M. Sabarimalai [1]
Affiliation
[1] Indian Inst Technol, Real Time Embedded Signal Proc Lab, Sch Elect Sci, Jatani 752050, Odisha, India
Source
2019 SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS (ICDIPC 2019) | 2019
Keywords
Sound source localization; GCC-PHAT; Beam-forming; Real-time; Keyword detection;
DOI
10.1109/icdipc.2019.8723684
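Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes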
0808 ; 0809 ;
Abstract
Sound source localization is the technique of precisely determining the position of a sound-emitting source using audio data alone. Its applications include audio surveillance, security monitoring where visual data is not available, and robotics, for example robots acting as waiters. It is also an initial signal processing stage for speech enhancement, speech separation, and Automatic Speech Recognition (ASR). Localization is usually carried out using an array of microphones. Localization scenarios vary: single-source localization, multiple-source localization, moving-source localization, localization in noisy environments, and so on. The field has seen extensive research because of its wide range of applications. This paper attempts to address some of the existing challenges in the field. It presents the design of a real-time, automatic target monitoring framework based on sound source localization. A comparative study of the results against other standard localization algorithms is also presented. The studies indicate that the designed framework is low-cost, small, and real-time, and that it can locate multiple sources satisfactorily even in an office environment, where noise and reverberation levels are higher than in standard anechoic rooms.
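The record lists GCC-PHAT among its keywords, but the abstract does not describe the algorithm. As a rough illustration only, and not the authors' implementation, the Python sketch below estimates the time difference of arrival between two microphone signals with GCC-PHAT and converts it to a far-field arrival angle. The function names `gcc_phat` and `tdoa_to_angle`, the two-microphone geometry, and the speed of sound of 343 m/s are all assumptions made for this example.

```python
import numpy as np


def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay (in seconds) of `sig` relative to `ref` with GCC-PHAT.

    Hypothetical helper for illustration; not taken from the paper.
    """
    n = len(sig) + len(ref)              # zero-pad to avoid circular wrap-around
    sig_f = np.fft.rfft(sig, n=n)
    ref_f = np.fft.rfft(ref, n=n)
    cross = sig_f * np.conj(ref_f)       # cross-power spectrum
    cross /= np.abs(cross) + 1e-12       # PHAT weighting: keep only the phase
    cc = np.fft.irfft(cross, n=n)        # generalized cross-correlation

    max_shift = n // 2
    if max_tau is not None:
        # Limit the search to physically plausible lags (at least one sample).
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so the lags run from -max_shift to +max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / fs


def tdoa_to_angle(tau, mic_distance, speed_of_sound=343.0):
    """Convert a delay to a far-field arrival angle in degrees (assumed geometry:
    two microphones `mic_distance` metres apart, angle measured from broadside)."""
    ratio = np.clip(speed_of_sound * tau / mic_distance, -1.0, 1.0)
    return float(np.degrees(np.arcsin(ratio)))
```

A typical use would be to call `gcc_phat(mic2, mic1, fs, max_tau=mic_distance / 343.0)` on each short frame and pass the result to `tdoa_to_angle`. Locating several sources or working with larger arrays, as the abstract describes, would need more than this two-channel sketch.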
Pages: 35-40
Page count: 6