A LEARNING-BASED APPROACH TO DIRECTION OF ARRIVAL ESTIMATION IN NOISY AND REVERBERANT ENVIRONMENTS

被引:0
作者
Xiao, Xiong [1 ]
Zhao, Shengkui [2 ]
Zhong, Xionghu [3 ]
Jones, Douglas L. [2 ]
Chng, Eng Siong [3 ]
Li, Haizhou [3 ,4 ]
机构
[1] Nanyang Technol Univ, Temasek Lab, Singapore, Singapore
[2] Adv Digital Sci Ctr, Singapore, Singapore
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
[4] Inst Infocomm Res, Dept Human Language Technol, Singapore, Singapore
来源
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015年
关键词
microphone arrays; direction of arrival; least squares; machine learning; neural networks; HISTOGRAM EQUALIZATION; LOCALIZATION; ADAPTATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a learning-based approach to the task of direction of arrival estimation (DOA) from microphone array input. Traditional signal processing methods such as the classic least square (LS) method rely on strong assumptions on signal models and accurate estimations of time delay of arrival (TDOA). They only work well in relatively clean conditions, but suffer from noise and reverberation distortions. In this paper, we propose a learning-based approach that can learn from a large amount of simulated noisy and reverberant microphone array inputs for robust DOA estimation. Specifically, we extract features from the generalised cross correlation (GCC) vectors and use a multilayer perceptron neural network to learn the nonlinear mapping from such features to the DOA. One advantage of the learning based method is that as more and more training data becomes available, the DOA estimation will become more and more accurate. Experimental results on simulated data show that the proposed learning based method produces much better results than the state-of-the-art LS method. The testing results on real data recorded in meeting rooms show improved root-mean-square error (RMSE) compared to the LS method.
引用
收藏
页码:2814 / 2818
页数:5
相关论文
共 50 条
  • [31] High-Precision Direction of Arrival Estimation Based on LightGBM
    Wang, Fuwei
    Zhang, Xiaoyu
    Liu, Lu
    Chen, Chen
    He, Xingrui
    Zhou, Yan
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (09) : 5834 - 5849
  • [32] Acoustic Direction of Arrival Estimation Based on Spatial Circular Prediction
    He Hongsen
    Lu Jing
    Gao Yang
    2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 3, PROCEEDINGS, 2009, : 177 - +
  • [33] PROBABILISTIC SPATIAL DICTIONARY BASED ONLINE ADAPTIVE BEAMFORMING FOR MEETING RECOGNITION IN NOISY AND REVERBERANT ENVIRONMENTS
    To, Nobutaka
    Araki, Shoko
    Deleroix, Mare
    Nakatani, Tomohiro
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 681 - 685
  • [34] Direction of Arrival Estimation Based on Generalized Reference Curve Model
    Cui, Lizhi
    Bu, Xuhui
    Yang, Junqi
    Yang, Yi
    He, Weina
    PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 650 - 653
  • [35] Speaker Direction-of-Arrival Estimation Based on Orthogonal Dipoles
    Guo, Feng
    Cao, Yuhang
    Huang, Zhaoqiong
    You, Xing
    Guan, Haixing
    Liang, Jiaen
    Li, Baoqing
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (05) : 2320 - 2334
  • [36] Time-Frequency Bins Selection for Direction of Arrival Estimation Based on Speech Presence Probability Learning
    Zhang, Qinzheng
    Wang, Haiyan
    Jensen, Jesper Rindom
    Tao, Shuai
    Christensen, Mads Graesboll
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (05) : 2961 - 2981
  • [37] A Review on Machine Learning-based Malware Detection Techniques for Internet of Things (IoT) Environments
    Sasikala, S.
    Janakiraman, Sengathir
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 132 (03) : 1961 - 1974
  • [38] A machine learning approach for detecting ultrasonic echoes in noisy environments
    Mohamed, Mohamed-Elamir
    Gotzig, Heinrich
    Zoellner, Raoul
    Maeder, Patrick
    2019 IEEE 89TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2019-SPRING), 2019,
  • [39] Machine Learning-Based Radio Coverage Prediction in Urban Environments
    Mohammadjafari, Sanaz
    Roginsky, Sophie
    Kavurmacioglu, Emir
    Cevik, Mucahit
    Ethier, Jonathan
    Bener, Ayse Basar
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (04): : 2117 - 2130
  • [40] Joint for time of arrival and direction of arrival estimation algorithm based on the subspace of extended hadamard product
    Ba Bin
    Liu Guo-Chun
    Li Tao
    Lin Yu-Cheng
    Wang Yu
    ACTA PHYSICA SINICA, 2015, 64 (07)