Joint DOA Estimation in Spherical Harmonics Domain using Low Complexity CNN

被引:0
作者
Dwivedi, Priyadarshini [1 ]
Gohil, Raj Prakash [2 ]
Routray, Gyanajyoti [1 ]
Varanasi, Vishnuvardhan [3 ]
Hegde, Rajesh M. [1 ]
机构
[1] Indian Inst Technol Kanpur, Kanpur, Uttar Pradesh, India
[2] Fraunhofer IDMT, Ilmenau, Germany
[3] Enphase Solar Energy Pvt Ltd, Bengaluru, India
来源
2022 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM | 2022年
关键词
OF-ARRIVAL ESTIMATION; LOCALIZATION;
D O I
10.1109/SPCOM55316.2022.9840853
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Direction of arrival (DOA) estimation for multi-channel speech enhancement is a challenging problem. In this context, this paper proposes a new method for joint DOA estimation using a low complexity convolutional neural network (CNN) architecture. The spherical harmonic (SH) coefficients of the received speech signal are obtained from the spherical harmonics decomposition (SHD). The magnitude and phase features are extracted from these SH coefficients and combined as a single feature for training the CNN. A single CNN model is trained using these combined features in contrast to two CNN models used in earlier work. Both azimuth and elevation are then obtained for estimation of DOA from this single CNN. Extensive simulations are also conducted for the performance evaluation of the proposed low complexity CNN model. It is observed that the proposed CNN model provides robust DOA estimates at the various signal to noise ratios (SNR) and reverberation times with reduced computational complexity. Performance evaluated in terms of the gross error (GE) and run-time complexity also provides interesting results motivating the use of the proposed model in practical applications.
引用
收藏
页数:5
相关论文
共 23 条
  • [1] Abhayapala TD, 2002, INT CONF ACOUST SPEE, P1949
  • [2] Adavanne S, 2018, EUR SIGNAL PR CONF, P1462, DOI 10.23919/EUSIPCO.2018.8553182
  • [3] Open-sphere designs for spherical microphone arrays
    Balmages, Ilya
    Rafaely, Boaz
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02): : 727 - 732
  • [4] Hioka Y, 2004, IEICE T FUND ELECTR, VE87A, P559
  • [5] Rigid sphere room impulse response simulation: Algorithm and applications
    Jarrett, D. P.
    Habets, E. A. P.
    Thomas, M. R. P.
    Naylor, P. A.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (03) : 1462 - 1472
  • [6] GENERALIZED CORRELATION METHOD FOR ESTIMATION OF TIME-DELAY
    KNAPP, CH
    CARTER, GC
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (04): : 320 - 327
  • [7] Mabande E, 2011, EUR SIGNAL PR CONF, P146
  • [8] Direction of Arrival Estimation for Reverberant Speech Based on Enhanced Decomposition of the Direct Sound
    Madmoni, Lior
    Rafaely, Boaz
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 131 - 142
  • [9] Meyer J, 2002, INT CONF ACOUST SPEE, P1781
  • [10] mhacoustics, The microphone array