Reliable underwater multi-target direction of arrival estimation with optimal transport using deep models

被引:0
作者
Yang, Zehui [1 ,2 ]
Nie, Weihang [1 ,2 ]
Ye, Lingxuan [1 ,2 ]
Cheng, Gaofeng [1 ]
Yan, Yonghong [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Speech & Intelligent Informat Proc Lab, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
NEURAL-NETWORK; SENSOR ARRAY; LOCALIZATION; MULTIPLE;
D O I
10.1121/10.0030398
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multi-target direction of arrival (DoA) estimation is an important and challenging task for sonar signal processing. In this study, we propose a method called learning direction of arrival with optimal transport (LOT) to accurately estimate the DoAs of multiple sources with a single deep model. We model the DoA estimation problem as a multi-label classification task and introduce an optimal transport (OT) loss based on the OT theory to capture the intrinsic continuity within the angular categories. We design a cost matrix for the OT loss in LOT approach to characterize the order and periodicity of the angular grid. The LOT approach encourages reliable predictions closer to the ground truth and suppresses spurious targets. We also propose a lightweight channel mask data augmentation module for deep models that use items related to the covariance matrix as input. The proposed methods can be seamlessly integrated with different model architectures and we indicate the portability with experiments on several typical network backbones. Experiments across various scenarios using different measurements show the effectiveness and robustness of our approaches. Results on SwellEx-96 experimental data demonstrate the practicality in real applications.
引用
收藏
页码:2119 / 2131
页数:13
相关论文
共 47 条
  • [1] Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks
    Adavanne, Sharath
    Politis, Archontis
    Nikunen, Joonas
    Virtanen, Tuomas
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 34 - 48
  • [2] Ali M., 2023, P ICASSP 2023 RHOD I, P1
  • [3] Detecting and visualizing cell phenotype differences from microscopy images using transportbased morphometry
    Basu, Saurav
    Kolouri, Soheil
    Rohde, Gustavo K.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (09) : 3448 - 3453
  • [4] Source localization with broad-band matched-field processing in shallow water
    Booth, NO
    Baxley, PA
    Rice, JA
    Schey, PW
    Hodgkiss, WS
    DSpain, GL
    Murray, JJ
    [J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 1996, 21 (04) : 402 - 412
  • [5] HIGH-RESOLUTION FREQUENCY-WAVENUMBER SPECTRUM ANALYSIS
    CAPON, J
    [J]. PROCEEDINGS OF THE IEEE, 1969, 57 (08) : 1408 - &
  • [6] Multi-Speaker DOA Estimation Using Deep Convolutional Networks Trained With Noise Signals
    Chakrabarty, Soumitro
    Habets, Emanuel A. P.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 8 - 21
  • [7] Sensitivity to Basis Mismatch in Compressed Sensing
    Chi, Yuejie
    Scharf, Louis L.
    Pezeshki, Ali
    Calderbank, A. Robert
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (05) : 2182 - 2195
  • [8] Robust DOA Estimation Method for MIMO Radar via Deep Neural Networks
    Cong, Jingyu
    Wang, Xianpeng
    Huang, Mengxing
    Wan, Liangtian
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (06) : 7498 - 7507
  • [9] Cuturi M., 2013, ADV NEURAL INFORM PR, V2, P4
  • [10] Mirages in shallow water matched field processing
    D'Spain, GL
    Murray, JJ
    Hodgkiss, WS
    Booth, NO
    Schey, PW
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (06) : 3245 - 3265