Time-Frequency Mask-Aware Bidirectional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation

被引:11
作者
Chen, Jie [1 ]
Liu, Chang [1 ]
Xie, Jiawu [1 ]
An, Jie [1 ]
Huang, Nan [1 ]
机构
[1] Univ Elect Sci & Technol China, Natl Key Lab Sci & Technol Commun, Chengdu 610000, Peoples R China
关键词
blind source separation; binary mask; deep learning; underwater acoustic signal; BLIND SEPARATION; NOISE; MIXTURES; CNN;
D O I
10.3390/s22155598
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Underwater acoustic signal separation is a key technique for underwater communications. The existing methods are mostly model-based, and cannot accurately characterize the practical underwater acoustic communication environment. They are only suitable for binary signal separation and cannot handle multivariate signal separation. However, recurrent neural networks (RNNs) show a powerful ability to extract the features of temporal sequences. Inspired by this, in this paper, we present a data-driven approach for underwater acoustic signal separation using deep learning technology. We use a bidirectional long short-term memory (Bi-LSTM) approach to explore the features of a time-frequency (T-F) mask, and propose a T-F-mask-aware Bi-LSTM for signal separation. Taking advantage of the sparseness of the T-F image, the designed Bi-LSTM network is able to extract the discriminative features for separation, which further improves the separation performance. In particular, this method breaks through the limitations of the existing methods and not only achieves good results in multivariate separation but also effectively separates signals when they are mixed with 40 dB Gaussian noise signals. The experimental results show that this method can achieve a 97% guarantee ratio (PSR), and the average similarity coefficient of the multivariate signal separation is stable above 0.8 under high noise conditions. It should be noted that our model can only handle known signals such as test signals for calibration.
引用
收藏
页数:21
相关论文
共 49 条
[1]  
Alqahtani A, 2018, IEEE IMAGE PROC, P4058, DOI 10.1109/ICIP.2018.8451506
[2]  
Araki S, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS, P881
[3]  
Araki S, 2005, INT CONF ACOUST SPEE, P81
[4]   Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors [J].
Araki, Shoko ;
Sawada, Hiroshi ;
Mukai, Ryo ;
Makino, Shoji .
SIGNAL PROCESSING, 2007, 87 (08) :1833-1847
[5]   Remotely Powered Underwater Acoustic Sensor Networks [J].
Bereketli, Alper ;
Bilgen, Semih .
IEEE SENSORS JOURNAL, 2012, 12 (12) :3467-3472
[6]   Blind signal separation: Statistical principles [J].
Cardoso, JF .
PROCEEDINGS OF THE IEEE, 1998, 86 (10) :2009-2025
[7]  
Cardoso JF, 1996, ISCAS 96: 1996 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - CIRCUITS AND SYSTEMS CONNECTING THE WORLD, VOL 2, P93, DOI 10.1109/ISCAS.1996.540360
[8]   Underwater Target Recognition Based on Multi-Decision LOFAR Spectrum Enhancement: A Deep-Learning Approach [J].
Chen, Jie ;
Han, Bing ;
Ma, Xufeng ;
Zhang, Jian .
FUTURE INTERNET, 2021, 13 (10)
[9]  
Comon P, 2010, HANDBOOK OF BLIND SOURCE SEPARATION: INDEPENDENT COMPONENT ANALYSIS AND APPLICATIONS, P1
[10]   INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT [J].
COMON, P .
SIGNAL PROCESSING, 1994, 36 (03) :287-314