Underwater Target Detection Utilizing Polarization Image Fusion Algorithm Based on Unsupervised Learning and Attention Mechanism

被引:8
作者
Cheng, Haoyuan [1 ]
Zhang, Deqing [1 ]
Zhu, Jinchi [1 ]
Yu, Hao [2 ]
Chu, Jinkui [2 ]
机构
[1] Ocean Univ China, Coll Engn, Qingdao 266100, Peoples R China
[2] Dalian Univ Technol, Key Lab Micro Nano Technol & Syst Liaoning Prov, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
underwater target detection; image fusion; unsupervised learning; attention mechanism; polarization; NETWORK;
D O I
10.3390/s23125594
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Since light propagation in water bodies is subject to absorption and scattering effects, underwater images using only conventional intensity cameras will suffer from low brightness, blurred images, and loss of details. In this paper, a deep fusion network is applied to underwater polarization images; that is, the underwater polarization images are fused with intensity images using the deep learning method. To construct a training dataset, we establish an experimental setup to obtain underwater polarization images and perform appropriate transformations to expand the dataset. Next, an end-to-end learning framework based on unsupervised learning and guided by an attention mechanism is constructed for fusing polarization and light intensity images. The loss function and weight parameters are elaborated. The produced dataset is used to train the network under different loss weight parameters, and the fused images are evaluated based on different image evaluation metrics. The results show that the fused underwater images are more detailed. Compared with light intensity images, the information entropy and standard deviation of the proposed method increase by 24.48% and 139%. The image processing results are better than other fusion-based methods. In addition, the improved U-net network structure is used to extract features for image segmentation. The results show that the target segmentation based on the proposed method is feasible under turbid water. The proposed method does not require manual adjustment of weight parameters, has faster operation speed, and has strong robustness and self-adaptability, which is important for research in vision fields, such as ocean detection and underwater target recognition.
引用
收藏
页数:12
相关论文
共 24 条
[1]  
Candes E.J., 1998, Ridgelets: theory and applications
[2]   Real-Time Position and Attitude Estimation for Homing and Docking of an Autonomous Underwater Vehicle Based on Bionic Polarized Optical Guidance [J].
Cheng, Haoyuan ;
Chu, Jinkui ;
Zhang, Ran ;
Gui, Xinyuan ;
Tian, Lianbiao .
JOURNAL OF OCEAN UNIVERSITY OF CHINA, 2020, 19 (05) :1042-1050
[3]   Underwater polarization patterns considering single Rayleigh scattering of water molecules [J].
Cheng, Haoyuan ;
Chu, Jinkui ;
Zhang, Ran ;
Tian, Lianbiao ;
Gui, Xinyuan .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2020, 41 (13) :4947-4962
[4]   A novel perceptual two layer image fusion using deep learning for imbalanced COVID-19 dataset [J].
Elzeki, Omar M. ;
Abd Elfattah, Mohamed ;
Salem, Hanaa ;
Hassanien, Aboul Ella ;
Shams, Mahmoud .
PEERJ COMPUTER SCIENCE, 2021,
[5]   Underwater image recovery considering polarization effects of objects [J].
Huang, Bingjing ;
Liu, Tiegen ;
Hu, Haofeng ;
Han, Jiahui ;
Yu, Mingxuan .
OPTICS EXPRESS, 2016, 24 (09) :9826-9838
[6]   COMPUTER MODELING AND THE DESIGN OF OPTIMAL UNDERWATER IMAGING-SYSTEMS [J].
JAFFE, JS .
IEEE JOURNAL OF OCEANIC ENGINEERING, 1990, 15 (02) :101-111
[7]  
Jolliffe Ian T., 1986, Principal component analysis, DOI DOI 10.1016/0169-7439(87)80084-9
[8]   Deep High Dynamic Range Imaging of Dynamic Scenes [J].
Kalantari, Nima Khademi ;
Ramamoorthi, Ravi .
ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04)
[9]   Neural Network Analysis for Microplastic Segmentation [J].
Lee, Gwanghee ;
Jhang, Kyoungson .
SENSORS, 2021, 21 (21)
[10]   An Underwater Image Enhancement Benchmark Dataset and Beyond [J].
Li, Chongyi ;
Guo, Chunle ;
Ren, Wenqi ;
Cong, Runmin ;
Hou, Junhui ;
Kwong, Sam ;
Tao, Dacheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :4376-4389