Multiple attentional path aggregation network for marine object detection

被引:51
作者
Yu, Haifeng [1 ]
Li, Xinbin [1 ]
Feng, Yankai [1 ]
Han, Song [1 ]
机构
[1] Yanshan Univ, Inst Elect Engn, Qinhuangdao 066004, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Marine target detection; Path aggregation network; Multi-attention; Underwater image enhancement; CNN;
D O I
10.1007/s10489-022-03622-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Marine target detection is a challenging task because degraded underwater images cause unclear targets. Furthermore, marine targets are small in size and tend to live together. The popular object detection methods perform poorly in marine target detection. Thus, this paper proposes a novel multiple attentional path aggregation network named APAN to improve performance on marine object detection. Firstly, we design a path aggregation network structure which brings features from backbone network to bottom-up path augmentation. Each feature map is enhanced by the lower layer through the bottom-up downsampling pathway and incorporates the features from top-down upsampling layers. Specifically, the last layer fuses feature map from backbone network which enhances the semantic features and improve the ability of feature extraction. Then, a multi-attention which combines coordinate competing attention and spatial supplement attention applies to proposed path aggregation network. Multi-attention can further improve the accuracy of multiple marine object detection. Finally, a double transmission underwater image enhancement algorithm is proposed to enhance the underwater image datasets. The experiments show our method achieves 79.6% mAP in underwater image datasets and 79.03% mAP in enhanced underwater image datasets. Meanwhile, our method achieves 81.5% mAP in PASCAL VOC datasets. In addition, we also applly the method to the underwater robot. The experiments show our method achieves good performance compared with popular object detection methods. The source code is publicly available at https://github.com/yhf2022/APAN.
引用
收藏
页码:2434 / 2451
页数:18
相关论文
共 52 条
[1]   Color Balance and Fusion for Underwater Image Enhancement [J].
Ancuti, Codruta O. ;
Ancuti, Cosmin ;
De Vleeschouwer, Christophe ;
Bekaert, Philippe .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (01) :379-393
[2]   Deep semantic segmentation of natural and medical images: a review [J].
Asgari Taghanaki, Saeid ;
Abhishek, Kumar ;
Cohen, Joseph Paul ;
Cohen-Adad, Julien ;
Hamarneh, Ghassan .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) :137-178
[3]   Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI [J].
Barredo Arrieta, Alejandro ;
Diaz-Rodriguez, Natalia ;
Del Ser, Javier ;
Bennetot, Adrien ;
Tabik, Siham ;
Barbado, Alberto ;
Garcia, Salvador ;
Gil-Lopez, Sergio ;
Molina, Daniel ;
Benjamins, Richard ;
Chatila, Raja ;
Herrera, Francisco .
INFORMATION FUSION, 2020, 58 :82-115
[4]  
Bell S, 2016, P IEEE C COMPUTER VI
[5]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[6]  
Chen L., 2017, P IEEE C COMPUTER VI
[7]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[8]   Fast single shot multibox detector and its application on vehicle counting system [J].
Chen, Lili ;
Zhang, Zhengdao ;
Peng, Li .
IET INTELLIGENT TRANSPORT SYSTEMS, 2018, 12 (10) :1406-1413
[9]  
Chen X, 2020, ARXIV 200301913
[10]   Towards Real-Time Advancement of Underwater Visual Quality With GAN [J].
Chen, Xingyu ;
Yu, Junzhi ;
Kong, Shihan ;
Wu, Zhengxing ;
Fang, Xi ;
Wen, Li .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2019, 66 (12) :9350-9359