Multiple attentional path aggregation network for marine object detection

被引:51
作者
Yu, Haifeng [1 ]
Li, Xinbin [1 ]
Feng, Yankai [1 ]
Han, Song [1 ]
机构
[1] Yanshan Univ, Inst Elect Engn, Qinhuangdao 066004, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Marine target detection; Path aggregation network; Multi-attention; Underwater image enhancement; CNN;
D O I
10.1007/s10489-022-03622-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Marine target detection is a challenging task because degraded underwater images cause unclear targets. Furthermore, marine targets are small in size and tend to live together. The popular object detection methods perform poorly in marine target detection. Thus, this paper proposes a novel multiple attentional path aggregation network named APAN to improve performance on marine object detection. Firstly, we design a path aggregation network structure which brings features from backbone network to bottom-up path augmentation. Each feature map is enhanced by the lower layer through the bottom-up downsampling pathway and incorporates the features from top-down upsampling layers. Specifically, the last layer fuses feature map from backbone network which enhances the semantic features and improve the ability of feature extraction. Then, a multi-attention which combines coordinate competing attention and spatial supplement attention applies to proposed path aggregation network. Multi-attention can further improve the accuracy of multiple marine object detection. Finally, a double transmission underwater image enhancement algorithm is proposed to enhance the underwater image datasets. The experiments show our method achieves 79.6% mAP in underwater image datasets and 79.03% mAP in enhanced underwater image datasets. Meanwhile, our method achieves 81.5% mAP in PASCAL VOC datasets. In addition, we also applly the method to the underwater robot. The experiments show our method achieves good performance compared with popular object detection methods. The source code is publicly available at https://github.com/yhf2022/APAN.
引用
收藏
页码:2434 / 2451
页数:18
相关论文
共 52 条
[11]   Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification [J].
Du, Yang ;
Yuan, Chunfeng ;
Li, Bing ;
Zhao, Lili ;
Li, Yangxi ;
Hu, Weiming .
COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 :388-404
[12]   Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks [J].
Fan, Deng-Ping ;
Lin, Zheng ;
Zhang, Zhao ;
Zhu, Menglong ;
Cheng, Ming-Ming .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) :2075-2089
[13]  
Gao S B, 2019, UNDERWATER IMAGE ENH, V28
[14]  
Ge Z., 2021, ARXIV, DOI 10.48550/ARXIV.2107.08430
[15]  
Ghiasi G, 2016, EUROPEAN C COMPUTER
[16]   Advanced Deep-Learning Techniques for Salient and Category-Specific Object Detection A survey [J].
Han, Junwei ;
Zhang, Dingwen ;
Cheng, Gong ;
Liu, Nian ;
Xu, Dong .
IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (01) :84-100
[17]   The extended marine underwater environment database and baseline evaluations [J].
Jian, Muwei ;
Qi, Qiang ;
Yu, Hui ;
Dong, Junyu ;
Cui, Chaoran ;
Nie, Xiushan ;
Zhang, Huaxiang ;
Yin, Yilong ;
Lam, Kin-Man .
APPLIED SOFT COMPUTING, 2019, 80 :425-437
[18]   A Survey of Deep Learning-Based Object Detection [J].
Jiao, Licheng ;
Zhang, Fan ;
Liu, Fang ;
Yang, Shuyuan ;
Li, Lingling ;
Feng, Zhixi ;
Qu, Rong .
IEEE ACCESS, 2019, 7 :128837-128868
[19]   A survey of the recent architectures of deep convolutional neural networks [J].
Khan, Asifullah ;
Sohail, Anabia ;
Zahoora, Umme ;
Qureshi, Aqsa Saeed .
ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (08) :5455-5516
[20]   An Underwater Image Enhancement Benchmark Dataset and Beyond [J].
Li, Chongyi ;
Guo, Chunle ;
Ren, Wenqi ;
Cong, Runmin ;
Hou, Junhui ;
Kwong, Sam ;
Tao, Dacheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :4376-4389