Asymmetric Aggregation Network for Accurate Ship Detection in Optical Imagery

被引：0

作者：

Zhang, Yani ^{[1
]}

Er, Meng Joo ^{[1
]}

机构：

[1] Dalian Maritime Univ, Coll Marine Elect Engn, Inst Artificial Intelligence & Marine Robot, Dalian 116026, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

关键词：

Feature extraction; Marine vehicles; Semantics; Accuracy; YOLO; Convolution; Visualization; Real-time systems; Optical sensors; Optical imaging; Depthwise convolution; feature pyramid network (FPN); ship detection; visual attention; DATASET;

D O I：

10.1109/TGRS.2024.3481370

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Optical imagery ship detection has achieved significant developments recently. However, accurate detection in complex scenes and for different-scale ships remains a vital challenge. To solve the above issues, in this article, we propose the asymmetric aggregation feature pyramid network (A2FPN), incorporating top-down semantic aggregation and bottom-up detail enhancement to propagate semantic and detailed information across different feature levels. In particular, the higher-level hierarchical features propagate global semantic information to the lower-level hierarchical features, successively enhancing the discriminative ability of each level of hierarchical features. After that, the lower-level hierarchical features with abundant semantic information are also aggregated successively to the higher-level hierarchical features through the augmentation path, enriching the details of each level of hierarchical features. Considering the real-time requirements of ship detection, we replace the original path aggregation feature pyramid network (FPN) of YOLOX with the proposed A2FPN and develop a ship detection model termed asymmetric aggregation network (A2Net). Extensive experiments are performed on the three commonly used ship detection datasets, ShipRSImageNet, Seaships7000, and HRSC2016. Quantitative and qualitative results demonstrate that A2Net outperforms the state-of-the-art methods.

引用

页数：14

共 73 条

[1] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[2] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[3] Improved YOLOv3 Based on Attention Mechanism for Fast and Accurate Ship Detection in Optical Remote Sensing Images
Chen, Liqiong
Shi, Wenxuan
Deng, Dexiang
[J]. REMOTE SENSING, 2021, 13 (04) : 1 - 18
[4] YOLO-World: Real-Time Open-Vocabulary Object Detection
Cheng, Tianheng
Sone, Lin
Ge, Yixiao
Liu, Wenyu
Wang, Xinggang
Shan, Yong
[J]. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16901 - 16911
[5] Cho HC, 2023, Arxiv, DOI arXiv:2303.13040
[6] Du ZW, 2024, Arxiv, DOI arXiv:2407.19696
[7] Ship detection with deep learning: a survey
Er, Meng Joo
Zhang, Yani
Chen, Jie
Gao, Wenxiao
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (10) : 11825 - 11865
[8] Multiphysical Interpretable Deep Learning Network for Oil Spill Identification Based on SAR Images
Fan, Jianchao
Sui, Zitai
Wang, Xinzhe
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
[9] Open Vocabulary Object Detection with Pseudo Bounding-Box Labels
Gao, Mingfei
Xing, Chen
Niebles, Juan Carlos
Li, Junnan
Xu, Ran
Liu, Wenhao
Xiong, Caiming
[J]. COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 266 - 282
[10] EASE-DETR: Easing the Competition among Object Queries
Gao, Yulu
Sun, Yifan
Ding, Xudong
Zhao, Chuyang
Liu, Si
[J]. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 17282 - 17291

← 1 2 3 4 5 6 7 8 →