Asymmetric Aggregation Network for Accurate Ship Detection in Optical Imagery

被引:0
作者
Zhang, Yani [1 ]
Er, Meng Joo [1 ]
机构
[1] Dalian Maritime Univ, Coll Marine Elect Engn, Inst Artificial Intelligence & Marine Robot, Dalian 116026, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Feature extraction; Marine vehicles; Semantics; Accuracy; YOLO; Convolution; Visualization; Real-time systems; Optical sensors; Optical imaging; Depthwise convolution; feature pyramid network (FPN); ship detection; visual attention; DATASET;
D O I
10.1109/TGRS.2024.3481370
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Optical imagery ship detection has achieved significant developments recently. However, accurate detection in complex scenes and for different-scale ships remains a vital challenge. To solve the above issues, in this article, we propose the asymmetric aggregation feature pyramid network (A2FPN), incorporating top-down semantic aggregation and bottom-up detail enhancement to propagate semantic and detailed information across different feature levels. In particular, the higher-level hierarchical features propagate global semantic information to the lower-level hierarchical features, successively enhancing the discriminative ability of each level of hierarchical features. After that, the lower-level hierarchical features with abundant semantic information are also aggregated successively to the higher-level hierarchical features through the augmentation path, enriching the details of each level of hierarchical features. Considering the real-time requirements of ship detection, we replace the original path aggregation feature pyramid network (FPN) of YOLOX with the proposed A2FPN and develop a ship detection model termed asymmetric aggregation network (A2Net). Extensive experiments are performed on the three commonly used ship detection datasets, ShipRSImageNet, Seaships7000, and HRSC2016. Quantitative and qualitative results demonstrate that A2Net outperforms the state-of-the-art methods.
引用
收藏
页数:14
相关论文
共 73 条
  • [1] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
  • [2] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [3] Improved YOLOv3 Based on Attention Mechanism for Fast and Accurate Ship Detection in Optical Remote Sensing Images
    Chen, Liqiong
    Shi, Wenxuan
    Deng, Dexiang
    [J]. REMOTE SENSING, 2021, 13 (04) : 1 - 18
  • [4] YOLO-World: Real-Time Open-Vocabulary Object Detection
    Cheng, Tianheng
    Sone, Lin
    Ge, Yixiao
    Liu, Wenyu
    Wang, Xinggang
    Shan, Yong
    [J]. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16901 - 16911
  • [5] Cho HC, 2023, Arxiv, DOI arXiv:2303.13040
  • [6] Du ZW, 2024, Arxiv, DOI arXiv:2407.19696
  • [7] Ship detection with deep learning: a survey
    Er, Meng Joo
    Zhang, Yani
    Chen, Jie
    Gao, Wenxiao
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (10) : 11825 - 11865
  • [8] Multiphysical Interpretable Deep Learning Network for Oil Spill Identification Based on SAR Images
    Fan, Jianchao
    Sui, Zitai
    Wang, Xinzhe
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [9] Open Vocabulary Object Detection with Pseudo Bounding-Box Labels
    Gao, Mingfei
    Xing, Chen
    Niebles, Juan Carlos
    Li, Junnan
    Xu, Ran
    Liu, Wenhao
    Xiong, Caiming
    [J]. COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 266 - 282
  • [10] EASE-DETR: Easing the Competition among Object Queries
    Gao, Yulu
    Sun, Yifan
    Ding, Xudong
    Zhao, Chuyang
    Liu, Si
    [J]. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 17282 - 17291