Hybrid Multiscale SAR Ship Detector With CNN-Transformer and Adaptive Fusion Loss

Citations: 0
Authors
Wang, Fei [1]
Chen, Chengcheng [1]
Zeng, Weiming [1]
Affiliations
[1] Shanghai Maritime Univ, Digital Imaging & Intelligent Comp Lab, Shanghai 201306, Peoples R China
Keywords
Marine vehicles; Feature extraction; Detectors; Convolution; Transformers; Computational modeling; Synthetic aperture radar; Deep learning; multiscale feature fusion; ship detection; synthetic aperture radar (SAR)
DOI
10.1109/LGRS.2024.3450716
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification code
0708; 070902
Abstract
Ship detection in remote sensing imagery is crucial for maritime applications such as surveillance and navigation. Convolutional neural networks (CNNs) and transformers have shown significant potential for object detection in image processing. However, existing models applied directly to ship detection in synthetic aperture radar (SAR) imagery struggle with the varying sizes of ship targets, which often leads to low detection accuracy, missed detections, and false alarms. In this letter, we propose a new detection network, HMA-Net, to address these issues. First, we introduce the Cwin module, which enhances interference resistance at a relatively low cost, enabling the model to capture target information more accurately. Second, we design a multiscale ship feature extraction module, which uses a parallel multibranch structure to extract features of ships of various sizes and shapes. Finally, we introduce an adaptive fusion loss function that flexibly allocates loss calculation methods to detected targets, thereby enhancing the robustness of the model and yielding high-quality detection boxes. The proposed HMA-Net improves mAP(.50:.95) over the baseline models by 2.0% and 0.9% on the SAR Ship Detection dataset and the High-Resolution SAR Images dataset, respectively, using only 3.52 M parameters.
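The mAP(.50:.95) figure quoted in the abstract is usually read in the COCO sense: average precision is computed at ten intersection-over-union (IoU) thresholds from 0.50 to 0.95 in steps of 0.05 and then averaged. The sketch below assumes that convention (the record itself does not spell it out) and shows only the IoU computation and the threshold grid, not the full precision-recall integration. The function names `iou` and `thresholds_passed` are illustrative and do not come from the paper.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Clamp to zero so that disjoint boxes give an empty intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Assumed COCO-style grid behind the ".50:.95" notation: 0.50, 0.55, ..., 0.95.
IOU_THRESHOLDS = [t / 100 for t in range(50, 100, 5)]


def thresholds_passed(pred_box, gt_box):
    """Return the IoU thresholds at which the prediction counts as a match."""
    overlap = iou(pred_box, gt_box)
    return [t for t in IOU_THRESHOLDS if overlap >= t]
```

Under this reading, a predicted box that covers exactly half of a ground-truth ship box (IoU 0.5) counts as a match only at the loosest threshold, which is why the averaged metric rewards tighter detection boxes.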
Pages: 5