FBDPN: CNN-Transformer hybrid feature boosting and differential pyramid network for underwater object detection

被引:2
|
作者
Ji, Xun [1 ]
Chen, Shijie [1 ]
Hao, Li-Ying [1 ]
Zhou, Jingchun [2 ]
Chen, Long [1 ]
机构
[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian 116026, Peoples R China
[2] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian 116026, Peoples R China
关键词
Underwater object detection; Feature pyramid network; Convolutional neural network; Vision transformer;
D O I
10.1016/j.eswa.2024.124978
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite advancements in underwater object detection (UOD) from optical underwater images in recent years, the task still poses significant challenges due to the chaotic underwater environment, as well as the substantial variations in scale and contour of objects. Existing deep learning-based schemes generally overlook the enhancement and refinement between multi-scale features of densely distributed underwater objects, leading to inaccurate localization and classification predictions with excessive information redundancy. To tackle the above issues, this article presents a novel feature boosting and differential pyramid network (FBDPN) for precise and efficient UOD. The salient properties of our paper are: (1) a heuristic feature pyramid network (FPN)-inspired architecture is constructed, which employs a convolutional neural network (CNN)-Transformer hybrid strategy to simultaneously facilitate the learning of multi-scale features and the capture of long-distance dependencies among pixels. (2) A neighborhood-scale feature boosting module (NSFBM) is developed to enhance contextual information between features of neighborhood scales. (3) A cross-scale feature differential module (CSFDM) is designed further to achieve effective information redundancy between features of different scales. Extensive experiments are conducted to reveal that our proposed FBDPN can outperform other stateof-the-art methods in both UOD performance and computational complexity. In addition, sufficient ablation studies are also performed to demonstrate the effectiveness of each component in our FBDPN. The source code is available at https://github.com/jixun-dmu/FBDPN.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A CNN-TRANSFORMER HYBRID FEATURE DESCRIPTOR FOR OPTICAL-SAR IMAGE REGISTRATION
    Lin, Mingxin
    Liu, Binyuan
    Liu, Yijun
    Wang, Qingsong
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6069 - 6072
  • [22] Scale-Insensitive Object Detection via Attention Feature Pyramid Transformer Network
    Li, Lingling
    Zheng, Changwen
    Mao, Cunli
    Deng, Haibo
    Jin, Taisong
    NEURAL PROCESSING LETTERS, 2022, 54 (01) : 581 - 595
  • [23] CTFU-Net:CNN-Transformer Fusion U-shaped Network for Moving Object Detection
    Xia, Tingting
    Yang, Yizhong
    2024 3RD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MEDIA COMPUTING, ICIPMC 2024, 2024, : 44 - 50
  • [24] Scale-Insensitive Object Detection via Attention Feature Pyramid Transformer Network
    Lingling Li
    Changwen Zheng
    Cunli Mao
    Haibo Deng
    Taisong Jin
    Neural Processing Letters, 2022, 54 : 581 - 595
  • [25] HCTNet: A hybrid CNN-transformer network for breast ultrasound image segmentation
    He, Qiqi
    Yang, Qiuju
    Xie, Minghao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 155
  • [26] An improved feature pyramid network for object detection
    Zhu, Linxiang
    Lee, Feifei
    Cai, Jiawei
    Yu, Hongliu
    Chen, Qiu
    NEUROCOMPUTING, 2022, 483 : 127 - 139
  • [27] Parallel Feature Pyramid Network for Object Detection
    Kim, Seung-Wook
    Kook, Hyong-Keun
    Sun, Jee-Young
    Kang, Mun-Cheon
    Ko, Sung-Jea
    COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 239 - 256
  • [28] Latent Feature Pyramid Network for Object Detection
    Xie, Jin
    Pang, Yanwei
    Nie, Jing
    Cao, Jiale
    Han, Jungong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2153 - 2163
  • [29] Gated Feature Pyramid Network for Object Detection
    Xie, Xuemei
    Liao, Quan
    Ma, Lihua
    Jin, Xing
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 199 - 208
  • [30] Complementary Feature Pyramid Network for Object Detection
    Xie, Jin
    Pang, Yanwei
    Pan, Jing
    Nie, Jing
    Cao, Jiale
    Han, Jungong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)