FBDPN: CNN-Transformer hybrid feature boosting and differential pyramid network for underwater object detection

Cited by: 2
Authors
Ji, Xun [1]
Chen, Shijie [1]
Hao, Li-Ying [1]
Zhou, Jingchun [2]
Chen, Long [1]
Affiliations
[1] Dalian Maritime Univ, Sch Marine Elect Engn, Dalian 116026, Peoples R China
[2] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian 116026, Peoples R China
Keywords
Underwater object detection; Feature pyramid network; Convolutional neural network; Vision transformer
DOI
10.1016/j.eswa.2024.124978
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Despite advancements in underwater object detection (UOD) from optical underwater images in recent years, the task still poses significant challenges due to the chaotic underwater environment, as well as the substantial variations in scale and contour of objects. Existing deep learning-based schemes generally overlook the enhancement and refinement between multi-scale features of densely distributed underwater objects, leading to inaccurate localization and classification predictions with excessive information redundancy. To tackle the above issues, this article presents a novel feature boosting and differential pyramid network (FBDPN) for precise and efficient UOD. The salient properties of our paper are: (1) A heuristic feature pyramid network (FPN)-inspired architecture is constructed, which employs a convolutional neural network (CNN)-Transformer hybrid strategy to simultaneously facilitate the learning of multi-scale features and the capture of long-distance dependencies among pixels. (2) A neighborhood-scale feature boosting module (NSFBM) is developed to enhance contextual information between features of neighborhood scales. (3) A cross-scale feature differential module (CSFDM) is further designed to effectively reduce information redundancy between features of different scales. Extensive experiments are conducted to reveal that our proposed FBDPN can outperform other state-of-the-art methods in both UOD performance and computational complexity. In addition, sufficient ablation studies are also performed to demonstrate the effectiveness of each component in our FBDPN. The source code is available at https://github.com/jixun-dmu/FBDPN.
Pages: 13
Related papers
50 in total
  • [31] Hybrid CNN-transformer network for interactive learning of challenging musculoskeletal images
    Bi, Lei
    Buehner, Ulrich
    Fu, Xiaohang
    Williamson, Tom
    Choong, Peter
    Kim, Jinman
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 243
  • [32] CNN-Transformer hybrid network for concrete dam crack patrol inspection
    Li, Mingchao
    Yuan, Jingyue
    Ren, Qiubing
    Luo, Qiling
    Fu, Junen
    Li, Zhitang
    AUTOMATION IN CONSTRUCTION, 2024, 163
  • [33] CNN-TRANSFORMER WITH SELF-ATTENTION NETWORK FOR SOUND EVENT DETECTION
    Wakayama, Keigo
    Saito, Shoichiro
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 806 - 810
  • [34] CTHD-Net: CNN-Transformer hybrid dehazing network via residual global attention and gated boosting strategy
    Li, Haiyan
    Qiao, Renchao
    Yu, Pengfei
    Li, Haijiang
    Tan, Mingchuan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 99
  • [35] Relating CNN-Transformer Fusion Network for Remote Sensing Change Detection
    Gao, Yuhao
    Pei, Gensheng
    Sheng, Mengmeng
    Sun, Zeren
    Chen, Tao
    Yao, Yazhou
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024
  • [36] RoadCT: A Hybrid CNN-Transformer Network for Road Extraction From Satellite Imagery
    Liu, Wei
    Gao, Shufeng
    Zhang, Chun
    Yang, Bijia
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [37] Hybrid CNN-Transformer model for medical image segmentation with pyramid convolution and multi-layer perceptron
    Liu, Xiaowei
    Hu, Yikun
    Chen, Jianguo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [38] Breast Ultrasound Tumor Classification Using a Hybrid Multitask CNN-Transformer Network
    Shareef, Bryar
    Xian, Min
    Vakanski, Aleksandar
    Wang, Haotian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 344 - 353
  • [39] DACTransNet: A Hybrid CNN-Transformer Network for Histopathological Image Classification of Pancreatic Cancer
    Kou, Yongqing
    Xia, Cong
    Jiao, Yiping
    Zhang, Daoqiang
    Ge, Rongjun
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 422 - 434
  • [40] A CNN-transformer hybrid approach for an intrusion detection system in advanced metering infrastructure
    Ruizhe Yao
    Ning Wang
    Peng Chen
    Di Ma
    Xianjun Sheng
    Multimedia Tools and Applications, 2023, 82 : 19463 - 19486