Multi-Modal Object Detection Method Based on Dual-Branch Asymmetric Attention Backbone and Feature Fusion Pyramid Network

被引:1
|
作者
Wang, Jinpeng [1 ]
Su, Nan [1 ,2 ]
Zhao, Chunhui [1 ,2 ]
Yan, Yiming [1 ,2 ]
Feng, Shou [1 ,2 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin 150001, Peoples R China
[2] Harbin Engn Univ, Key Lab Adv Marine Commun & Informat Technol, Minist Ind & Informat Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-modal fusion; object detection; asymmetric attention; REMOTE-SENSING IMAGES;
D O I
10.3390/rs16203904
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
With the simultaneous acquisition of the infrared and optical remote sensing images of the same target becoming increasingly easy, using multi-modal data for high-performance object detection has become a research focus. In remote sensing multi-modal data, infrared images lack color information, it is hard to detect difficult targets with low contrast, and optical images are easily affected by illuminance. One of the most effective ways to solve this problem is to integrate multi-modal images for high-performance object detection. The challenge of fusion object detection lies in how to fully integrate multi-modal image features with significant modal differences and avoid introducing interference information while taking advantage of complementary advantages. To solve these problems, a new multi-modal fusion object detection method is proposed. In this paper, the method is improved in terms of two aspects: firstly, a new dual-branch asymmetric attention backbone network (DAAB) is designed, which uses a semantic information supplement module (SISM) and a detail information supplement module (DISM) to supplement and enhance infrared and RGB image information, respectively. Secondly, we propose a feature fusion pyramid network (FFPN), which uses a Transformer-like strategy to carry out multi-modal feature fusion and suppress features that are not conducive to fusion during the fusion process. This method is a state-of-the-art process for both FLIR-aligned and DroneVehicle datasets. Experiments show that this method has strong competitiveness and generalization performance.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Dual-Branch Feature Fusion Network for Salient Object Detection
    Song, Zhehan
    Xu, Zhihai
    Wang, Jing
    Feng, Huajun
    Li, Qi
    PHOTONICS, 2022, 9 (01)
  • [2] DMFusion: A dual-branch multi-scale feature fusion network for medical multi-modal image fusion
    Ma, Gengchen
    Qiu, Xihe
    Tan, Xiaoyu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105
  • [3] A novel intelligent bearing fault diagnosis method based on VMD denoising and dual-branch multi-modal feature fusion
    Li, Youjia
    Zhang, Zhongwei
    Jiao, Zonghao
    Shao, Mingyu
    Dai, Xiangjun
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2025,
  • [4] MULTI-MODAL FEATURE FUSION NETWORK FOR GHOST IMAGING OBJECT DETECTION
    Hu, Nan
    Ma, Huimin
    Le, Chao
    Shao, Xuehui
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 351 - 355
  • [5] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [6] Research on an Underwater Object Detection Network Based on Dual-Branch Feature Extraction
    Chen, Xiao
    Yuan, Mujiahui
    Fan, Chenye
    Chen, Xingwu
    Li, Yaan
    Wang, Haiyan
    ELECTRONICS, 2023, 12 (16)
  • [7] Dual-branch multi-modal convergence network for crater detection using Chang'e image
    Lin, Feng
    Hu, Xie
    Lin, Yiling
    Li, Yao
    Liu, Yang
    Li, Dongmei
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 134
  • [8] A Multiscale Dual-Branch Feature Fusion and Attention Network for Hyperspectral Images Classification
    Gao, Hongmin
    Zhang, Yiyan
    Chen, Zhonghao
    Li, Chenming
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 8180 - 8192
  • [9] DBGAN: Dual-Branch Generative Adversarial Network for Multi-Modal MRI Translation
    Lyu, Jun
    Yan, Shouang
    Hossain, M. Shamim
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (08) : 1 - 22
  • [10] Dual-branch feature fusion dehazing network via multispectral channel attention
    Jian, Huachun
    Zhang, Yongjun
    Gao, Weihao
    Wang, Bufan
    Wang, Guomei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2655 - 2671