DACFusion: Dual Asymmetric Cross-Attention guided feature fusion for multispectral object detection

被引:0
|
作者
Qian, Jingchen [1 ]
Qiao, Baiyou [1 ,2 ]
Zhang, Yuekai [1 ]
Liu, Tongyan [1 ]
Wang, Shuo [1 ]
Wu, Gang [1 ,2 ]
Han, Donghong [1 ,2 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110819, Peoples R China
关键词
Multispectral object detection; Cross-attention; Feature fusion; SCALING-UP; NETWORK;
D O I
10.1016/j.neucom.2025.129913
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective fusion of unique features from different spectra plays a crucial role in multispectral object detection. Recent research has focused on transplanting advanced methods from other multimodal fusion fields to multispectral object detection tasks. These fusion methods focus on the fusion of features and ignore the spatial correspondence between multispectral images. This lack of correspondence in turn limits the full utilization of the complementarities between the different modalities, which affects the accuracy of object detection. To address this problem, we creatively propose a dual asymmetric cross-attention multispectral fusion (DACFusion) method, which is able to process features interactively based on the positional correspondence between two spectra, and then asymmetrically fuses the multispectral data according to the characteristics of each spectrum to take advantage of their complementary strengths. Meanwhile, we introduce a large selective kernel network to expand the receptive field for object detection, which further improves the detection accuracy. Experimental results on the VEDAI and LLVIP datasets validate the significant performance advantages of our proposed method and show its applicability to a variety of practical application scenarios. Code will be available at https://github.com/wood-fish/DACFusion.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Dual Stream Cross Domain Feature Fusion for Land-Oceanic Object Detection
    Lv, JunFeng
    Hui, Tian
    Xu, YueLei
    Zhi, YongFeng
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 3865 - 3875
  • [22] Complex Scenes Fire Object Detection Based on Feature Fusion and Channel Attention
    Cao, Xinrong
    Wu, Jincai
    Chen, Jian
    Li, Zuoyong
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, : 7587 - 7601
  • [23] Dual Cross-Attention for medical image segmentation
    Ates, Gorkem Can
    Mohan, Prasoon
    Celik, Emrah
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [24] Object Detection Network Based on Feature Fusion and Attention Mechanism
    Zhang, Ying
    Chen, Yimin
    Huang, Chen
    Gao, Mingke
    FUTURE INTERNET, 2019, 11 (01):
  • [25] Attention and Feature Fusion SSD for Remote Sensing Object Detection
    Lu, Xiaocong
    Ji, Jian
    Xing, Zhiqi
    Miao, Qiguang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [26] AGFNet: Attention Guided Fusion Network for Camouflaged Object Detection
    Zhao, Zeyu
    Liu, Zhihao
    Peng, Chenglei
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 478 - 489
  • [27] Boundary Guided Feature Fusion Network for Camouflaged Object Detection
    Qiu, Tianchi
    Li, Xiuhong
    Liu, Kangwei
    Li, Songlin
    Chen, Fan
    Zhou, Chenyu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 433 - 444
  • [28] Locality guided cross-modal feature aggregation and pixel-level fusion for multispectral pedestrian detection
    Cao, Yanpeng
    Luo, Xing
    Yang, Jiangxin
    Cao, Yanlong
    Yang, Michael Ying
    INFORMATION FUSION, 2022, 88 : 1 - 11
  • [29] CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection
    Hwang, Jyh-Jing
    Kretzschmar, Henrik
    Manela, Joshua
    Rafferty, Sean
    Armstrong-Crews, Nicholas
    Chen, Tiffany
    Anguelov, Dragomir
    COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 388 - 405
  • [30] GACFNet: A global attention cross-level feature fusion network for aerial image object detection
    Liang, Xingzhu
    Li, Mengyuan
    Lin, Yu-e
    Fang, Xianjin
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 123