DACFusion: Dual Asymmetric Cross-Attention guided feature fusion for multispectral object detection

被引:0
|
作者
Qian, Jingchen [1 ]
Qiao, Baiyou [1 ,2 ]
Zhang, Yuekai [1 ]
Liu, Tongyan [1 ]
Wang, Shuo [1 ]
Wu, Gang [1 ,2 ]
Han, Donghong [1 ,2 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110819, Peoples R China
关键词
Multispectral object detection; Cross-attention; Feature fusion; SCALING-UP; NETWORK;
D O I
10.1016/j.neucom.2025.129913
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective fusion of unique features from different spectra plays a crucial role in multispectral object detection. Recent research has focused on transplanting advanced methods from other multimodal fusion fields to multispectral object detection tasks. These fusion methods focus on the fusion of features and ignore the spatial correspondence between multispectral images. This lack of correspondence in turn limits the full utilization of the complementarities between the different modalities, which affects the accuracy of object detection. To address this problem, we creatively propose a dual asymmetric cross-attention multispectral fusion (DACFusion) method, which is able to process features interactively based on the positional correspondence between two spectra, and then asymmetrically fuses the multispectral data according to the characteristics of each spectrum to take advantage of their complementary strengths. Meanwhile, we introduce a large selective kernel network to expand the receptive field for object detection, which further improves the detection accuracy. Experimental results on the VEDAI and LLVIP datasets validate the significant performance advantages of our proposed method and show its applicability to a variety of practical application scenarios. Code will be available at https://github.com/wood-fish/DACFusion.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Improved Traffic Small Object Detection via Cross-Layer Feature Fusion and Channel Attention
    Chuai, Qinliang
    He, Xiaowei
    Li, Yi
    ELECTRONICS, 2023, 12 (16)
  • [32] DTCA: Dual-Branch Transformer with Cross-Attention for EEG and Eye Movement Data Fusion
    Zhang, Xiaoshan
    Shi, Enze
    Yu, Sigang
    Zhang, Shu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT II, 2024, 15002 : 141 - 151
  • [33] Joint Transformer and Mamba fusion for multispectral object detection
    Li, Chao
    Peng, Xiaoming
    IMAGE AND VISION COMPUTING, 2025, 156
  • [34] DECA-Net: Dual encoder and cross-attention fusion network for surgical instrument segmentation
    Liang, Sixin
    Zhang, Jianzhou
    Bian, Ang
    You, Jiaying
    PATTERN RECOGNITION LETTERS, 2024, 185 : 130 - 136
  • [35] Small object detection model based on feature fusion of attention mechanism
    Chen H.
    Zhen X.
    Zhao T.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (03): : 60 - 66
  • [36] Dual-Template Siamese Network with Attention Feature Fusion for Object Tracking
    Liu, Minhua
    Shi, Jiantong
    Wang, Yu
    RADIOENGINEERING, 2023, 32 (03) : 371 - 380
  • [37] Research on Small Object Detection Based on Feature Fusion and Attention Mechanism
    Liu, Jianwei
    Liu, Zheng
    Lu, Jingwen
    Li, Chuancan
    Chen, Gangqiang
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 2285 - 2291
  • [38] Boundary-Aware Feature Fusion With Dual-Stream Attention for Remote Sensing Small Object Detection
    Song, Jingnan
    Zhou, Mingliang
    Luo, Jun
    Pu, Huayan
    Feng, Yong
    Wei, Xuekai
    Jia, Weijia
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [39] MFSA-Net: Semantic Segmentation With Camera-LiDAR Cross-Attention Fusion Based on Fast Neighbor Feature Aggregation
    Duan, Yijian
    Meng, Liwen
    Meng, Yanmei
    Zhu, Jihong
    Zhang, Jiacheng
    Zhang, Jinlai
    Liu, Xin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 19627 - 19639
  • [40] Edge-guided Contextual Attention Fusion Network for Camouflaged Object Detection
    Hu, Bo
    Chen, Sibao
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 108 - 112