Multimodal Object Detection by Channel Switching and Spatial Attention

被引:24
|
作者
Cao, Yue [1 ]
Bin, Junchi [1 ]
Hamari, Jozsef [2 ]
Blasch, Erik [3 ]
Liu, Zheng [1 ]
机构
[1] Univ British Columbia, Kelowna, BC, Canada
[2] TerraSense Anal, Kelowna, BC, Canada
[3] MOVEJ Anal, Fairborn, OH USA
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW | 2023年
关键词
D O I
10.1109/CVPRW59228.2023.00046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal object detection has attracted great attention in recent years since the information specific to different modalities can complement each other and effectively improve the accuracy and stability of the detection model. However, compared to processing the inputs from a single modality, fusing information from multiple modalities can significantly increase the computational complexity of the model, thus impairing its efficiency. Therefore the multimodal fusion module needs to be carefully designed to enhance the performance of the detection model while keeping the computational consumption low. In this paper, we propose a novel lightweight fusion module that can efficiently fuse the inputs from different modalities using channel switching and spatial attention (CSSA). The effectiveness and generalizability of the module are tested using two public multimodal datasets LLVIP and FLIR, both of which comprise paired infrared (IR) and visible (RGB) images. The experiments demonstrate that the proposed CSSA module can substantially improve the accuracy of multimodal object detection without consuming excessive computing resources.
引用
收藏
页码:403 / 411
页数:9
相关论文
共 50 条
  • [1] Group channel pruning and spatial attention distilling for object detection
    Yun Chu
    Pu Li
    Yong Bai
    Zhuhua Hu
    Yongqing Chen
    Jiafeng Lu
    Applied Intelligence, 2022, 52 : 16246 - 16264
  • [2] Group channel pruning and spatial attention distilling for object detection
    Chu, Yun
    Li, Pu
    Bai, Yong
    Hu, Zhuhua
    Chen, Yongqing
    Lu, Jiafeng
    APPLIED INTELLIGENCE, 2022, 52 (14) : 16246 - 16264
  • [3] Object Detection by Channel and Spatial Exchange for Multimodal Remote Sensing Imagery
    Nan, Guozheng
    Zhao, Yue
    Fu, Liyong
    Ye, Qiaolin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 8581 - 8593
  • [4] Hybrid spatial and channel attention in post-accident object detection
    Kim, Junyoung
    Lee, Soomok
    IET INTELLIGENT TRANSPORT SYSTEMS, 2025, 19 (01)
  • [5] Channel-Spatial Mutual Attention Network for 360° Salient Object Detection
    Zhang, Yi
    Hamidouche, Wassim
    Deforges, Olivier
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3436 - 3442
  • [6] On the Optimality of Spatial Attention for Object Detection
    Harel, Jonathan
    Koch, Christof
    ATTENTION IN COGNITIVE SYSTEMS, 2009, 5395 : 1 - 14
  • [7] Underwater Object Detection Algorithm Based on Adding Channel and Spatial Fusion Attention Mechanism
    Wang, Xingyao
    Xue, Gang
    Huang, Shuting
    Liu, Yanjun
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (06)
  • [8] SCANET: SPATIAL-CHANNEL ATTENTION NETWORK FOR 3D OBJECT DETECTION
    Lu, Haihua
    Chen, Xuesong
    Zhang, Guiying
    Zhou, Qiuhao
    Ma, Yanbo
    Zhao, Yong
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1992 - 1996
  • [9] Mixed local channel attention for object detection
    Wan, Dahang
    Lu, Rongsheng
    Shen, Siyuan
    Xu, Ting
    Lang, Xianli
    Ren, Zhijie
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [10] Spatial and Channel Attention Mechanism Method for Object Tracking
    Liu Jiamin
    Xie Wenjie
    Huang Hong
    Tang Yiming
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (09) : 2569 - 2576