INSANet: INtra-INter Spectral Attention Network for Effective Feature Fusion of Multispectral Pedestrian Detection

Cited by: 6
Authors
Lee, Sangin [1]
Kim, Taejoo [2]
Shin, Jeongmin [2]
Kim, Namil [3]
Choi, Yukyung [2]
Affiliations
[1] Sejong Univ, Dept Software, Seoul 05006, South Korea
[2] Sejong Univ, Dept Convergence Engn Intelligent Drone, Seoul 05006, South Korea
[3] NAVER LABS, Seongnam 13561, South Korea
Keywords
autonomous vehicle; computer vision; data augmentation; feature fusion; multispectral; pedestrian detection
DOI
10.3390/s24041168
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Pedestrian detection is a critical task for safety-critical systems, but detecting pedestrians is challenging in low-light and adverse weather conditions. Thermal images can improve robustness by providing information complementary to RGB images. Previous studies have shown that multi-modal feature fusion using convolution operations can be effective, but such methods rely solely on local feature correlations, which can degrade performance. To address this issue, we propose a novel attention-based fusion network, referred to as INSANet (INtra-INter Spectral Attention Network), that captures global intra- and inter-spectral information. It consists of intra- and inter-spectral attention blocks that allow the model to learn mutual spectral relationships. Additionally, we identify an imbalance in the multispectral dataset caused by several factors and design an augmentation strategy that mitigates concentrated distributions and enables the model to learn the diverse locations of pedestrians. Extensive experiments demonstrate the effectiveness of the proposed methods, which achieve state-of-the-art performance on the KAIST and LLVIP datasets. Finally, we conduct a regional performance evaluation to demonstrate the effectiveness of the proposed network in various regions.
Pages: 17