INSANet: INtra-INter Spectral Attention Network for Effective Feature Fusion of Multispectral Pedestrian Detection

被引:11
作者
Lee, Sangin [1 ]
Kim, Taejoo [2 ]
Shin, Jeongmin [2 ]
Kim, Namil [3 ]
Choi, Yukyung [2 ]
机构
[1] Sejong Univ, Dept Software, Seoul 05006, South Korea
[2] Sejong Univ, Dept Convergence Engn Intelligent Drone, Seoul 05006, South Korea
[3] NAVER LABS, Seongnam 13561, South Korea
关键词
autonomous vehicle; computer vision; data augmentation; feature fusion; multispectral; pedestrian detection;
D O I
10.3390/s24041168
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Pedestrian detection is a critical task for safety-critical systems, but detecting pedestrians is challenging in low-light and adverse weather conditions. Thermal images can be used to improve robustness by providing complementary information to RGB images. Previous studies have shown that multi-modal feature fusion using convolution operation can be effective, but such methods rely solely on local feature correlations, which can degrade the performance capabilities. To address this issue, we propose an attention-based novel fusion network, referred to as INSANet (INtra-INter Spectral Attention Network), that captures global intra- and inter-information. It consists of intra- and inter-spectral attention blocks that allow the model to learn mutual spectral relationships. Additionally, we identified an imbalance in the multispectral dataset caused by several factors and designed an augmentation strategy that mitigates concentrated distributions and enables the model to learn the diverse locations of pedestrians. Extensive experiments demonstrate the effectiveness of the proposed methods, which achieve state-of-the-art performance on the KAIST dataset and LLVIP dataset. Finally, we conduct a regional performance evaluation to demonstrate the effectiveness of our proposed network in various regions.
引用
收藏
页数:17
相关论文
共 50 条
[21]   EFFECTIVE FEATURE FUSION NETWORK IN BIFPN FOR SMALL OBJECT DETECTION [J].
Chen, Jun ;
Mai, HongSheng ;
Luo, Linbo ;
Chen, Xiaoqiang ;
Wu, Kangle .
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, :699-703
[22]   DACFusion: Dual Asymmetric Cross-Attention guided feature fusion for multispectral object detection [J].
Qian, Jingchen ;
Qiao, Baiyou ;
Zhang, Yuekai ;
Liu, Tongyan ;
Wang, Shuo ;
Wu, Gang ;
Han, Donghong .
NEUROCOMPUTING, 2025, 635
[23]   TCCDNet: A Multimodal Pedestrian Detection Network Integrating Cross-Modal Complementarity with Deep Feature Fusion [J].
Han, Shipeng ;
Chai, Chaowen ;
Hu, Min ;
Wang, Yanni ;
Jiao, Teng ;
Wang, Jianqi ;
Lv, Hao .
SENSORS, 2025, 25 (09)
[24]   Attention guided contextual feature fusion network for salient object detection [J].
Zhang, Jin ;
Shi, Yanjiao ;
Zhang, Qing ;
Cui, Liu ;
Chen, Ying ;
Yi, Yugen .
IMAGE AND VISION COMPUTING, 2022, 117
[25]   Attention-based acoustic feature fusion network for depression detection [J].
Xu, Xiao ;
Wang, Yang ;
Wei, Xinru ;
Wang, Fei ;
Zhang, Xizhe .
NEUROCOMPUTING, 2024, 601
[26]   Attention Feature Fusion Network for Rapid Aircraft Detection in SAR Images [J].
Zhao Y. ;
Zhao L.-J. ;
Kuang G.-Y. .
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (09) :1665-1674
[27]   Locality guided cross-modal feature aggregation and pixel-level fusion for multispectral pedestrian detection [J].
Cao, Yanpeng ;
Luo, Xing ;
Yang, Jiangxin ;
Cao, Yanlong ;
Yang, Michael Ying .
INFORMATION FUSION, 2022, 88 :1-11
[28]   HFMIDet: Hierarchical Feature Fusion-Guided Multidimensional Infrared Pedestrian Detection Network [J].
Liu, Yang ;
Zhang, Ming ;
Fan, Fei ;
Yu, Dahua ;
Li, Jianjun .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
[29]   Effective Dual-Feature Fusion Network for Transmission Line Detection [J].
Zhou, Wujie ;
Ji, Chuanming ;
Fang, Meixin .
IEEE SENSORS JOURNAL, 2024, 24 (01) :101-109
[30]   Multi-attention guided feature fusion network for salient object detection [J].
Li, Anni ;
Qi, JinQing ;
Lu, Huchuan .
NEUROCOMPUTING, 2020, 411 :416-427