SSTNet: Sliced Spatio-Temporal Network With Cross-Slice ConvLSTM for Moving Infrared Dim-Small Target Detection

Citations: 31
Authors
Chen, Shengjia [1]
Ji, Luping [1]
Zhu, Jiewen [1]
Ye, Mao [1]
Yao, Xiaoyong [2]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Jinggangshan Univ, Sch Mech & Elect Engn, Jian 343009, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024 / Vol. 62
Keywords
Feature extraction; Object detection; Videos; Tensors; Neck; Visualization; Training; Infrared dim-small target detection; motion-coordination loss (MCL); motion-coupling neck; sliced spatio-temporal network (SSTNet); LOCAL CONTRAST METHOD; MODEL;
DOI
10.1109/TGRS.2024.3350024
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708 ; 070902 ;
Abstract
Infrared dim-small target detection, an important branch of object detection, has attracted research attention in recent decades. Its challenges lie mainly in the small target sizes and the dim contrast against background images. Recent schemes mainly focus on improving spatio-temporal feature representation within a single-slice temporal scope only. Cross-slice motion context, i.e., past and future, is seldom considered for enhancing target features. To use cross-slice motion context, this article proposes a sliced spatio-temporal network (SSTNet) with cross-slice enhancement for moving infrared dim-small target detection. In our scheme, a new cross-slice ConvLSTM node is designed to capture spatio-temporal motion features both within a slice and across slices. Moreover, to improve infrared small target motion feature learning, we extend the conventional loss function with a new motion-coordination loss (MCL) term. Building on these, we propose a motion-coupling neck that helps the feature extractor capture and use motion features from multiple frames. To the best of our knowledge, our work is the first to explore cross-slice spatio-temporal motion modeling for infrared dim-small targets. Experiments verify that our SSTNet could refresh most state-of-the-art metrics on two public benchmarks (DAUB and IRDST). Our source code is available at https://github.com/UESTC-nnLab/SSTNet.
Pages: 1 / 12
Page count: 12