ABMNET: COUPLING TRANSFORMER WITH CNN BASED ON ADAMS-BASHFORTH-MOULTON METHOD FOR INFRARED SMALL TARGET DETECTION

被引:7
作者
Chen, Tianxiang [1 ,3 ]
Chu, Qi [1 ,3 ]
Tan, Zhentao [1 ,2 ,3 ]
Liu, Bin [1 ,3 ]
Yu, Nenghai [1 ,3 ]
机构
[1] Univ Sci & Technol China, Sch Cyber Sci & Technol, Beijing, Peoples R China
[2] Alibaba DAMO Acad, Beijing, Peoples R China
[3] Chinese Acad Sci, Key Lab Elect Space Informat, Beijing, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME | 2023年
基金
中国国家自然科学基金;
关键词
Infrared small target detection; Adams-Bashforth-Moulton method; Vision transformer;
D O I
10.1109/ICME55011.2023.00326
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Infrared small target detection (ISTD) aims at segmenting the small targets from infrared images, which has wide applications in military surveillance. Present methods are mainly based on CNN and focus on modelling locality while ignoring global dependencies, which are indispensable because the local areas similar to small targets always spread over most of the background, causing heavy target ambiguity. Recently, RKformer [1] has combined local features with global dependencies and further introduced Runge-Kutta method, a one-step Ordinary Differential Equation (ODE) solver, to ISTD and performed well. However, the method simply fuses features from original transformer and residual blocks by naive concatenation, causing insufficient feature interaction. Also, it inevitably brings effective information loss, which greatly impairs ambiguous target features. To address above problems and target ambiguity, we introduce Adams-Bashforth-Moulton method and propose ABMNet, which has (1) multistep memory and self-rectification mechanisms, guaranteeing more sufficient information usage and more accurate detection, (2) and achieves more sufficient interaction of both local and global information. Experiments on MDFA and IRSTD-1k demonstrate the superiority of our method.
引用
收藏
页码:1901 / 1906
页数:6
相关论文
共 20 条
[1]   IRSTFormer: A Hierarchical Vision Transformer for Infrared Small Target Detection [J].
Chen, Gao ;
Wang, Weihua ;
Tan, Sirui .
REMOTE SENSING, 2022, 14 (14)
[2]   Attentional Local Contrast Networks for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (11) :9813-9824
[3]   Asymmetric Contextual Modulation for Infrared Small Target Detection [J].
Dai, Yimian ;
Wu, Yiquan ;
Zhou, Fei ;
Barnard, Kobus .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :949-958
[4]   CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows [J].
Dong, Xiaoyi ;
Bao, Jianmin ;
Chen, Dongdong ;
Zhang, Weiming ;
Yu, Nenghai ;
Yuan, Lu ;
Chen, Dong ;
Guo, Baining .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12114-12124
[5]   A Local Contrast Method Combined With Adaptive Background Estimation for Infrared Small Target Detection [J].
Han, Jinhui ;
Liu, Sibang ;
Qin, Gang ;
Zhao, Qian ;
Zhang, Honghui ;
Li, Nana .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (09) :1442-1446
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]   ODE-inspired Network Design for Single Image Super-Resolution [J].
He, Xiangyu ;
Mo, Zitao ;
Wang, Peisong ;
Liu, Yang ;
Yang, Mingyuan ;
Cheng, Jian .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1732-1741
[8]  
Li B, 2021, Arxiv, DOI arXiv:2104.02308
[9]   Dense Nested Attention Network for Infrared Small Target Detection [J].
Li, Boyang ;
Xiao, Chao ;
Wang, Longguang ;
Wang, Yingqian ;
Lin, Zaiping ;
Li, Miao ;
An, Wei ;
Guo, Yulan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 :1745-1758
[10]  
Liu FC, 2021, Arxiv, DOI arXiv:2109.14379