DTNet: A Specialized Dual-Tuning Network for Infrared Vehicle Detection in Aerial Images

被引:10
作者
Zhang, Nan [1 ]
Liu, Youmeng [1 ]
Liu, Hao [1 ]
Tian, Tian [1 ]
Ma, Jiayi [2 ]
Tian, Jinwen [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Natl Key Lab Multispectral Informat IntelligentPro, Wuhan 430074, Peoples R China
[2] Wuhan Univ, Elect Informat Sch, Wuhan 430072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
frequency decomposition; image degradation; infrared aerial image; vehicle detection; Feature extraction;
D O I
10.1109/TGRS.2024.3386309
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Vehicle detection in infrared aerial images is vital for both military and civilian applications, as infrared imaging remains effective under low-light conditions and various adverse weather scenarios. However, the longer wavelengths of long-wave infrared, compared to visible light, make diffraction more noticeable, leading to low-frequency degradation of vehicle information. Thermal radiation from the environment and optical system leads to higher noise in infrared images. Additionally, atmospheric transport models for various weather conditions can degrade infrared images to different extents. These factors lead to a reduced signal-to-noise ratio (SNR), which complicates the extraction of clear features. To overcome these challenges, we propose the dual-tuning network (DTNet), an advanced framework for vehicle detection in infrared aerial images, developed based on the mechanisms of infrared imaging. Specifically, the core component of DTNet is the dual-tuning block (DTBlock), which works alongside the dynamic guided filtering module (DGFM) and the point spread recovery module (PSRM) for feature extraction. DTBlock decomposes feature maps into low- and high-frequency components with learnable low-pass filters. DGFM eliminates disturbance from the optical system and background thermal radiation in the high-frequency component of feature maps, while preserving the details and texture of vehicles. PSRM aggregates vehicle features in the low-frequency component of feature maps, which is proposed with reference to diffraction and atmospheric models. The concept of dual-tuning refers to enhancing the signal and suppressing interference in the low- and high-frequency parts, respectively. Experimental results on the DroneVehicle public dataset for infrared vehicle detection indicate that our proposed approach achieves state-of-the-art (SOTA) performance. Moreover, extensive ablation studies confirm the superior capability of our DTNet in robust feature extraction from infrared images.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 89 条
[1]  
Chen DQ, 2018, PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), P662, DOI 10.1109/IAEAC.2018.8577211
[2]   Advanced Automatic Target Recognition (ATR) with Infrared (IR) Sensors [J].
Chen, Hai-Wen ;
Gross, Neal ;
Kapadia, Ravi ;
Cheah, Joseph ;
Gharbieh, Mo .
2021 IEEE AEROSPACE CONFERENCE (AEROCONF 2021), 2021,
[3]   Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks [J].
Chen, Jierun ;
Kao, Shiu-Hong ;
He, Hao ;
Zhuo, Weipeng ;
Wen, Song ;
Lee, Chul-Ho ;
Chan, S. -H. Gary .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12021-12031
[4]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[5]   Instance Segmentation in the Dark [J].
Chen, Linwei ;
Fu, Ying ;
Wei, Kaixuan ;
Zheng, Dezhi ;
Heide, Felix .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (08) :2198-2218
[6]   Image denoising by sparse 3-D transform-domain collaborative filtering [J].
Dabov, Kostadin ;
Foi, Alessandro ;
Katkovnik, Vladimir ;
Egiazarian, Karen .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (08) :2080-2095
[7]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[8]   LASER DIFFRACTION SPECTROMETRY - FRAUNHOFER-DIFFRACTION VERSUS MIE SCATTERING [J].
DEBOER, GBJ ;
DEWEERD, C ;
THOENES, D ;
GOOSSENS, HWJ .
PARTICLE CHARACTERIZATION, 1987, 4 (01) :14-19
[9]  
Dhanaraj M., 2020, SPIE Defense Commercial Sensing, P22
[10]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853