Multistage approach for automatic target detection and recognition in infrared imagery using deep learning

Cited by: 2
Authors
Baili, Nada [1]
Moalla, Mahdi [1]
Frigui, Hichem [1]
Karem, Andrew D. [1]
Affiliations
[1] Univ Louisville, Comp Sci & Engn Dept, Louisville, KY 40292 USA
Keywords
automatic target recognition; infrared imagery; defense systems information analysis center; area under the curve loss; you-only-look-once; convolutional neural networks; TRACKING
DOI
10.1117/1.JRS.16.048505
Chinese Library Classification (CLC) number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Automatic target recognition (ATR) is a challenging task for several computer vision applications. It requires efficient, accurate, and robust methods for target detection and target identification. Deep learning has shown great success in many computer vision applications involving color RGB images. However, the performance of these networks in ATR with infrared sensor data needs further investigation. In this paper, we propose a multistage automatic target detection and recognition (ATDR) system that performs both target detection and target classification on infrared (IR) imagery using deep learning. Our system processes large IR image frames in which targets occupy less than 1% of the total number of pixels. First, we train a state-of-the-art object detector, you only look once (YOLO), to localize all potential targets in the input image frame. Then, we train a convolutional neural network (CNN) to identify these detections as targets or false alarms. In this second phase, we adapt and analyze the performance of three CNN architectures: a compact and fully connected CNN, VGG16 with batch normalization, and a wide residual neural network (WRN). We also explore the use of a loss function that directly optimizes the area under the receiver operating characteristic (ROC) curve (AUC), and adapt it to our ATR application. To enhance the robustness of the proposed ATR to perturbations and variations introduced during the detection stage, we train our CNN classifiers on targets automatically detected by YOLO, in addition to ground-truth bounding boxes, and apply selected data augmentation techniques. To simulate real testing environments, where the spatial location of the targets within the image frame is unknown, only YOLO-detected boxes are used during validation. We evaluate our ATDR on a real benchmark dataset that includes different vehicles captured at different resolutions.
Our experiments show that YOLO can detect most of the targets at the expense of generating a high number of false alarms. We show that VGG16 with batch normalization, which is the best-performing model, can correctly identify the classes of the targets, as well as classify the majority of YOLO's false detections into an additional nontarget class. We also show that the proposed training modification, which optimizes an AUC-based loss function for ATR, is advantageous mainly in identifying difficult targets. © 2022 Society of Photo-Optical Instrumentation Engineers (SPIE)
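The abstract does not give the form of the AUC-based loss, so the following is only a generic illustration of the idea of optimizing AUC directly, not the authors' formulation. It assumes a pairwise squared-hinge surrogate over (target, false-alarm) score pairs; the function names, the `margin` parameter, and the example scores are hypothetical.

```python
from itertools import product


def empirical_auc(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pairs = list(product(pos_scores, neg_scores))
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p, n in pairs)
    return wins / len(pairs)


def pairwise_auc_loss(pos_scores, neg_scores, margin=1.0):
    """Squared-hinge surrogate for 1 - AUC (an assumed, generic choice).

    Each pair is penalized when the positive score does not exceed the
    negative score by at least `margin`; the result is the mean penalty.
    """
    pairs = list(product(pos_scores, neg_scores))
    return sum(max(0.0, margin - (p - n)) ** 2 for p, n in pairs) / len(pairs)


# Hypothetical classifier scores for detections passed on by the detector.
targets = [0.9, 0.8, 0.4]
false_alarms = [0.7, 0.3, 0.1]

auc = empirical_auc(targets, false_alarms)
loss = pairwise_auc_loss(targets, false_alarms)
```

Unlike the empirical AUC, which is a step function of the scores, a surrogate of this kind varies smoothly with the score differences, which is what makes it usable as a training objective.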
Pages: 18