SSTN: Self-Supervised Domain Adaptation Thermal Object Detection for Autonomous Driving

被引:27
作者
Munir, Farzeen [1 ]
Azam, Shoaib [1 ]
Jeon, Moongu [1 ]
机构
[1] Gwangju Inst Sci & Technol, Sch Elect Engn & Comp Sci, Gwangju, South Korea
来源
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2021年
关键词
Self-supervised learning; Contrastive learning; Thermal object detection;
D O I
10.1109/IROS51168.2021.9636353
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The perception of the environment plays a decisive role in the safe and secure operation of autonomous vehicles. The perception of the surrounding is way similar to human vision. The human's brain perceives the environment by utilizing different sensory channels and develop a view-invariant representation model. In this context, different exteroceptive sensors like cameras, Lidar, are deployed on the autonomous vehicle to perceive the environment. These sensors have illustrated their benefit in the visible spectrum domain yet in the adverse weather conditions; for instance, they have limited operational capability at night, leading to fatal accidents. This work explores thermal object detection to model a view-invariant model representation by employing the self-supervised contrastive learning approach. We have proposed a deep neural network Self Supervised Thermal Network (SSTN) for learning the feature embedding to maximize the information between visible and infrared spectrum domain by contrastive learning. Later, these learned feature representations are employed for thermal object detection using a multi-scale encoder-decoder transformer network. The proposed method is extensively evaluated on the two publicly available datasets: the FLIR-ADAS dataset and the KAIST Multi-Spectral dataset. The experimental results illustrate the efficacy of the proposed method.
引用
收藏
页码:206 / 213
页数:8
相关论文
共 47 条
[1]  
Agrawal K., 2019, ARXIV PREPRINT ARXIV
[2]  
[Anonymous], 2006, PROC IEEE COMPUT SOC, DOI 10.1109/CVPR.2006.100
[3]   Efficient Pedestrian Detection at Nighttime Using a Thermal Camera [J].
Baek, Jeonghyun ;
Hong, Sungjun ;
Kim, Jisu ;
Kim, Euntai .
SENSORS, 2017, 17 (08)
[4]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[5]  
Chen T, 2020, PR MACH LEARN RES, V119
[6]   Self-Supervised GANs via Auxiliary Rotation Loss [J].
Chen, Ting ;
Zhai, Xiaohua ;
Ritter, Marvin ;
Lucic, Mario ;
Houlsby, Neil .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12146-12155
[7]   How prediction errors shape perception, attention, and motivation [J].
den Ouden, Hanneke E. M. ;
Kok, Peter ;
de Lange, Floris P. .
FRONTIERS IN PSYCHOLOGY, 2012, 3
[8]   Borrow from Anywhere: Pseudo Multi-modal Object Detection in Thermal Imagery [J].
Devaguptapu, Chaitanya ;
Akolekar, Ninad ;
Sharma, Manuj M. ;
Balasubramanian, Vineeth N. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :1029-1038
[9]   Unsupervised Visual Representation Learning by Context Prediction [J].
Doersch, Carl ;
Gupta, Abhinav ;
Efros, Alexei A. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1422-1430
[10]   Pedestrian Detection in Thermal Images using Saliency Maps [J].
Ghose, Debasmita ;
Desai, Shasvat M. ;
Bhattacharya, Sneha ;
Chakraborty, Deep ;
Fiterau, Madalina ;
Rahman, Tauhidur .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :988-997