CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images

被引:45
作者
Feng, Zhen [1 ,2 ]
Guo, Yanning [2 ]
Sun, Yuxiang [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Mech Engn, Hung Hom, Kowloon, Hong Kong, Peoples R China
[2] Harbin Inst Technol, Dept Control Sci & Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Image edge detection; Decoding; Semantic segmentation; Feature extraction; Semantics; Sun; Lighting; Knowledge distillation; semantic segmentation; privileged information; autonomous driving; thermal images; SEGMENTATION;
D O I
10.1109/LRA.2023.3247175
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Semantic scene understanding using thermal images has received great attention due to the advantage that thermal imaging cameras could see in challenging illumination conditions. However, thermal images are lack of color information and the edges in thermal images are often blurred, making them not very suitable to be directly used by existing semantic segmentation networks that are designed with RGB images. To address this problem, we propose a cross-modal edge-privileged knowledge distillation framework, which utilizes a well-trained RGB-Thermal fusion-based semantic segmentation network with edge-privileged information as the teacher, to guide the training of a semantic segmentation network as the student. The student network only uses thermal images. The experimental results on a public dataset demonstrate that under the guidance of the teacher, the student network achieves superior performance over the state of the arts using only thermal images.
引用
收藏
页码:2205 / 2212
页数:8
相关论文
共 30 条
[1]   Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges [J].
Feng, Di ;
Haase-Schutz, Christian ;
Rosenbaum, Lars ;
Hertlein, Heinz ;
Glaser, Claudius ;
Timm, Fabian ;
Wiesbeck, Werner ;
Dietmayer, Klaus .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (03) :1341-1360
[2]  
Ha Q, 2017, IEEE INT C INT ROBOT, P5108, DOI 10.1109/IROS.2017.8206396
[3]   A Brief Survey on Semantic Segmentation with Deep Learning [J].
Hao, Shijie ;
Zhou, Yuan ;
Guo, Yanrong .
NEUROCOMPUTING, 2020, 406 :302-321
[4]   FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-Based CNN Architecture [J].
Hazirbas, Caner ;
Ma, Lingni ;
Domokos, Csaba ;
Cremers, Daniel .
COMPUTER VISION - ACCV 2016, PT I, 2017, 10111 :213-228
[5]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[6]  
Hu Y, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P782
[7]  
Ji M, 2021, AAAI CONF ARTIF INTE, V35, P7945
[8]   MS-UDA: Multi-Spectral Unsupervised Domain Adaptation for Thermal Image Semantic Segmentation [J].
Kim, Yeong-Hyeon ;
Shin, Ukcheol ;
Park, Jinsun ;
Kweon, In So .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) :6497-6504
[9]   Virtual Multi-view Fusion for 3D Semantic Segmentation [J].
Kundu, Abhijit ;
Yin, Xiaoqi ;
Fathi, Alireza ;
Ross, David ;
Brewington, Brian ;
Funkhouser, Thomas ;
Pantofaru, Caroline .
COMPUTER VISION - ECCV 2020, PT XXIV, 2020, 12369 :518-535
[10]   Cross-Modal Knowledge Distillation for Depth Privileged Monocular Visual Odometry [J].
Li, Bin ;
Wang, Shuling ;
Ye, Haifeng ;
Gong, Xiaojin ;
Xiang, Zhiyu .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) :6171-6178