CDAda: A Curriculum Domain Adaptation for Nighttime Semantic Segmentation

被引:44
作者
Xu, Qi [1 ]
Ma, Yinan [1 ]
Wu, Jing [1 ]
Long, Chengnian [1 ]
Huang, Xiaolin [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai, Peoples R China
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021) | 2021年
基金
中国国家自然科学基金;
关键词
PEDESTRIAN DETECTION; TRACKING;
D O I
10.1109/ICCVW54120.2021.00331
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autonomous driving needs to ensure all-weather safety, especially in unfavorable environments such as night and rain. However, the current daytime-trained semantic segmentation networks face significant performance degradation at night because of the huge domain divergence. In this paper, we propose a novel Curriculum Domain Adaptation method (CDAda) to realize the smooth semantic knowledge transfer from daytime to nighttime. Specifically, it consists of two steps: 1) inter-domain style adaptation: fine-tune the daytime-trained model on the labeled synthetic nighttime images through the proposed frequency-based style transformation method (replace the low frequency components of daytime images with those of nighttime images); 2) intra-domain gradual self-training: separate the nighttime domain into the easy split nighttime domain and hard split nighttime domain based on the "entropy + illumination" ranking principle, then gradually adapt the model to the two sub-domains through pseudo supervision on easy split data and entropy minimization on hard split data. To the best of our knowledge, we first extend the idea of intra-domain adaptation to self-training and prove different treatments on two parts can reduce the distribution divergence in the nighttime domain itself. In particular, aimed at the adopted unlabeled day-night image pairs, the prediction of the daytime images can guide the segmentation on the nighttime images by ensuring patch-level consistency. Extensive experiments on Nighttime Driving, Dark Zurich, and BDD100K-night dataset highlight the effectiveness of our approach with the more favorable performance 50.9%, 45.0%, and 33.8% Mean IoU against existing state-of-the-art approaches.
引用
收藏
页码:2962 / 2971
页数:10
相关论文
共 46 条
[1]   Road Detection Based on Illuminant Invariance [J].
Alvarez, Jose M. ;
Lopez, Antonio M. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (01) :184-193
[2]  
Bengio Y., 2009, P 26 ANN INT C MACH, P41
[3]  
Bijelic M, 2018, IEEE INT VEH SYM, P1773
[4]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[5]   Night-time Pedestrian Detection by Visual-Infrared Video Fusion [J].
Chen, Yuxi ;
Han, Chongzhao .
2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, :5079-5084
[6]   Cars Can't Fly up in the Sky: Improving Urban-Scene Segmentation via Height-driven Attention Networks [J].
Choi, Sungha ;
Kim, Joanne T. ;
Choo, Jaegul .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9370-9380
[7]   KAIST Multi-Spectral Day/Night Data Set for Autonomous and Assisted Driving [J].
Choi, Yukyung ;
Kim, Namil ;
Hwang, Soonmin ;
Park, Kibaek ;
Yoon, Jae Shin ;
An, Kyounghwan ;
Kweon, In So .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (03) :934-948
[8]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[9]   Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding [J].
Dai, Dengxin ;
Sakaridis, Christos ;
Hecker, Simon ;
Van Gool, Luc .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (05) :1182-1204
[10]  
Dai DX, 2018, IEEE INT C INTELL TR, P3819, DOI 10.1109/ITSC.2018.8569387