CDAda: A Curriculum Domain Adaptation for Nighttime Semantic Segmentation

被引：44

作者：

Xu, Qi ^{[1
]}

Ma, Yinan ^{[1
]}

Wu, Jing ^{[1
]}

Long, Chengnian ^{[1
]}

Huang, Xiaolin ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai, Peoples R China

来源：

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021) | 2021年

基金：

中国国家自然科学基金;

关键词：

PEDESTRIAN DETECTION; TRACKING;

D O I：

10.1109/ICCVW54120.2021.00331

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Autonomous driving needs to ensure all-weather safety, especially in unfavorable environments such as night and rain. However, the current daytime-trained semantic segmentation networks face significant performance degradation at night because of the huge domain divergence. In this paper, we propose a novel Curriculum Domain Adaptation method (CDAda) to realize the smooth semantic knowledge transfer from daytime to nighttime. Specifically, it consists of two steps: 1) inter-domain style adaptation: fine-tune the daytime-trained model on the labeled synthetic nighttime images through the proposed frequency-based style transformation method (replace the low frequency components of daytime images with those of nighttime images); 2) intra-domain gradual self-training: separate the nighttime domain into the easy split nighttime domain and hard split nighttime domain based on the "entropy + illumination" ranking principle, then gradually adapt the model to the two sub-domains through pseudo supervision on easy split data and entropy minimization on hard split data. To the best of our knowledge, we first extend the idea of intra-domain adaptation to self-training and prove different treatments on two parts can reduce the distribution divergence in the nighttime domain itself. In particular, aimed at the adopted unlabeled day-night image pairs, the prediction of the daytime images can guide the segmentation on the nighttime images by ensuring patch-level consistency. Extensive experiments on Nighttime Driving, Dark Zurich, and BDD100K-night dataset highlight the effectiveness of our approach with the more favorable performance 50.9%, 45.0%, and 33.8% Mean IoU against existing state-of-the-art approaches.

引用

页码：2962 / 2971

页数：10

共 46 条

[1] Road Detection Based on Illuminant Invariance [J].

Alvarez, Jose M. ;

Lopez, Antonio M. .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (01) :184-193

[2]

Bengio Y., 2009, P 26 ANN INT C MACH, P41

[3]

Bijelic M, 2018, IEEE INT VEH SYM, P1773

[4] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[5] Night-time Pedestrian Detection by Visual-Infrared Video Fusion [J].

Chen, Yuxi ;

Han, Chongzhao .

2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, :5079-5084

[6] Cars Can't Fly up in the Sky: Improving Urban-Scene Segmentation via Height-driven Attention Networks [J].

Choi, Sungha ;

Kim, Joanne T. ;

Choo, Jaegul .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9370-9380

[7] KAIST Multi-Spectral Day/Night Data Set for Autonomous and Assisted Driving [J].

Choi, Yukyung ;

Kim, Namil ;

Hwang, Soonmin ;

Park, Kibaek ;

Yoon, Jae Shin ;

An, Kyounghwan ;

Kweon, In So .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (03) :934-948

[8] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[9] Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding [J].

Dai, Dengxin ;

Sakaridis, Christos ;

Hecker, Simon ;

Van Gool, Luc .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (05) :1182-1204

[10]

Dai DX, 2018, IEEE INT C INTELL TR, P3819, DOI 10.1109/ITSC.2018.8569387

← 1 2 3 4 5 →