Adversarial Domain Adaptation With Prototype-Based Normalized Output Conditioner

被引:34
作者
Hu, Dapeng [1 ]
Liang, Jian [2 ]
Hou, Qibin [3 ]
Yan, Hanshu [1 ]
Chen, Yunpeng [4 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 119077, Singapore
[2] Chinese Acad Sci CASIA, Ctr Res Intelligent Percept & Comp CRIPAC, Natl Lab Pattern Recognit NLPR, Beijing 100190, Peoples R China
[3] Nankai Univ, Dept Comp Sci, Jinnan Campus, Tianjin 300350, Peoples R China
[4] Meitu Inc, Beijing 100083, Peoples R China
关键词
Training; Task analysis; Semantics; Sensitivity; Object recognition; Predictive models; Prototypes; Domain adaptation; adversarial learning; prototype; semantic structures; pseudo-labels;
D O I
10.1109/TIP.2021.3124674
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain adversarial training has become a prevailing and effective paradigm for unsupervised domain adaptation (UDA). To successfully align the multi-modal data structures across domains, the following works exploit discriminative information in the adversarial training process, e.g., using multiple class-wise discriminators and involving conditional information in the input or output of the domain discriminator. However, these methods either require non-trivial model designs or are inefficient for UDA tasks. In this work, we attempt to address this dilemma by devising simple and compact conditional domain adversarial training methods. We first revisit the simple concatenation conditioning strategy where features are concatenated with output predictions as the input of the discriminator. We find the concatenation strategy suffers from the weak conditioning strength. We further demonstrate that enlarging the norm of concatenated predictions can effectively energize the conditional domain alignment. Thus we improve concatenation conditioning by normalizing the output predictions to have the same norm of features, and term the derived method as Normalized OutpUt coNditioner (NOUN). However, conditioning on raw output predictions for domain alignment, NOUN suffers from inaccurate predictions of the target domain. To this end, we propose to condition the cross-domain feature alignment in the prototype space rather than in the output space. Combining the novel prototype-based conditioning with NOUN, we term the enhanced method as PROtotype-based Normalized OutpUt coNditioner (PRONOUN). Experiments on both object recognition and semantic segmentation show that NOUN can effectively align the multi-modal structures across domains and even outperform state-of-the-art domain adversarial training methods. Together with prototype-based conditioning, PRONOUN further improves the adaptation performance over NOUN on multiple object recognition benchmarks for UDA. Code is available at https://github.com/tim-learn/NOUN.
引用
收藏
页码:9359 / 9371
页数:13
相关论文
共 70 条
[1]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[2]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[3]  
Chen XY, 2019, PR MACH LEARN RES, V97
[4]   No More Discrimination: Cross City Adaptation of Road Scene Segmenters [J].
Chen, Yi-Hsin ;
Chen, Wei-Yu ;
Chen, Yu-Ting ;
Tsai, Bo-Cheng ;
Wang, Yu-Chiang Frank ;
Sun, Min .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2011-2020
[5]  
Choi Jaehoon, 2019, BMVC, P2
[6]   Unsupervised Domain Adaptation via Regularized Conditional Alignment [J].
Cicek, Safa ;
Soatto, Stefano .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1416-1425
[7]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[8]   Cluster Alignment with a Teacher for Unsupervised Domain Adaptation [J].
Deng, Zhijie ;
Luo, Yucen ;
Zhu, Jun .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9943-9952
[9]  
Dudik M., 2006, ADV NEURAL INFORM PR, V18, P323
[10]   Unsupervised Visual Domain Adaptation Using Subspace Alignment [J].
Fernando, Basura ;
Habrard, Amaury ;
Sebban, Marc ;
Tuytelaars, Tinne .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :2960-2967