Uncertainty-Aware Unsupervised Domain Adaptation in Object Detection

被引:81
作者
Guan, Dayan [1 ]
Huang, Jiaxing [1 ]
Xiao, Aoran [1 ]
Lu, Shijian [1 ]
Cao, Yanpeng [2 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[2] Zhejiang Univ, Sch Mech Engn, Hangzhou 310027, Peoples R China
关键词
Proposals; Uncertainty; Object detection; Entropy; Feature extraction; Detectors; Convolutional neural networks; Unsupervised domain adaptation; object detection; adversarial learning; curriculum learning;
D O I
10.1109/TMM.2021.3082687
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unsupervised domain adaptive object detection aims to adapt detectors from a labelled source domain to an unlabelled target domain. Most existing works take a two-stage strategy that first generates region proposals and then detects objects of interest, where adversarial learning is widely adopted to mitigate the inter-domain discrepancy in both stages. However, adversarial learning may impair the alignment of well-aligned samples as it merely aligns the global distributions across domains. To address this issue, we design an uncertainty-aware domain adaptation network (UaDAN) that introduces conditional adversarial learning to align well-aligned and poorly-aligned samples separately in different manners. Specifically, we design an uncertainty metric that assesses the alignment of each sample and adjusts the strength of adversarial learning for well-aligned and poorly-aligned samples adaptively. In addition, we exploit the uncertainty metric to achieve curriculum learning that first performs easier image-level alignment and then more difficult instance-level alignment progressively. Extensive experiments over four challenging domain adaptive object detection datasets show that UaDAN achieves superior performance as compared with state-of-the-art methods.
引用
收藏
页码:2502 / 2514
页数:13
相关论文
共 81 条
[1]  
Alldrin N., 2008, PROC CVPR IEEE, P18
[2]  
[Anonymous], 2011, Advances in Neural Information Processing Systems
[3]  
[Anonymous], Bell System Technical Journal
[4]   KTransGAN: Variational Inference-Based Knowledge Transfer for Unsupervised Conditional Generative Learning [J].
Azzam, Mohamed ;
Wu, Wenhao ;
Cao, Wenming ;
Wu, Si ;
Wong, Hau-San .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :3318-3331
[5]   A theory of learning from different domains [J].
Ben-David, Shai ;
Blitzer, John ;
Crammer, Koby ;
Kulesza, Alex ;
Pereira, Fernando ;
Vaughan, Jennifer Wortman .
MACHINE LEARNING, 2010, 79 (1-2) :151-175
[6]  
Bengio Y., 2009, P 26 ANN INT C MACH, DOI DOI 10.1145/1553374.1553380
[7]   Exploring Object Relation in Mean Teacher for Cross-Domain Detection [J].
Cai, Qi ;
Pan, Yingwei ;
Ngo, Chong-Wah ;
Tian, Xinmei ;
Duan, Lingyu ;
Yao, Ting .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11449-11458
[8]   Cascade R-CNN: High Quality Object Detection and Instance Segmentation [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1483-1498
[9]  
Campbell D., 2017, IEEE I CONF COMP VIS
[10]   Transductive semi-supervised metric learning for person re-identification [J].
Chang, Xinyuan ;
Ma, Zhiheng ;
Wei, Xing ;
Hong, Xiaopeng ;
Gong, Yihong .
PATTERN RECOGNITION, 2020, 108