Adapting Object Detectors via Selective Cross-Domain Alignment

被引:186
作者
Zhu, Xinge [1 ]
Pang, Jiangmiao [2 ]
Yang, Ceyuan [1 ]
Shi, Jianping [3 ]
Lin, Dahua [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
[3] SenseTime Res, Hong Kong, Peoples R China
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art object detectors are usually trained on public datasets. They often face substantial difficulties when applied to a different domain, where the imaging condition differs significantly and the corresponding annotated data are unavailable (or expensive to acquire). A natural remedy is to adapt the model by aligning the image representations on both domains. This can be achieved, for example, by adversarial learning, and has been shown to be effective in tasks like image classification. However, we found that in object detection, the improvement obtained in this way is quite limited. An important reason is that conventional domain adaptation methods strive to align images as a whole, while object detection, by nature, focuses on local regions that may contain objects of interest. Motivated by this, we propose a novel approach to domain adaption for object detection to handle the issues in "where to look" and "how to align". Our key idea is to mine the discriminative regions, namely those that are directly pertinent to object detection, and focus on aligning them across both domains. Experiments show that the proposed method performs remarkably better than existing methods with about 4% similar to 6% improvement under various domain-shift scenarios while keeping good scalability.
引用
收藏
页码:687 / 696
页数:10
相关论文
共 49 条
[41]  
Tao Y, 2017, CHIN CONTR CONF, P4288, DOI 10.23919/ChiCC.2017.8028032
[42]   Learning to Adapt Structured Output Space for Semantic Segmentation [J].
Tsai, Yi-Hsuan ;
Hung, Wei-Chih ;
Schulter, Samuel ;
Sohn, Kihyuk ;
Yang, Ming-Hsuan ;
Chandraker, Manmohan .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7472-7481
[43]   Adversarial Discriminative Domain Adaptation [J].
Tzeng, Eric ;
Hoffman, Judy ;
Saenko, Kate ;
Darrell, Trevor .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2962-2971
[44]  
Ulyanov Dmitry, 2016, ARXIV
[45]   Pose Guided Human Video Generation [J].
Yang, Ceyuan ;
Wang, Zhe ;
Zhu, Xinge ;
Huang, Chen ;
Shi, Jianping ;
Lin, Dahua .
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :204-219
[46]  
Zhang LH, 2015, INT CONF SOFTW ENG, P931, DOI 10.1109/ICSESS.2015.7339207
[47]   Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes [J].
Zhang, Yang ;
David, Philip ;
Gong, Boqing .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2039-2049
[48]   Penalizing Top Performers: Conservative Loss for Semantic Segmentation Adaptation [J].
Zhu, Xinge ;
Zhou, Hui ;
Yang, Ceyuan ;
Shi, Jianping ;
Lin, Dahua .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :587-603
[49]   Generative Adversarial Frontal View to Bird View Synthesis [J].
Zhu, Xinge ;
Yin, Zhichao ;
Shi, Jianping ;
Li, Hongsheng ;
Lin, Dahua .
2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, :454-463