Sequential Instance Refinement for Cross-Domain Object Detection in Images

被引:13
作者
Chen, Jin [1 ,2 ]
Wu, Xinxiao [1 ,2 ]
Duan, Lixin [3 ]
Chen, Lin [4 ]
机构
[1] Beijing Inst Technol, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci, Beijing 100081, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[4] Wyze Labs, Kirkland, WA 98034 USA
关键词
Object detection; Feature extraction; Detectors; Reinforcement learning; Proposals; Task analysis; Benchmark testing; Cross-domain object detection; negative transfer; reinforcement learning;
D O I
10.1109/TIP.2021.3066904
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-domain object detection in images has attracted increasing attention in the past few years, which aims at adapting the detection model learned from existing labeled images (source domain) to newly collected unlabeled ones (target domain). Existing methods usually deal with the cross-domain object detection problem through direct feature alignment between the source and target domains at the image level, the instance level (i.e., region proposals) or both. However, we have observed that directly aligning features of all object instances from the two domains often results in the problem of negative transfer, due to the existence of (1) outlier target instances that contain confusing objects not belonging to any category of the source domain and thus are hard to be captured by detectors and (2) low-relevance source instances that are considerably statistically different from target instances although their contained objects are from the same category. With this in mind, we propose a reinforcement learning based method, coined as sequential instance refinement, where two agents are learned to progressively refine both source and target instances by taking sequential actions to remove both outlier target instances and low-relevance source instances step by step. Extensive experiments on several benchmark datasets demonstrate the superior performance of our method over existing state-of-the-art baselines for cross-domain object detection.
引用
收藏
页码:3970 / 3984
页数:15
相关论文
共 79 条
[11]   Domain Adversarial Reinforcement Learning for Partial Domain Adaptation [J].
Chen, Jin ;
Wu, Xinxiao ;
Duan, Lixin ;
Gao, Shenghua .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (02) :539-553
[12]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[13]   Hyperparameter Optimization for Tracking with Continuous Deep Q-Learning [J].
Dong, Xingping ;
Shen, Jianbing ;
Wang, Wenguan ;
Liu, Yu ;
Shao, Ling ;
Porikli, Fatih .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :518-527
[14]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[15]   REPRESENTATION AND MATCHING OF PICTORIAL STRUCTURES [J].
FISCHLER, MA ;
ELSCHLAGER, RA .
IEEE TRANSACTIONS ON COMPUTERS, 1973, C 22 (01) :67-92
[16]  
Ganin Y, 2016, J MACH LEARN RES, V17
[17]  
Ganin Y, 2015, PR MACH LEARN RES, V37, P1180
[18]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[19]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[20]  
Gong BQ, 2012, PROC CVPR IEEE, P2066, DOI 10.1109/CVPR.2012.6247911