Sequential Instance Refinement for Cross-Domain Object Detection in Images

Cited by: 13
Authors
Chen, Jin [1, 2]
Wu, Xinxiao [1, 2]
Duan, Lixin [3]
Chen, Lin [4]
Affiliations
[1] Beijing Inst Technol, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci, Beijing 100081, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[4] Wyze Labs, Kirkland, WA 98034 USA
Keywords
Object detection; Feature extraction; Detectors; Reinforcement learning; Proposals; Task analysis; Benchmark testing; Cross-domain object detection; Negative transfer
DOI
10.1109/TIP.2021.3066904
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Cross-domain object detection in images, which aims to adapt a detection model learned from existing labeled images (the source domain) to newly collected unlabeled ones (the target domain), has attracted increasing attention in the past few years. Existing methods usually address the problem through direct feature alignment between the source and target domains at the image level, the instance level (i.e., region proposals), or both. However, we have observed that directly aligning the features of all object instances from the two domains often results in negative transfer, due to the existence of (1) outlier target instances, which contain confusing objects that do not belong to any category of the source domain and are thus hard for detectors to capture, and (2) low-relevance source instances, which differ considerably in their statistics from target instances even though the objects they contain are from the same category. With this in mind, we propose a reinforcement learning based method, termed sequential instance refinement, in which two agents learn to progressively refine both source and target instances by taking sequential actions that remove outlier target instances and low-relevance source instances step by step. Extensive experiments on several benchmark datasets demonstrate the superior performance of our method over existing state-of-the-art baselines for cross-domain object detection.
Pages: 3970-3984
Number of pages: 15