Relation Matters: Foreground-Aware Graph-Based Relational Reasoning for Domain Adaptive Object Detection

被引:26
作者
Chen, Chaoqi [1 ]
Li, Jiongcheng [2 ]
Zhou, Hong-Yu [1 ]
Han, Xiaoguang [3 ]
Huang, Yue [2 ]
Ding, Xinghao [2 ]
Yu, Yizhou [1 ]
机构
[1] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Xiamen Univ, Dept Informat & Commun Engn, Xiamen 361005, Peoples R China
[3] Chinese Univ Hong Kong Shenzhen, Shenzhen Res Inst Big Data, Shenzhen 518172, Peoples R China
基金
中国国家自然科学基金;
关键词
Cognition; Feature extraction; Semantics; Object detection; Training; Task analysis; Knowledge transfer; Domain adaptive object detection; foreground-aware; relational reasoning; graph structure; intra- and inter-domain; ALIGNMENT; KERNEL;
D O I
10.1109/TPAMI.2022.3179445
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain Adaptive Object Detection (DAOD) focuses on improving the generalization ability of object detectors via knowledge transfer. Recent advances in DAOD strive to change the emphasis of the adaptation process from global to local in virtue of fine-grained feature alignment methods. However, both the global and local alignment approaches fail to capture the topological relations among different foreground objects as the explicit dependencies and interactions between and within domains are neglected. In this case, only seeking one-vs-one alignment does not necessarily ensure the precise knowledge transfer. Moreover, conventional alignment-based approaches may be vulnerable to catastrophic overfitting regarding those less transferable regions (e.g., backgrounds) due to the accumulation of inaccurate localization results in the target domain. To remedy these issues, we first formulate DAOD as an open-set domain adaptation problem, in which the foregrounds and backgrounds are seen as the "known classes" and "unknown class" respectively. Accordingly, we propose a new and general framework for DAOD, named Foreground-aware Graph-based Relational Reasoning (FGRR), which incorporates graph structures into the detection pipeline to explicitly model the intra- and inter-domain foreground object relations on both pixel and semantic spaces, thereby endowing the DAOD model with the capability of relational reasoning beyond the popular alignment-based paradigm. FGRR first identifies the foreground pixels and regions by searching reliable correspondence and cross-domain similarity regularization respectively. The inter-domain visual and semantic correlations are hierarchically modeled via bipartite graph structures, and the intra-domain relations are encoded via graph attention mechanisms. Through message-passing, each node aggregates semantic and contextual information from the same and opposite domain to substantially enhance its expressive power. Empirical results demonstrate that the proposed FGRR exceeds the state-of-the-art performance on four DAOD benchmarks.
引用
收藏
页码:3677 / 3694
页数:18
相关论文
共 103 条
[1]  
Baktashmotlagh M., 2019, 7 INT C LEARN REPR I, P1
[2]   A theory of learning from different domains [J].
Ben-David, Shai ;
Blitzer, John ;
Crammer, Koby ;
Kulesza, Alex ;
Pereira, Fernando ;
Vaughan, Jennifer Wortman .
MACHINE LEARNING, 2010, 79 (1-2) :151-175
[3]   Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks [J].
Bousmalis, Konstantinos ;
Silberman, Nathan ;
Dohan, David ;
Erhan, Dumitru ;
Krishnan, Dilip .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :95-104
[4]   Open Set Domain Adaptation [J].
Busto, Pau Panareda ;
Gall, Juergen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :754-763
[5]   Exploring Object Relation in Mean Teacher for Cross-Domain Detection [J].
Cai, Qi ;
Pan, Yingwei ;
Ngo, Chong-Wah ;
Tian, Xinmei ;
Duan, Lingyu ;
Yao, Ting .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11449-11458
[6]  
Chen C., 2021, P IEEE C COMPUTER VI, P12576
[7]   Dual Bipartite Graph Learning: A General Approach for Domain Adaptive Object Detection [J].
Chen, Chaoqi ;
Li, Jiongcheng ;
Zheng, Zebiao ;
Huang, Yue ;
Ding, Xinghao ;
Yu, Yizhou .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :2683-2692
[8]  
Chen CQ, 2020, PROC CVPR IEEE, P8866, DOI 10.1109/CVPR42600.2020.00889
[9]   Progressive Feature Alignment for Unsupervised Domain Adaptation [J].
Chen, Chaoqi ;
Xie, Weiping ;
Huang, Wenbing ;
Rong, Yu ;
Ding, Xinghao ;
Huang, Yue ;
Xu, Tingyang ;
Huang, Junzhou .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :627-636
[10]  
Chen XY, 2019, PR MACH LEARN RES, V97