Representative Feature Alignment for Adaptive Object Detection

被引:16
作者
Xu, Shan [1 ]
Zhang, Huaidong [2 ]
Xu, Xuemiao [3 ,4 ,5 ]
Hu, Xiaowei [6 ]
Xu, Yangyang [1 ]
Dai, Liangui [7 ]
Choi, Kup-Sze [8 ]
Heng, Pheng-Ann [9 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[2] South China Univ Technol, Sch Future Technol, Guangzhou 510000, Peoples R China
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510000, Peoples R China
[4] Minist Educ, State Key Lab Subtrop Bldg Sci, Key Lab Big Data & Intelligent Robot, Guangzhou 510000, Guangdong, Peoples R China
[5] Guangdong Prov Key Lab Computat Intelligence & Cyb, Guangzhou 510000, Guangdong, Peoples R China
[6] Shanghai AI Lab, Shanghai 200031, Peoples R China
[7] Guangdong Leatop Technol Investment Co Ltd, Guangzhou 518001, Peoples R China
[8] Hong Kong Polytech Univ, Ctr Smart Hlth, Hong Kong, Peoples R China
[9] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
关键词
Feature extraction; Detectors; Proposals; Adaptation models; Object detection; Semantics; Prototypes; Domain adaptation; deep learning; object detection; UNSUPERVISED DOMAIN ADAPTATION; NETWORKS;
D O I
10.1109/TCSVT.2022.3202094
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unsupervised domain adaptation for object detection aims to generalize the object detector trained on the label-rich source domain to the unlabeled target domain. Recently, existing works adopt the instance-level alignment or pixel-level alignment to perform domain transfer, which can effectively avoid the negative transfer due to the diverse background between domains. However, we find that they treat all the regions of an instance feature equally without suppressing background area. They do not segment the specific texture and discriminative regions of objects, which are transferable during adaptation. We call the features that combine the local structure feature and semantic discriminant features as representative features. We propose a novel Representative Feature Alignment (RFA) model to align the features extracted from representative patterns of objects, i.e. representative features, for domain adaptation. Specifically, the representative features are extracted by the Representative Feature Extraction (RFE) submodules. The RFE submodules take the features extracted from different intermediate layers of the detector as input, and filter out the representative features layer-by-layer via integrating class weighting generator, category selection and class activation mapping. Then the representative features from multi-layers are further adaptively aggregated to obtain the final representative features, which are utilized to conduct feature alignment in a class-aware manner. Our representative features are free of untransferable regions and background areas, which leads to better feature alignment. Extensive experimental results show that the proposed model outperforms state-of-the-art methods on a few benchmark datasets.
引用
收藏
页码:689 / 700
页数:12
相关论文
共 72 条
[1]   Compressed Domain Moving Object Detection Based on CRF [J].
Alizadeh, Mohammadsadegh ;
Sharifkhani, Mohammad .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (03) :674-684
[2]   Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks [J].
Bousmalis, Konstantinos ;
Silberman, Nathan ;
Dohan, David ;
Erhan, Dumitru ;
Krishnan, Dilip .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :95-104
[3]   Exploring Object Relation in Mean Teacher for Cross-Domain Detection [J].
Cai, Qi ;
Pan, Yingwei ;
Ngo, Chong-Wah ;
Tian, Xinmei ;
Duan, Lingyu ;
Yao, Ting .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11449-11458
[4]   Dual Bipartite Graph Learning: A General Approach for Domain Adaptive Object Detection [J].
Chen, Chaoqi ;
Li, Jiongcheng ;
Zheng, Zebiao ;
Huang, Yue ;
Ding, Xinghao ;
Yu, Yizhou .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :2683-2692
[5]   Harmonizing Transferability and Discriminability for Adapting Object Detectors [J].
Chen, Chaoqi ;
Zheng, Zebiao ;
Ding, Xinghao ;
Huang, Yue ;
Dou, Qi .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8866-8875
[6]   Domain Adaptive Faster R-CNN for Object Detection in the Wild [J].
Chen, Yuhua ;
Li, Wen ;
Sakaridis, Christos ;
Dai, Dengxin ;
Van Gool, Luc .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3339-3348
[7]   SLV: Spatial Likelihood Voting forWeakly Supervised Object Detection [J].
Chen, Ze ;
Fu, Zhihang ;
Jiang, Rongxin ;
Chen, Yaowu ;
Hua, Xian-Sheng .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12992-13001
[8]   Hierarchical Context Embedding for Region-Based Object Detection [J].
Chen, Zhao-Min ;
Jin, Xin ;
Zhao, Borui ;
Wei, Xiu-Shen ;
Guo, Yanwen .
COMPUTER VISION - ECCV 2020, PT XXI, 2020, 12366 :633-648
[9]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[10]   Unbiased Mean Teacher for Cross-domain Object Detection [J].
Deng, Jinhong ;
Li, Wen ;
Chen, Yuhua ;
Duan, Lixin .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4089-4099