Robust Object Detection via Instance-Level Temporal Cycle Confusion

被引：14

作者：

Wang, Xin ^{[1
]}

Huang, Thomas E. ^{[2
]}

Liu, Benlin ^{[3
]}

Yu, Fisher ^{[2
]}

Wang, Xiaolong ^{[4
]}

Gonzalez, Joseph E. ^{[5
]}

Darrell, Trevor ^{[5
]}

机构：

[1] Microsoft Res, Redmond, WA 98052 USA

[2] Swiss Fed Inst Technol, Zurich, Switzerland

[3] Univ Washington, Seattle, WA 98195 USA

[4] Univ Calif San Diego, San Diego, CA USA

[5] Univ Calif Berkeley, Berkeley, CA USA

来源：

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年

关键词：

D O I：

10.1109/ICCV48922.2021.00901

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Building reliable object detectors that are robust to domain shifts, such as various changes in context, viewpoint, and object appearances, is critical for real-world applications. In this work, we study the effectiveness of auxiliary self-supervised tasks to improve the out-of-distribution generalization of object detectors. Inspired by the principle of maximum entropy, we introduce a novel self-supervised task, instance-level temporal cycle confusion (CycConf), which operates on the region features of the object detectors. For each object, the task is to find the most different object proposals in the adjacent frame in a video and then cycle back to itself for self-supervision. CycConf encourages the object detector to explore invariant structures across instances under various motions, which leads to improved model robustness in unseen domains at test time. We observe consistent out-of-domain performance improvements when training object detectors in tandem with self-supervised tasks on various domain adaptation benchmarks with static images (Cityscapes, Foggy Cityscapes, Sim10K) and large-scale video datasets (BDD100K and Waymo open data)(1).

引用

页码：9123 / 9132

页数：10

共 52 条

[1] [Anonymous], PHYS REV
[2] [Anonymous], 2007, P IEEE C COMP VIS PA
[3] Exploring Object Relation in Mean Teacher for Cross-Domain Detection
Cai, Qi
Pan, Yingwei
Ngo, Chong-Wah
Tian, Xinmei
Duan, Lingyu
Yao, Ting
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11449 - 11458
[4] Carion N., 2020, ARXIV200512872
[5] Domain Adaptive Faster R-CNN for Object Detection in the Wild
Chen, Yuhua
Li, Wen
Sakaridis, Christos
Dai, Dengxin
Van Gool, Luc
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348
[6] Cheng-Chun Hsu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12354), P733, DOI 10.1007/978-3-030-58545-7_42
[7] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[8] A Risk Assessment Method for Enterprise Cloud Accounting
Deng, Guohua
Xu, Chang
[J]. 2019 12TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2019), 2019, : 172 - 175
[9] Autonomous Driving in the Real World: Experiences with Tesla Autopilot and Summon
Dikmen, Murat
Burns, Catherine M.
[J]. 8TH INTERNATIONAL CONFERENCE ON AUTOMOTIVE USER INTERFACES AND INTERACTIVE VEHICULAR APPLICATIONS (AUTOMOTIVEUI 2016), 2016, : 225 - 228
[10] Unsupervised Visual Representation Learning by Context Prediction
Doersch, Carl
Gupta, Abhinav
Efros, Alexei A.
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1422 - 1430

← 1 2 3 4 5 6 →