Robust Object Detection via Instance-Level Temporal Cycle Confusion

被引:14
作者
Wang, Xin [1 ]
Huang, Thomas E. [2 ]
Liu, Benlin [3 ]
Yu, Fisher [2 ]
Wang, Xiaolong [4 ]
Gonzalez, Joseph E. [5 ]
Darrell, Trevor [5 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Swiss Fed Inst Technol, Zurich, Switzerland
[3] Univ Washington, Seattle, WA 98195 USA
[4] Univ Calif San Diego, San Diego, CA USA
[5] Univ Calif Berkeley, Berkeley, CA USA
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
关键词
D O I
10.1109/ICCV48922.2021.00901
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building reliable object detectors that are robust to domain shifts, such as various changes in context, viewpoint, and object appearances, is critical for real-world applications. In this work, we study the effectiveness of auxiliary self-supervised tasks to improve the out-of-distribution generalization of object detectors. Inspired by the principle of maximum entropy, we introduce a novel self-supervised task, instance-level temporal cycle confusion (CycConf), which operates on the region features of the object detectors. For each object, the task is to find the most different object proposals in the adjacent frame in a video and then cycle back to itself for self-supervision. CycConf encourages the object detector to explore invariant structures across instances under various motions, which leads to improved model robustness in unseen domains at test time. We observe consistent out-of-domain performance improvements when training object detectors in tandem with self-supervised tasks on various domain adaptation benchmarks with static images (Cityscapes, Foggy Cityscapes, Sim10K) and large-scale video datasets (BDD100K and Waymo open data)(1).
引用
收藏
页码:9123 / 9132
页数:10
相关论文
共 52 条
  • [11] Temporal Cycle-Consistency Learning
    Dwibedi, Debidatta
    Aytar, Yusuf
    Tompson, Jonathan
    Sermanet, Pierre
    Zisserman, Andrew
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1801 - 1810
  • [12] Eysenbach Benjamin, 2021, Maximum entropy rl (provably) solves some robust rl problems
  • [13] Geirhos R., 2019, INT C LEARN REPR
  • [14] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [15] Multi-adversarial Faster-RCNN for Unrestricted Object Detection
    He, Zhenwei
    Zhang, Lei
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6667 - 6676
  • [16] Hendrycks D., 2018, INT C LEARN REPR
  • [17] Hendrycks D, 2020, P INT C LEARN REPR I
  • [18] Hendrycks D, 2019, PR MACH LEARN RES, V97
  • [19] Hendrycks D, 2019, ADV NEUR IN, V32
  • [20] INFORMATION THEORY AND STATISTICAL MECHANICS
    JAYNES, ET
    [J]. PHYSICAL REVIEW, 1957, 106 (04): : 620 - 630