Back to the Source: Diffusion-Driven Adaptation to Test-Time Corruption

Cited by: 34

Authors:

Gao, Jin [1]

Zhang, Jialing [1]

Liu, Xihui [3]

Darrell, Trevor [4]

Shelhamer, Evan [5]

Wang, Dequan [1, 2]

Affiliations:

[1] Shanghai Jiao Tong University, Shanghai, People's Republic of China

[2] Shanghai Artificial Intelligence Laboratory, Shanghai, People's Republic of China

[3] University of Hong Kong, Hong Kong, People's Republic of China

[4] University of California, Berkeley, Berkeley, CA, USA

[5] DeepMind, London, England

Source:

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023

Keywords:

DOI:

10.1109/CVPR52729.2023.01134

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Test-time adaptation harnesses test inputs to improve the accuracy of a model trained on source data when tested on shifted target data. Most methods update the source model by (re-)training on each target domain. While retraining can help, it is sensitive to the amount and order of the data and the hyperparameters for optimization. We update the target data instead, and project all test inputs toward the source domain with a generative diffusion model. Our diffusion-driven adaptation (DDA) method shares its models for classification and generation across all domains, training both on source then freezing them for all targets, to avoid expensive domain-wise re-training. We augment diffusion with image guidance and classifier self-ensembling to automatically decide how much to adapt. Input adaptation by DDA is more robust than model adaptation across a variety of corruptions, models, and data regimes on the ImageNet-C benchmark. With its input-wise updates, DDA succeeds where model adaptation degrades on too little data (small batches), on dependent data (correlated orders), or on mixed data (multiple corruptions).
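The abstract mentions classifier self-ensembling as the way the method decides how much to adapt. A minimal sketch of one plausible reading follows, assuming the ensemble simply averages the classifier's probabilities on the original input and on the diffusion-adapted input; the abstract does not specify the aggregation rule. The function names and the toy logits are hypothetical and serve only as illustration, and the diffusion step itself is not shown.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def self_ensemble(logits_original, logits_adapted):
    """Average class probabilities from the original and adapted inputs.

    Assumption: equal-weight averaging. The abstract only states that
    self-ensembling is used, not how predictions are combined.
    """
    probs = 0.5 * (softmax(logits_original) + softmax(logits_adapted))
    return probs.argmax(axis=-1), probs


# Hypothetical logits for two test images and three classes.
# Row 0: the classifier is unsure on the corrupted input but confident
# on the adapted one, so the adapted prediction dominates the average.
logits_original = np.array([[2.0, 1.9, 0.1], [0.2, 0.1, 3.0]])
logits_adapted = np.array([[0.1, 4.0, 0.2], [0.3, 0.2, 2.5]])

labels, probs = self_ensemble(logits_original, logits_adapted)
print(labels)
```

Under this assumed rule, whichever view the classifier is more confident about has more influence on the final label, so an unhelpful adaptation is partly offset by the prediction on the unmodified input.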

Pages: 11786-11796

Page count: 11
