Continual Predictive Learning from Videos

被引：3

作者：

Chen, Geng ^{[1
]}

Zhang, Wendong ^{[1
]}

Lu, Han ^{[1
]}

Gao, Siyu ^{[1
]}

Wang, Yunbo ^{[1
]}

Long, Mingsheng ^{[2
]}

Yang, Xiaokang ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China

[2] Tsinghua Univ, Sch Software, BNRist, Beijing, Peoples R China

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2022年

关键词：

D O I：

10.1109/CVPR52688.2022.01046

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Predictive learning ideally builds the world model of physical processes in one or more given environments. Typical setups assume that we can collect data from all environments at all times. In practice, however, different prediction tasks may arrive sequentially so that the environments may change persistently throughout the training procedure. Can we develop predictive learning algorithms that can deal with more realistic, non-stationary physical environments? In this paper, we study a new continual learning problem in the context of video prediction, and observe that most existing methods suffer from severe catastrophic forgetting in this setup. To tackle this problem, we propose the continual predictive learning (CPI.) approach, which learns a mixture world model via predictive experience replay and performs test-time adaptation with non parametric task inference. We construct two new benchmarks based on RoboNet and KTH, in which different tasks correspond to different physical robotic environments or human actions. Our approach is shown to effectively mitigate forgetting and remarkably outperform the naive combinations of previous art in video prediction and continual learning.

引用

页码：10718 / 10727

页数：10

共 50 条

[1]

[Anonymous], 2016, NEURIPS

[2]

[Anonymous], 2017, Neurips

[3]

[Anonymous], 2018, Stochastic adversarial video prediction

[4]

[Anonymous], 2017, NEURIPS

[5]

[Anonymous], 2013, INT C MACH LEARN

[6]

[Anonymous], 2017, ICML

[7]

Ayub A., 2021, ICLR

[8]

Azizzadenesheli K., 2019, ARXIV190309734

[9]

Babaeizadeh M., 2018, P INT C LEARN REPR

[10] Improved Conditional VRNNs for Video Prediction [J].

Castrejon, Lluis ;

Ballas, Nicolas ;

Courville, Aaron .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7607-7616

← 1 2 3 4 5 →