Z-ORDER RECURRENT NEURAL NETWORKS FOR VIDEO PREDICTION

被引：17

作者：

Zhang, Jianjin

Wang, Yunbo

Long, Mingsheng ^{[1
]}

Wang, Jianmin

Yu, Philip S.

机构：

[1] Tsinghua Univ, Sch Software, BNRist, Beijing, Peoples R China

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2019年

基金：

国家重点研发计划;

关键词：

Video Prediction; RNNs; GANs;

D O I：

10.1109/ICME.2019.00048

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

We present a Z-Order RNN (Znet) for predicting future video frames given historical observations. There are two main contributions respectively in deterministic and stochastic modeling perspective. First, we propose a new RNN architecture for modeling the deterministic dynamics, which updates hidden states along a z-order curve to enhance the consistency of the features of mirrored layers. Second, we introduce an adversarial training approach to two-stream Znet for modeling the stochastic variations, which forces the Znet-Predictor to imitate the behavior of the Znet-Probe. This two-stream architecture enables the adversarial training to be conducted in the feature space instead of the image space. Our model achieves the state-of-the-art prediction accuracy on two video datasets.

引用

页码：230 / 235

页数：6