Z-ORDER RECURRENT NEURAL NETWORKS FOR VIDEO PREDICTION

被引:17
作者
Zhang, Jianjin
Wang, Yunbo
Long, Mingsheng [1 ]
Wang, Jianmin
Yu, Philip S.
机构
[1] Tsinghua Univ, Sch Software, BNRist, Beijing, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2019年
基金
国家重点研发计划;
关键词
Video Prediction; RNNs; GANs;
D O I
10.1109/ICME.2019.00048
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a Z-Order RNN (Znet) for predicting future video frames given historical observations. There are two main contributions respectively in deterministic and stochastic modeling perspective. First, we propose a new RNN architecture for modeling the deterministic dynamics, which updates hidden states along a z-order curve to enhance the consistency of the features of mirrored layers. Second, we introduce an adversarial training approach to two-stream Znet for modeling the stochastic variations, which forces the Znet-Predictor to imitate the behavior of the Znet-Probe. This two-stream architecture enables the adversarial training to be conducted in the feature space instead of the image space. Our model achieves the state-of-the-art prediction accuracy on two video datasets.
引用
收藏
页码:230 / 235
页数:6
相关论文
共 27 条
[1]  
[Anonymous], 2017, ICLR
[2]  
[Anonymous], 2014, ABS14091259 CORR
[3]  
[Anonymous], 2015, NIPS 15 P 28 INT C N
[4]  
[Anonymous], 2016, ICML
[5]  
[Anonymous], 2016, P 4 INT C LEARN REPR
[6]  
[Anonymous], 2016, NIPS
[7]  
[Anonymous], 2016, NIPS
[8]  
[Anonymous], 2015, ICML
[9]  
[Anonymous], 2018, ECCV
[10]  
[Anonymous], 2018, ICML