ArrowGAN : Learning to generate videos by learning Arrow of Time

被引:1
作者
Hong, Kibeom [1 ]
Uh, Youngjung [2 ]
Byun, Hyeran [1 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Seoul, South Korea
[2] NAVER, Clova AI Res, Seongnam, South Korea
基金
新加坡国家研究基金会;
关键词
Video generation; Generative model; Generative Adversarial Networks; Arrow of Time; Self-supervised learning;
D O I
10.1016/j.neucom.2021.01.043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training GANs on videos is even more sophisticated than on images because videos have a distinguished dimension: time. While recent methods designed a dedicated architecture considering time, generated videos are still far from indistinguishable from real videos. In this paper, we introduce ArrowGAN framework, where the discriminators learns to classify arrow of time as an auxiliary task and the generators tries to synthesize forward-running videos. We argue that the auxiliary task should be carefully chosen regarding the target domain. In addition, we explore categorical ArrowGAN with recent techniques in conditional image generation upon ArrowGAN framework, achieving the state-of-the-art performance on categorical video generation. Our extensive experiments validate the effectiveness of arrow of time as a self-supervisory task, and demonstrate that all our components of categorical ArrowGAN lead to the improvement regarding video inception score and Frechet video distance on three datasets: Weizmann, UCFsports, and UCF-101. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:223 / 234
页数:12
相关论文
共 50 条
  • [1] Non-Adversarial Video Synthesis with Learned Priors
    Aich, Abhishek
    Gupta, Akash
    Panda, Rameswar
    Hyder, Rakib
    Asif, M. Salman
    Roy-Chowdhury, Amit K.
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6089 - 6098
  • [2] [Anonymous], A learned representation for artistic style
  • [3] [Anonymous], 2017, ICML
  • [4] [Anonymous], 2016, P NIPS
  • [5] [Anonymous], 2016, NIPS
  • [6] [Anonymous], 2014, ADV COMPUT VIS PATTE, DOI 10.1007/978-3-319-09396-3_9
  • [7] [Anonymous], 2018, P MACHINE LEARNING R
  • [8] [Anonymous], 2018, INT C LEARN REPR 201
  • [9] Babaeizadeh Mohammad, 2018, INT C LEARN REPR
  • [10] Blank M, 2005, IEEE I CONF COMP VIS, P1395