Deep Video Inpainting

被引:145
作者
Kim, Dahun [1 ]
Woo, Sanghyun [1 ]
Lee, Joon-Young [2 ]
Kweon, In So [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] Adobe Res, San Jose, CA USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/CVPR.2019.00594
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video inpainting aims to fill spatio-temporal holes with plausible content in a video. Despite tremendous progress of deep neural networks for image inpainting, it is challenging to extend these methods to the video domain due to the additional time dimension. In this work, we propose a novel deep network architecture for fast video inpainting. Built upon an image-based encoder-decoder model, our framework is designed to collect and refine information from neighbor frames and synthesize still-unknown regions. At the same time, the output is enforced to be temporally consistent by a recurrent feedback and a temporal memory module. Compared with the state-of-the-art image inpainting algorithm, our method produces videos that are much more semantically correct and temporally smooth. In contrast to the prior video completion method which relies on time-consuming optimization, our method runs in near real-time while generating competitive video results. Finally, we applied our framework to video retargeting task, and obtain visually pleasing results.
引用
收藏
页码:5785 / 5794
页数:10
相关论文
共 33 条
  • [1] Filling-in by joint interpolation of vector fields and gray levels
    Ballester, C
    Bertalmio, M
    Caselles, V
    Sapiro, G
    Verdera, J
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (08) : 1200 - 1211
  • [2] PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing
    Barnes, Connelly
    Shechtman, Eli
    Finkelstein, Adam
    Goldman, Dan B.
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03):
  • [3] Image inpainting
    Bertalmio, M
    Sapiro, G
    Caselles, V
    Ballester, C
    [J]. SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, : 417 - 424
  • [4] A Naturalistic Open Source Movie for Optical Flow Evaluation
    Butler, Daniel J.
    Wulff, Jonas
    Stanley, Garrett B.
    Black, Michael J.
    [J]. COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 : 611 - 625
  • [5] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [6] Coherent Online Video Style Transfer
    Chen, Dongdong
    Liao, Jing
    Yuan, Lu
    Yu, Nenghai
    Hua, Gang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1114 - 1123
  • [7] Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting
    Cho, Donghyeon
    Park, Jinsun
    Oh, Tae-Hyun
    Tai, Yu-Wing
    Kweon, In So
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4568 - 4577
  • [8] Efros A. A., 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P1033, DOI 10.1109/ICCV.1999.790383
  • [9] How Not to Be Seen - Object Removal from Videos of Crowded Scenes
    Granados, M.
    Tompkin, J.
    Kim, K.
    Grau, O.
    Kautz, J.
    Theobalt, C.
    [J]. COMPUTER GRAPHICS FORUM, 2012, 31 (02) : 219 - 228
  • [10] Granados M, 2012, LECT NOTES COMPUT SC, V7572, P682, DOI 10.1007/978-3-642-33718-5_49