Deep Video Inpainting

被引：145

作者：

Kim, Dahun ^{[1
]}

Woo, Sanghyun ^{[1
]}

Lee, Joon-Young ^{[2
]}

Kweon, In So ^{[1
]}

机构：

[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea

[2] Adobe Res, San Jose, CA USA

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1109/CVPR.2019.00594

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video inpainting aims to fill spatio-temporal holes with plausible content in a video. Despite tremendous progress of deep neural networks for image inpainting, it is challenging to extend these methods to the video domain due to the additional time dimension. In this work, we propose a novel deep network architecture for fast video inpainting. Built upon an image-based encoder-decoder model, our framework is designed to collect and refine information from neighbor frames and synthesize still-unknown regions. At the same time, the output is enforced to be temporally consistent by a recurrent feedback and a temporal memory module. Compared with the state-of-the-art image inpainting algorithm, our method produces videos that are much more semantically correct and temporally smooth. In contrast to the prior video completion method which relies on time-consuming optimization, our method runs in near real-time while generating competitive video results. Finally, we applied our framework to video retargeting task, and obtain visually pleasing results.

引用

页码：5785 / 5794

页数：10

共 33 条

[1] Filling-in by joint interpolation of vector fields and gray levels
Ballester, C
Bertalmio, M
Caselles, V
Sapiro, G
Verdera, J
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (08) : 1200 - 1211
[2] PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing
Barnes, Connelly
Shechtman, Eli
Finkelstein, Adam
Goldman, Dan B.
[J]. ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03):
[3] Image inpainting
Bertalmio, M
Sapiro, G
Caselles, V
Ballester, C
[J]. SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, : 417 - 424
[4] A Naturalistic Open Source Movie for Optical Flow Evaluation
Butler, Daniel J.
Wulff, Jonas
Stanley, Garrett B.
Black, Michael J.
[J]. COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 : 611 - 625
[5] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Carreira, Joao
Zisserman, Andrew
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
[6] Coherent Online Video Style Transfer
Chen, Dongdong
Liao, Jing
Yuan, Lu
Yu, Nenghai
Hua, Gang
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1114 - 1123
[7] Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting
Cho, Donghyeon
Park, Jinsun
Oh, Tae-Hyun
Tai, Yu-Wing
Kweon, In So
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4568 - 4577
[8] Efros A. A., 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P1033, DOI 10.1109/ICCV.1999.790383
[9] How Not to Be Seen - Object Removal from Videos of Crowded Scenes
Granados, M.
Tompkin, J.
Kim, K.
Grau, O.
Kautz, J.
Theobalt, C.
[J]. COMPUTER GRAPHICS FORUM, 2012, 31 (02) : 219 - 228
[10] Granados M, 2012, LECT NOTES COMPUT SC, V7572, P682, DOI 10.1007/978-3-642-33718-5_49

← 1 2 3 4 →