Image Inpainting by End-to-End Cascaded Refinement With Mask Awareness

被引:89
作者
Zhu, Manyu [1 ,2 ]
He, Dongliang [1 ]
Li, Xin [1 ]
Li, Chao [1 ]
Li, Fu [1 ]
Liu, Xiao [1 ,3 ]
Ding, Errui [1 ]
Zhang, Zhaoxiang [4 ]
机构
[1] Baidu Inc, Dept Comp Vis VIS Technol, Beijing 100085, Peoples R China
[2] ByteDance Inc, Beijing 100089, Peoples R China
[3] TAL Educ Grp, Beijing 100080, Peoples R China
[4] Chinese Acad Sci CASIA, Inst Automat, Beijing 100190, Peoples R China
关键词
Convolution; Decoding; Kernel; Feature extraction; Shape; Image reconstruction; Task analysis; Image inpainting; mask awareness; dynamic filtering; cascaded refinement;
D O I
10.1109/TIP.2021.3076310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial. Though U-shaped encoder-decoder frameworks have been witnessed to be successful, most of them share a common drawback of mask unawareness in feature extraction because all convolution windows (or regions), including those with various shapes of missing pixels, are treated equally and filtered with fixed learned kernels. To this end, we propose our novel mask-aware inpainting solution. Firstly, a Mask-Aware Dynamic Filtering (MADF) module is designed to effectively learn multi-scale features for missing regions in the encoding phase. Specifically, filters for each convolution window are generated from features of the corresponding region of the mask. The second fold of mask awareness is achieved by adopting Point-wise Normalization (PN) in our decoding phase, considering that statistical natures of features at masked points differentiate from those of unmasked points. The proposed PN can tackle this issue by dynamically assigning point-wise scaling factor and bias. Lastly, our model is designed to be an end-to-end cascaded refinement one. Supervision information such as reconstruction loss, perceptual loss and total variation loss is incrementally leveraged to boost the inpainting results from coarse to fine. Effectiveness of the proposed framework is validated both quantitatively and qualitatively via extensive experiments on three public datasets including Places2, CelebA and Paris StreetView.
引用
收藏
页码:4855 / 4866
页数:12
相关论文
共 55 条
[1]  
[Anonymous], 2015, arXiv
[2]   PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing [J].
Barnes, Connelly ;
Shechtman, Eli ;
Finkelstein, Adam ;
Goldman, Dan B. .
ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03)
[3]   Image inpainting [J].
Bertalmio, M ;
Sapiro, G ;
Caselles, V ;
Ballester, C .
SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, :417-424
[4]   Strong-continuation, contrast-invariant inpainting with a third-order optimal PDE [J].
Bertalmio, Marcelo .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (07) :1934-1938
[5]   Exemplar-Based Inpainting: Technical Review and New Heuristics for Better Geometric Reconstructions [J].
Buyssens, Pierre ;
Daisy, Maxime ;
Tschumperle, David ;
Lezoray, Olivier .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (06) :1809-1824
[6]   Geometrically Guided Exemplar-Based Inpainting [J].
Cao, Frederic ;
Gousseau, Yann ;
Masnou, Simon ;
Perez, Patrick .
SIAM JOURNAL ON IMAGING SCIENCES, 2011, 4 (04) :1143-1179
[7]   Region filling and object removal by exemplar-based image inpainting [J].
Criminisi, A ;
Pérez, P ;
Toyama, K .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (09) :1200-1212
[8]  
Deng S., 2020, P IEEE CVF C COMP VI, P14560
[9]   Image Inpainting Using Nonlocal Texture Matching and Nonlinear Filtering [J].
Ding, Ding ;
Ram, Sundaresh ;
Rodriguez, Jeffrey J. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) :1705-1719
[10]   What Makes Paris Look Like Paris? [J].
Doersch, Carl ;
Singh, Saurabh ;
Gupta, Abhinav ;
Sivic, Josef ;
Efros, Alexei A. .
COMMUNICATIONS OF THE ACM, 2015, 58 (12) :103-110