Spatially adaptive multi-scale contextual attention for image inpainting

被引：0

作者：

Xueting Wang

Yiyan Chen

Toshihiko Yamasaki

机构：

[1] The University of Tokyo,Department of Information Communication and Engineering

[2] AI Laboratory,undefined

[3] CyberAgent,undefined

[4] Inc.,undefined

来源：

Multimedia Tools and Applications | 2022年 / 81卷

关键词：

Image inpainting; Spatially adaptive; Contextual attention; Multi-scale attention;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Image inpainting is the task to fill missing regions of an image. Recently, researchers have achieved a great performance by using convolutional neural networks (CNNs) with the conventional patch-matching method. Existing methods compute the attention scores, which are based on the similarity of patches between the known and missing regions. Considering that patches at different spatial positions can convey different levels of detail, we propose a spatially adaptive multi-scale attention score that uses the patches of different scales to compute scores for each pixel at different positions. Through experiments on the Paris Street View and Places datasets, our proposal shows slight improvement compared with some related methods on the quantitative evaluation metrics commonly used in the existing methods. Moreover, we found that these quantitative metrics are not appropriate enough considering the subjective impressions of the generated images. Therefore, we conducted subjective evaluation through user study for comparison, which shows that our proposal has superiority of performance generating much more detailed and subjectively plausible images.

引用

页码：31831 / 31846

页数：15

共 33 条

[1]

Ballester C(2001)Filling-in by joint interpolation of vector fields and gray levels IEEE Trans Image Process 10 1200-1211

[2]

Bertalmio M(2020)Salient object detection in the distributed cloud-edge intelligent network IEEE Netw 34 216-224

[3]

Caselles V(2014)Generative adversarial networks arXiv:http://arxiv.org/abs/1406.2661 4 6-es

[4]

Sapiro G(2007)Scene completion using millions of photographs ACM Trans Graph (TOG) 26 4-968

[5]

Verdera J(2020)Robust detection of image operator chain with two-stream convolutional neural network IEEE J Sel Top Sig Process 14 955-1464

[6]

Gao Z(2017)Places: a 10 million image database for scene recognition IEEE Trans Pattern Anal Mach Intell 40 1452-undefined

[7]

Zhang H(undefined)undefined undefined undefined undefined-undefined

[8]

Dong S(undefined)undefined undefined undefined undefined-undefined

[9]

Sun S(undefined)undefined undefined undefined undefined-undefined

[10]

Wang X(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 4 →