Atrous Pyramid Transformer with Spectral Convolution for Image Inpainting

被引：9

作者：

Huang, Muqi ^{[1
,3
]}

Zhang, Lefei ^{[2
,4
]}

机构：

[1] Wuhan Univ, Wuhan, Hubei, Peoples R China

[2] Wuhan Univ, Hubei Luojia Lab, Wuhan, Hubei, Peoples R China

[3] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China

[4] Wuhan Univ, Sch Comp Sci, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China

来源：

PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年

基金：

中国国家自然科学基金;

关键词：

image inpainting; spectral transform; transformer; OBJECT REMOVAL;

D O I：

10.1145/3503161.3548348

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Owing to the ability of extracting features of images on long-range dependencies naturally, transformer is possible to reconstruct the damaged areas of images with the information from the uncorrupted regions globally. In this paper, we propose a two-stage framework based on a novel atrous pyramid transformer (APT) for image inpainting that recovers the structure and texture of an image progressively. Specifically, the patches of APT blocks are embedded in an atrous pyramid manner to explicitly enhance the correlation for both inter-and intra-windows to restore the high-level semantic structures of images more precisely, which could be served as a guide map for the second phase. Subsequently, a dual spectral transform convolution (DSTC) module is further designed to work together with APT to infer the low-level features of the generated areas. The DSTC module decouples the image signal into high frequency and low frequency for capturing texture information with a global view. Experiments on the CelebA-HQ, Paris StreetView, and Places2 demonstrate the superiority of the proposed approach. Code is available at: https://github.com/MuqiH/APT-with-DSTC.git.

引用

页码：4674 / 4683

页数：10

共 47 条

[1] Filling-in by joint interpolation of vector fields and gray levels [J].

Ballester, C ;

Bertalmio, M ;

Caselles, V ;

Sapiro, G ;

Verdera, J .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (08) :1200-1211

[2] PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing [J].

Barnes, Connelly ;

Shechtman, Eli ;

Finkelstein, Adam ;

Goldman, Dan B. .

ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03)

[3] Image inpainting [J].

Bertalmio, M ;

Sapiro, G ;

Caselles, V ;

Ballester, C .

SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, :417-424

[4] Learning a Sketch Tensor Space for Image Inpainting of Man-made Scenes [J].

Cao, Chenjie ;

Fu, Yanwei .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :14489-14498

[5] End-to-End Object Detection with Transformers [J].

Carion, Nicolas ;

Massa, Francisco ;

Synnaeve, Gabriel ;

Usunier, Nicolas ;

Kirillov, Alexander ;

Zagoruyko, Sergey .

COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229

[6]

Chi L., 2020, ADV NEUR IN, V33

[7] Region filling and object removal by exemplar-based image inpainting [J].

Criminisi, A ;

Pérez, P ;

Toyama, K .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (09) :1200-1212

[8]

Criminisi Antonio, 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, V2, pII

[9]

DARABI S, 2012, ACM T GRAPHIC, V31, DOI DOI 10.1145/2185520.2185578

[10] Learning Contextual Transformer Network for Image Inpainting [J].

Deng, Ye ;

Hui, Siqi ;

Zhou, Sanping ;

Meng, Deyu ;

Wang, Jinjun .

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :2529-2538

← 1 2 3 4 5 →