Image inpainting based on CNN-Transformer framework via structure and texture restoration

被引:1
作者
Li, Zhan [1 ]
Han, Nan [1 ]
Wang, Yuning [1 ]
Zhang, Yanan [1 ]
Yan, Jing [2 ]
Du, Yingfei [1 ]
Geng, Guohua [1 ]
机构
[1] Northwest Univ, Sch Informat Sci & Technol, Xian 710127, Peoples R China
[2] Shaanxi Prov Inst Archaeol, Xian 710054, Peoples R China
关键词
Image inpainting; Convolutional Neural Network; Transformer; Structure and texture; Ancient murals; OBJECT REMOVAL;
D O I
10.1016/j.asoc.2024.112671
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Both coherent structure and fine texture are of great importance in image inpainting. However, existing methods fail to consider both structure and texture simultaneously, resulting in discontinuous edges, blurred details and artifacts of inpainting results. In addition, Convolutional Neural Network (CNN) suffers from its limited modeling capability for long-range dependencies. To address these issues, we propose a novel method called Image inpainting based on CNN-Transformer framework via structure and texture restoration (CTSTNet). In CTSTNet, the Structure Reconstruction Network (SRN) restores the overall structure of the damaged image by leveraging its ability to model long dependencies. Meanwhile, the Texture Reconstruction Network (TRN) restores the grayscale map and edge map to generate the texture of the damaged image. Finally, Image Completion Network (ICN) fuses the structure and texture using a Context Aggregation Module (CAM) to generate the inpainting result. Experimental results on two public datasets CelebA, Paris StreetView, and our constructed mural dataset MuralCN, demonstrate that CTSTNet achieves significant improvements in evaluation metrics and visual quality compared to other related methods. This indicates that our CTSTNet can effectively restore images and can also be applied in the field of cultural relic image restoration.
引用
收藏
页数:9
相关论文
共 46 条
[1]   Image inpainting [J].
Bertalmio, M ;
Sapiro, G ;
Caselles, V ;
Ballester, C .
SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, :417-424
[2]   Nontexture inpainting by curvature-driven diffusions [J].
Chan, TF .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2001, 12 (04) :436-449
[3]   Mathematical models for local nontexture inpaintings [J].
Chan, TF ;
Shen, JH .
SIAM JOURNAL ON APPLIED MATHEMATICS, 2002, 62 (03) :1019-1043
[4]   Image inpainting algorithm based on inference attention module and two-stage network [J].
Chen, Yuantao ;
Xia, Runlong ;
Yang, Kai ;
Zou, Ke .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
[5]   DNNAM: Image inpainting algorithm via deep neural networks and attention mechanism [J].
Chen, Yuantao ;
Xia, Runlong ;
Yang, Kai ;
Zou, Ke .
APPLIED SOFT COMPUTING, 2024, 154
[6]   MICU: Image super-resolution via multi-level information compensation and U-net [J].
Chen, Yuantao ;
Xia, Runlong ;
Yang, Kai ;
Zou, Ke .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
[7]   MFMAM: Image inpainting via multi-scale feature module with attention module [J].
Chen, Yuantao ;
Xia, Runlong ;
Yang, Kai ;
Zou, Ke .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 238
[8]  
Cheng W. H., 2005, P INT C COMP GRAPH I, P64
[9]   Region filling and object removal by exemplar-based image inpainting [J].
Criminisi, A ;
Pérez, P ;
Toyama, K .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (09) :1200-1212
[10]   What Makes Paris Look like Paris? [J].
Doersch, Carl ;
Singh, Saurabh ;
Gupta, Abhinav ;
Sivic, Josef ;
Efros, Alexei A. .
ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04)