Research Progress of Image Inpainting Methods Based on Deep Learning

被引:0
作者
Chen, Wenxiang [1 ,2 ]
Tian, Qichuan [1 ,2 ]
Lian, Lu [1 ,2 ]
Zhang, Xiaohang [1 ,2 ]
Wang, Haoji [1 ,2 ]
机构
[1] College of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing
[2] Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing
关键词
computer vision; deep learning; image inpainting;
D O I
10.3778/j.issn.1002-8331.2406-0100
中图分类号
学科分类号
摘要
Image inpainting is the process of recovering and repairing damaged or missing parts of an image through algorithms or techniques, which is a significant research focus in the field of computer vision. This paper reviews the development trajectory of deep learning-based image inpainting methods in recent years, and categorizes them into single-modal and multi-modal methods. The single-modal image inpainting methods are divided into convolutional autoencoder-based methods, GAN-based methods, Transformer-based methods and diffusion model-based methods. Meanwhile, the multimodal image inpainting methods include text-guided methods, audio-guided methods, video-guided methods and multimodal fusion-based methods. Furthermore, this paper provides a comparative analysis of the principles, advantages and disadvantages of various methods. It also introduces commonly used datasets and evaluation metrics, assesses the performance of representative methods on standard datasets, and discusses current challenges and future directions in this domain. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:58 / 73
页数:15
相关论文
共 90 条
[1]  
LUO H Y, ZHENG Y H., Survey of research on image inpainting methods, Journal of Frontiers of Computer Science and Technology, 16, 10, pp. 2193-2218, (2022)
[2]  
WAN Z, ZHANG B, CHEN D, Et al., Old photo restoration via deep latent space translation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 2, pp. 2071-2087, (2022)
[3]  
LIU J X, CHEN R, AN S P., Reference prior and generative prior linked distorted old photos restoration, Journal of Image and Graphics, 27, 5, pp. 1657-1668, (2022)
[4]  
WU Z, XUAN H, SUN C, Et al., Semi-supervised video inpainting with cycle consistency constraints, Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 22586-22595, (2023)
[5]  
WU J, LI X, SI C, Et al., Towards language-driven video inpainting via multimodal large language models, Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12501-12511, (2024)
[6]  
WEI T T, KUO C, TSENG Y C, Et al., MPVF: 4D medical image inpainting by multi-pyramid voxel flows, IEEE Journal of Biomedical and Health Informatics, 27, 12, pp. 5872-5882, (2023)
[7]  
SHOBI V M, DHANASEELAN F R., Voxel representation of brain images inpainting via regional pixel semantic network and pyramidal attention ae-quantile differential mechanism model, Computers in Biology and Medicine, 170, (2024)
[8]  
LYU J F, SHAO L Z, LEI X M., Image inpainting algorithm based on deep neural networks, Computer Engineering and Applications, 59, 20, pp. 1-12, (2023)
[9]  
LECUN Y, BOSER B, DENKER J S, Et al., Back-propagation applied to handwritten zip code recognition, Neural Computation, 1, 4, pp. 541-551, (1989)
[10]  
RUMELHART D E, HINTON G E, WILLIAMS R J., Learning internal representations by error propagation, (1985)