CAFIN: cross-attention based face image repair network

被引：0

作者：

Li, Yaqian ^{[1
]}

Li, Kairan ^{[1
]}

Li, Haibin ^{[1
]}

Zhang, Wenming ^{[1
]}

机构：

[1] Yanshan Univ, Key Lab Ind Comp Control Engn Hebei Prov, Qinhuangdao 066004, Peoples R China

来源：

MULTIMEDIA SYSTEMS | 2024年 / 30卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Image restoration; Generative adversarial networks; Cross-attention; Feature fusion;

D O I：

10.1007/s00530-024-01466-x

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To address issues such as instability during the training of Generative Adversarial Networks, insufficient clarity in facial structure restoration, inadequate utilization of known information, and lack of attention to color information in images, a Cross-Attention Restoration Network is proposed. Initially, in the decoding part of the basic first-stage U-Net network, a combination of sub-pixel convolution and upsampling modules is employed to remedy the low-quality image restoration issue associated with single upsampling in the image recovery process. Subsequently, the restoration part of the first-stage network and the un-restored images are used to compute cross-attention in both spatial and channel dimensions, recovering the complete facial restoration image from the known repaired information. At the same time, we propose a loss function based on HSV space, assigning appropriate weights within the function to significantly improve the color aspects of the image. Compared to classical methods, this model exhibits good performance in terms of peak signal-to-noise ratio, structural similarity, and FID.

引用

页数：10

共 27 条

[1] Aitken A, 2017, Arxiv, DOI arXiv:1707.02937
[2] Image inpainting
Bertalmio, M
Sapiro, G
Caselles, V
Ballester, C
[J]. SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, : 417 - 424
[3] Nontexture inpainting by curvature-driven diffusions
Chan, TF
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2001, 12 (04) : 436 - 449
[4] Dai T., 2019, 2019 IEEE INT C IM P, P3796
[5] Guo Q., 2021, P 29 ACM INT C MULT
[6] Image Inpainting via Conditional Texture and Structure Dual Generation
Guo, Xiefan
Yang, Hongyu
Huang, Di
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14114 - 14123
[7] Reducing the dimensionality of data with neural networks
Hinton, G. E.
Salakhutdinov, R. R.
[J]. SCIENCE, 2006, 313 (5786) : 504 - 507
[8] Densely Connected Convolutional Networks
Huang, Gao
Liu, Zhuang
van der Maaten, Laurens
Weinberger, Kilian Q.
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
[9] Globally and Locally Consistent Image Completion
Iizuka, Satoshi
Simo-Serra, Edgar
Ishikawa, Hiroshi
[J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
[10] Kingma D. P., 2014, Adam: A method for stochastic optimization, DOI 10.48550/arXiv.1412.6980

← 1 2 3 →