CAFIN: cross-attention based face image repair network

被引:0
作者
Li, Yaqian [1 ]
Li, Kairan [1 ]
Li, Haibin [1 ]
Zhang, Wenming [1 ]
机构
[1] Yanshan Univ, Key Lab Ind Comp Control Engn Hebei Prov, Qinhuangdao 066004, Peoples R China
基金
中国国家自然科学基金;
关键词
Image restoration; Generative adversarial networks; Cross-attention; Feature fusion;
D O I
10.1007/s00530-024-01466-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To address issues such as instability during the training of Generative Adversarial Networks, insufficient clarity in facial structure restoration, inadequate utilization of known information, and lack of attention to color information in images, a Cross-Attention Restoration Network is proposed. Initially, in the decoding part of the basic first-stage U-Net network, a combination of sub-pixel convolution and upsampling modules is employed to remedy the low-quality image restoration issue associated with single upsampling in the image recovery process. Subsequently, the restoration part of the first-stage network and the un-restored images are used to compute cross-attention in both spatial and channel dimensions, recovering the complete facial restoration image from the known repaired information. At the same time, we propose a loss function based on HSV space, assigning appropriate weights within the function to significantly improve the color aspects of the image. Compared to classical methods, this model exhibits good performance in terms of peak signal-to-noise ratio, structural similarity, and FID.
引用
收藏
页数:10
相关论文
共 27 条
  • [1] Aitken A, 2017, Arxiv, DOI arXiv:1707.02937
  • [2] Image inpainting
    Bertalmio, M
    Sapiro, G
    Caselles, V
    Ballester, C
    [J]. SIGGRAPH 2000 CONFERENCE PROCEEDINGS, 2000, : 417 - 424
  • [3] Nontexture inpainting by curvature-driven diffusions
    Chan, TF
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2001, 12 (04) : 436 - 449
  • [4] Dai T., 2019, 2019 IEEE INT C IM P, P3796
  • [5] Guo Q., 2021, P 29 ACM INT C MULT
  • [6] Image Inpainting via Conditional Texture and Structure Dual Generation
    Guo, Xiefan
    Yang, Hongyu
    Huang, Di
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14114 - 14123
  • [7] Reducing the dimensionality of data with neural networks
    Hinton, G. E.
    Salakhutdinov, R. R.
    [J]. SCIENCE, 2006, 313 (5786) : 504 - 507
  • [8] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [9] Globally and Locally Consistent Image Completion
    Iizuka, Satoshi
    Simo-Serra, Edgar
    Ishikawa, Hiroshi
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
  • [10] Kingma D. P., 2014, Adam: A method for stochastic optimization, DOI 10.48550/arXiv.1412.6980