Multi-domain Information Fusion for Key-Points Guided GAN Inversion

被引:0
|
作者
Xu, Ruize [1 ]
Qiu, Xiaowen [2 ]
He, Boan [2 ]
Ge, Weifeng [2 ]
Zhang, Wenqiang [1 ,2 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
关键词
GAN Inversion; Image Editing; Facial Key-points;
D O I
10.1007/978-981-99-8552-4_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, GAN inversion has emerged as a powerful technique for bridging the gap between real and fake image domains, and it has become increasingly important for enabling pre-trained GAN models for real image editing applications. However, current GAN inversion methods are limited by network parameters and model structures, and there is still room for improvement in accurate reconstruction and latent editing tasks. In this paper, we propose a two-stage model that fine-tunes a pre-trained Masked Autoencoder in the first stage and utilizes multi-layers information fusion to obtain an initial global latent code. We then use this latent code as global queries for the subsequent cross-attention-based fusion of local key patch, key point feature, and residual image information in the second stage, guided by facial landmarks. This allows our model to better embed images in the W+ space and perform related attribute editing, achieving better results than current state-of-the-art methods. We conduct extensive experiments to demonstrate the capabilities of our model, as well as the roles of relevant modules, and study the effects of different domain information on inversion.
引用
收藏
页码:146 / 157
页数:12
相关论文
共 50 条
  • [41] Space emitter fine feature identification based on multi-domain fusion
    Wang Xiaohan
    Yan Yi
    Fan Yanan
    Li Xue
    Mou Jiao
    CHINESE SPACE SCIENCE AND TECHNOLOGY, 2023, 43 (04) : 126 - 136
  • [42] Multi-Domain Feature Fusion for Emotion Classification Using DEAP Dataset
    Khateeb, Muhammad
    Anwar, Syed Muhammad
    Alnowami, Majdi
    IEEE ACCESS, 2021, 9 : 12134 - 12142
  • [43] Multi-level Stress Assessment Using Multi-domain Fusion of ECG Signal
    Ahmad, Zeeshan
    Khan, Naimul Mefraz
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 4518 - 4521
  • [44] Multi-domain clustering pruning: Exploring space and frequency similarity based on GAN
    Zhang, Junsan
    Feng, Yeqi
    Wang, Chao
    Shao, Mingwen
    Jiang, Yujie
    Wang, Jian
    NEUROCOMPUTING, 2023, 542
  • [45] Multi-Source Multi-Domain Data Fusion for Cyberattack Detection in Power Systems
    Sahu, Abhijeet
    Mao, Zeyu
    Wlazlo, Patrick
    Huang, Hao
    Davis, Katherine
    Goulart, Ana
    Zonouz, Saman
    IEEE ACCESS, 2021, 9 : 119118 - 119138
  • [46] MDVA-GAN: multi-domain visual attribution generative adversarial networks
    Muhammad Nawaz
    Feras Al-Obeidat
    Abdallah Tubaishat
    Tehseen Zia
    Fahad Maqbool
    Alvaro Rocha
    Neural Computing and Applications, 2023, 35 : 8035 - 8050
  • [47] MDVA-GAN: multi-domain visual attribution generative adversarial networks
    Nawaz, Muhammad
    Al-Obeidat, Feras
    Tubaishat, Abdallah
    Zia, Tehseen
    Maqbool, Fahad
    Rocha, Alvaro
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (11): : 8035 - 8050
  • [48] Unsupervised Multi-Domain Progressive Stain Transfer Guided by Style Encoding Dictionary
    Guan, Xianchao
    Wang, Yifeng
    Lin, Yiyang
    Li, Xi
    Zhang, Yongbing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 767 - 779
  • [49] Modal-Guided Multi-Domain Inconsistency Learning for Face Forgery Detection
    Guo, Zishuo
    Zhang, Baopeng
    Fan, Jack
    Teng, Zhu
    Fan, Jianping
    APPLIED SCIENCES-BASEL, 2025, 15 (01):
  • [50] Multi-Domain Information Exposure using ALTO: The Good, the Bad and the Solution
    Lachos, Danny
    Rothenberg, Christian
    Xiang, Qiao
    Yang, Y. Richard
    Ohlman, Borje
    Randriamasy, Sabine
    Contreras, Luis M.
    Gao, Kai
    PROCEEDINGS OF THE 2020 APPLIED NETWORKING RESEARCH WORKSHOP, ANRW 2020, 2020, : 52 - 54