Multi-domain Information Fusion for Key-Points Guided GAN Inversion

Cited by: 0
Authors
Xu, Ruize [1]
Qiu, Xiaowen [2]
He, Boan [2]
Ge, Weifeng [2]
Zhang, Wenqiang [1, 2]
Affiliations
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
Keywords
GAN Inversion; Image Editing; Facial Key-points;
DOI
10.1007/978-981-99-8552-4_12
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, GAN inversion has emerged as a powerful technique for bridging the gap between real and fake image domains, and it has become increasingly important for applying pre-trained GAN models to real-image editing. However, current GAN inversion methods are limited by network parameters and model structures, and there is still room for improvement in accurate reconstruction and latent editing tasks. In this paper, we propose a two-stage model. In the first stage, it fine-tunes a pre-trained Masked Autoencoder and uses multi-layer information fusion to obtain an initial global latent code. In the second stage, this latent code serves as the global queries for a cross-attention-based fusion of local key-patch, key-point feature, and residual-image information, guided by facial landmarks. This allows our model to better embed images in the W+ space and perform related attribute editing, achieving better results than current state-of-the-art methods. We conduct extensive experiments to demonstrate the capabilities of our model and the roles of the relevant modules, and we study the effects of different domain information on inversion.
Pages: 146-157
Page count: 12
Related papers
50 in total
  • [31] Stalled information based routing in multi-domain multilayer networks
    Szigeti, J
    Tapolcai, J
    Cinkler, T
    Henk, T
    Sallai, G
    NETWORKS 2004 11TH INTERNATIONAL TELECOMMUNICATIONS NETWORK STRATEGY AND PLANNING SYMPOSIUM, PROCEEDINGS, 2004, : 297 - 302
  • [32] TOWARDS SCALABLE INFORMATION-SEEKING MULTI-DOMAIN DIALOGUE
    Papangelis, Alexandros
    Kotti, Margarita
    Stylianou, Yannis
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6064 - 6068
  • [33] Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
    Gomez, Raul
    Liu, Yahui
    De Nadai, Marco
    Karatzas, Dimosthenis
    Lepri, Bruno
    Sebe, Nicu
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3164 - 3172
  • [34] Exploiting Multi-domain Visual Information for Fake News Detection
    Qi, Peng
    Cao, Juan
    Yang, Tianyun
    Guo, Junbo
    Li, Jintao
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 518 - 527
  • [35] Memory-Guided Multi-View Multi-Domain Fake News Detection
    Zhu, Yongchun
    Sheng, Qiang
    Cao, Juan
    Nan, Qiong
    Shu, Kai
    Wu, Minghui
    Wang, Jindong
    Zhuang, Fuzhen
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 7178 - 7191
  • [36] Multi-domain Public Key Infrastructure for Vehicle-to-Grid Network
    Vaidya, Binod
    Makrakis, Dimitrios
    Mouftah, Hussein T.
    2015 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2015), 2015, : 1572 - 1577
  • [37] Security-proved authenticated key agreement protocol for multi-domain
    Zhu, Hui
    Li, Hui
    Wang, Yumin
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2009, 37 (05): : 53 - 56
  • [38] Electricity theft detection method based on multi-domain feature fusion
    Zhao, Hong-shan
    Sun, Cheng-yan
    Ma, Li-bo
    Xue, Yang
    Guo, Xiao-mei
    Chang, Jie-ying
    IET SCIENCE MEASUREMENT & TECHNOLOGY, 2023, 17 (03) : 93 - 104
  • [39] Design, engineering and preparation of a multi-domain fusion vector for gene delivery
    Sadeghian, Faranak
    Hosseinkhani, Saman
    Alizadeh, Abdolali
    Hatefi, Arash
    INTERNATIONAL JOURNAL OF PHARMACEUTICS, 2012, 427 (02) : 393 - 399
  • [40] Multi-Domain Feature Fusion for Emotion Classification Using DEAP Dataset
    Khateeb, Muhammad
    Anwar, Syed Muhammad
    Alnowami, Majdi
    IEEE Access, 2021, 9 : 12134 - 12142