Multi-domain Information Fusion for Key-Points Guided GAN Inversion

被引:0
|
作者
Xu, Ruize [1 ]
Qiu, Xiaowen [2 ]
He, Boan [2 ]
Ge, Weifeng [2 ]
Zhang, Wenqiang [1 ,2 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
关键词
GAN Inversion; Image Editing; Facial Key-points;
D O I
10.1007/978-981-99-8552-4_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, GAN inversion has emerged as a powerful technique for bridging the gap between real and fake image domains, and it has become increasingly important for enabling pre-trained GAN models for real image editing applications. However, current GAN inversion methods are limited by network parameters and model structures, and there is still room for improvement in accurate reconstruction and latent editing tasks. In this paper, we propose a two-stage model that fine-tunes a pre-trained Masked Autoencoder in the first stage and utilizes multi-layers information fusion to obtain an initial global latent code. We then use this latent code as global queries for the subsequent cross-attention-based fusion of local key patch, key point feature, and residual image information in the second stage, guided by facial landmarks. This allows our model to better embed images in the W+ space and perform related attribute editing, achieving better results than current state-of-the-art methods. We conduct extensive experiments to demonstrate the capabilities of our model, as well as the roles of relevant modules, and study the effects of different domain information on inversion.
引用
收藏
页码:146 / 157
页数:12
相关论文
共 50 条
  • [1] Decentralized Policy Information Points for Multi-Domain Environments
    Rahman, M. Ridwanur
    Salehi, Ahmad S.
    Rudolph, Carsten
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1286 - 1293
  • [2] Multi-Domain Information Fusion for Insider Threat Detection
    Eldardiry, Hoda
    Bart, Evgeniy
    Liu, Juan
    Hanley, John
    Price, Bob
    Brdiczka, Oliver
    IEEE CS SECURITY AND PRIVACY WORKSHOPS (SPW 2013), 2013, : 45 - 51
  • [3] Isoelectric points of multi-domain proteins
    Carugo, Oliviero
    BIOINFORMATION, 2007, 2 (03) : 101 - 104
  • [4] Steganalysis of AMR Speech Stream Based on Multi-Domain Information Fusion
    Guo, Chuanpeng
    Yang, Wei
    Huang, Liusheng
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4077 - 4090
  • [5] Fault diagnosis of blade crack based on multi-domain feature and information fusion
    Ma, Tianchi
    Shen, Junxian
    Song, Di
    Xu, Feiyun
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2024, 54 (06): : 1567 - 1573
  • [6] Multi-domain Public Key Infrastructure for Information Security with Use of a Multi-Agent System
    Aye, Nilar
    Khin, Hlaing Su
    Win, Toe Toe
    KoKo, Tayzar
    Than, MoMo Zin
    Hattori, Fumio
    Kuwabara, Kazuhiro
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT I,, 2013, 7802 : 365 - 374
  • [7] A multi-classification algorithm based on multi-domain information fusion for motor imagery BCI
    Wang, Jiaqi
    Chen, Wanzhong
    Li, Mingyang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [8] Infrastructure key to successful multi-domain operations
    McCullough, Amy, 2018, Air Force Association (101):
  • [9] Learning Geometric Information via Transformer Network for Key-Points Based Motion Segmentation
    Li, Qiming
    Cheng, Jinghang
    Gao, Yin
    Li, Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 7856 - 7869
  • [10] Intelligent identification of radar active jamming type based on multi-domain information fusion
    Cao, Fei
    Gao, Zejun
    He, Chuan
    Feng, Xiaowei
    Xu, Jianfeng
    Xue, Chunling
    Qin, Jianqiang
    Journal of Physics: Conference Series, 2023, 2480 (01):