Controllable image synthesis methods, applications and challenges: a comprehensive survey

被引:0
作者
Huang, Shanshan [1 ]
Li, Qingsong [1 ]
Liao, Jun [1 ]
Wang, Shu [3 ]
Liu, Li [1 ]
Li, Lian [2 ]
机构
[1] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 400000, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci Informat Engn, Hefei 230601, Peoples R China
[3] Southwest Univ, Sch Mat & Energy, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Controllable image synthesis; Deep generative model; Causal learning; GAN inversion; Interpretable representation learning; Artificial intelligence-generated content; ADVERSARIAL NETWORKS; GAN INVERSION; TRANSLATION; GENERATION;
D O I
10.1007/s10462-024-10987-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Controllable Image Synthesis (CIS) is a methodology that allows users to generate desired images or manipulate specific attributes of images by providing precise input conditions or modifying latent representations. In recent years, CIS has attracted considerable attention in the field of image processing, with significant advances in consistency, controllability and harmony. However, several challenges still remain, particularly regarding the fine-grained controllability and interpretability of synthesized images. In this paper, we comprehensively and systematically review the CIS from problem definition, taxonomy and evaluation systems to existing challenges and future research directions. First, the definition of CIS is given, and several representative deep generative models are introduced in detail. Second, the existing CIS methods are divided into three categories according to the different control manners used and discuss the typical work in each category critically. Furthermore, we introduce the public datasets and evaluation metrics commonly used in image synthesis and analyze the representative CIS methods. Finally, we present several open issues and discuss the future research direction of CIS.
引用
收藏
页数:46
相关论文
共 33 条
  • [21] Utility-Scale Energy Storage Systems: A Comprehensive Review of Their Applications, Challenges, and Future Directions
    Luo, Wensheng
    Stynski, Sebastian
    Chub, Andrii
    Franquelo, Leopoldo Garcia
    Malinowski, Mariusz
    Vinnikov, Dmitri
    IEEE INDUSTRIAL ELECTRONICS MAGAZINE, 2021, 15 (04) : 17 - 27
  • [22] A survey and taxonomy of adversarial neural networks for text-to-image synthesis
    Agnese, Jorge
    Herrera, Jonathan
    Tao, Haicheng
    Zhu, Xingquan
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (04)
  • [23] A Survey of Visual Analytics Techniques and Applications: State-of-the-Art Research and Future Challenges
    Sun, Guo-Dao
    Wu, Ying-Cai
    Liang, Rong-Hua
    Liu, Shi-Xia
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2013, 28 (05) : 852 - 867
  • [24] Machine learning for autonomous vehicle's trajectory prediction: A comprehensive survey, challenges, and future research directions
    Bharilya, Vibha
    Kumar, Neetesh
    VEHICULAR COMMUNICATIONS, 2024, 46
  • [25] Silver nanoparticles: Synthesis methods, bio-applications and properties
    Abbasi, Elham
    Milani, Morteza
    Aval, Sedigheh Fekri
    Kouhi, Mohammad
    Akbarzadeh, Abolfazl
    Nasrabadi, Hamid Tayefi
    Nikasa, Parisa
    Joo, San Woo
    Hanifehpour, Younes
    Nejati-Koshki, Kazem
    Samiei, Mohammad
    CRITICAL REVIEWS IN MICROBIOLOGY, 2016, 42 (02) : 173 - 180
  • [26] Potential of generative adversarial net algorithms in image and video processing applications- a survey
    Sharma, Akanksha
    Jindal, Neeru
    Rana, P. S.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) : 27407 - 27437
  • [27] A comprehensive review on MRI to CT and MRI to PET image synthesis using deep learning
    Meharban, M. S.
    Sabu, M. K.
    Santhanakrishnan, T.
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2023, 43 (03) : 207 - 232
  • [28] Cell-free protein synthesis system for bioanalysis: Advances in methods and applications
    Gu, Yanqiu
    Fan, Fang
    Liu, Yue
    Chai, Yifeng
    Yuan, Yongfang
    Chen, Xiaofei
    TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2023, 161
  • [29] Evolving typical meteorological year (TMY) data for building energy simulation: a comprehensive review of methods, challenges, and future directions
    Rady, Mohammed
    Muhammad, Mohd Khairul Idlan
    Shahid, Shamsuddin
    ADVANCES IN BUILDING ENERGY RESEARCH, 2025,
  • [30] A comprehensive survey on image captioning: from handcrafted to deep learning-based techniques, a taxonomy and open research issues
    Sharma, Himanshu
    Padha, Devanand
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (11) : 13619 - 13661