Fine-Grained Image Editing Using ControlNet: Expanding Possibilities in Visual Manipulation

被引:0
|
作者
Xu, Longfei [1 ]
Huang, Hongbo [1 ]
Zhao, Yushuang [1 ]
Pan, Shuwen [1 ]
Zheng, Yaolin [1 ]
Yan, Xiaoxu [1 ]
Huang, Linkai [1 ]
Wu, Lishan [1 ]
机构
[1] Beijing Informat Sci & Technol Univ, Beijing, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024 | 2024年 / 14867卷
基金
中国国家自然科学基金;
关键词
Diffusion Probabilistic Model; Controlnet; Image Editing;
D O I
10.1007/978-981-97-5597-4_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, diffusion probabilistic models have emerged as a hot topic in computer vision. Image creation programs such as Imagen, Latent Diffusion Models, and Stable Diffusion have shown outstanding generative powers, sparking considerable community discussions. They frequently, however, lack the ability to precisely modify real-world images. In this paper, we propose a novel ControlNet-based image editing framework that enables alteration of real images based on pose maps, scribbling maps, and other features without the need for training or fine-tuning. Given a guiding image as input, we edit the initial noise generated from the guiding image to influence the generation process. Then features extracted from the guiding image are directly injected into the generation process of the translated image. We also construct a classifier guidance based on strong correspondences between intermediate features of the ControlNet branches. The editing signals are converted into gradients to guide the sampling direction. At the end of this paper, we demonstrate high-quality results of our proposed model in image editing tasks.
引用
收藏
页码:27 / 38
页数:12
相关论文
共 50 条
  • [1] Efficient Image Embedding for Fine-Grained Visual Classification
    Payatsuporn, Soranan
    Kijsirikul, Boonserm
    2022-14TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST 2022), 2022, : 40 - 45
  • [2] Fine-Grained Image Style Transfer with Visual Transformers
    Wang, Jianbo
    Yang, Huan
    Fu, Jianlong
    Yamasaki, Toshihiko
    Guo, Baining
    COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 427 - 443
  • [3] Image Manipulation Localization Using SpatialChannel Fusion Excitation and Fine-Grained Feature Enhancement
    Li, Fengyong
    Zhai, Huajun
    Zhang, Xinpeng
    Qin, Chuan
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 14
  • [4] Fine-grained Image-to-Image Transformation towards Visual Recognition
    Xiong, Wei
    He, Yutong
    Zhang, Yixuan
    Luo, Wenhan
    Ma, Lin
    Luo, Jiebo
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5839 - 5848
  • [5] Leveraging Fine-Grained Labels to Regularize Fine-Grained Visual Classification
    Wu, Junfeng
    Yao, Li
    Liu, Bin
    Ding, Zheyuan
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION (ICCMS 2019) AND 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS (ICICA 2019), 2019, : 133 - 136
  • [6] Fine-Grained Visual Entailment
    Thomas, Christopher
    Zhang, Yipeng
    Chang, Shih-Fu
    COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 398 - 416
  • [7] Fine-Grained Visual Prompting
    Yang, Lingfeng
    Wang, Yueze
    Li, Xiang
    Wang, Xinlong
    Yang, Jian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Fine-grained Image Classification by Visual-Semantic Embedding
    Xu, Huapeng
    Qi, Guilin
    Li, Jingjing
    Wang, Meng
    Xu, Kang
    Gao, Huan
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1043 - 1049
  • [9] Fine-Grained Image Search
    Xie, Lingxi
    Wang, Jingdong
    Zhang, Bo
    Tian, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (05) : 636 - 647
  • [10] LoopNet for fine-grained fashion attributes editing
    Zou, Xingxing
    Zhu, Shumin
    Wong, Wai Keung
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259