HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models

被引:0
|
作者
Zhang, Shen [1 ]
Chen, Zhaowei [1 ]
Zhao, Zhenyu [1 ]
Chen, Yuhao [1 ]
Tang, Yao [1 ]
Liang, Jiajun [1 ]
机构
[1] MEGVII Technol, Beijing, Peoples R China
来源
关键词
Higher-Resolution Image Synthesis; High-Efficiency Diffusion;
D O I
10.1007/978-3-031-72983-6_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diffusion models have become a mainstream approach for high-resolution image synthesis. However, directly generating higher-resolution images from pretrained diffusion models will encounter unreasonable object duplication and exponentially increase the generation time. In this paper, we discover that object duplication arises from feature duplication in the deep blocks of the U-Net. Concurrently, We pinpoint the extended generation times to self-attention redundancy in U-Net's top blocks. To address these issues, we propose a tuning-free higher-resolution framework named HiDiffusion. Specifically, HiDiffusion contains Resolution-Aware U-Net (RAU-Net) that dynamically adjusts the feature map size to resolve object duplication and engages Modified Shifted Window Multi-head Self-Attention (MSW-MSA) that utilizes optimized window attention to reduce computations. we can integrate HiDiffusion into various pretrained diffusion models to scale image generation resolutions even to 4096x4096 at 1.5- 6x the inference speed of previous methods. Extensive experiments demonstrate that our approach can address object duplication and heavy computation issues, achieving state-of-the-art performance on higher-resolution image synthesis tasks.
引用
收藏
页码:145 / 161
页数:17
相关论文
共 10 条
  • [1] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
    Kim, Gwanghyun
    Kim, Hayeon
    Seo, Hoigi
    Kang, Dong Un
    Chun, Se Young
    arXiv,
  • [2] BeyondScene: Higher-Resolution Human-Centric Scene Generation with Pretrained Diffusion
    Kim, Gwanghyun
    Kim, Hayeon
    Seo, Hoigi
    Kang, Dong Un
    Chun, Se Young
    COMPUTER VISION - ECCV 2024, PT LXIV, 2025, 15122 : 126 - 142
  • [3] Toward Efficient Calibration of Higher-Resolution Earth System Models
    Fletcher, Christopher G.
    McNally, William
    Virgin, John G.
    King, Fraser
    JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2022, 14 (07)
  • [4] Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
    Guo, Lanqing
    He, Yingqing
    Chen, Haoxin
    Xia, Menghan
    Cun, Xiaodong
    Wang, Yufei
    Huang, Siyu
    Zhang, Yong
    Wang, Xintao
    Chen, Qifeng
    Shan, Ying
    Wen, Bihan
    COMPUTER VISION - ECCV 2024, PT XXXVI, 2025, 15094 : 39 - 55
  • [5] Coarse to superfine: can hyperspectral soil organic carbon models predict higher-resolution information?
    Kabiri, Shayan
    O'Rourke, Sharon M.
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2024, 12
  • [6] Comparison of conventional and higher-resolution reduced-FOV diffusion-weighted imaging of breast tissue
    Baron, Paul
    Wielema, Mirjam
    Dijkstra, Hildebrand
    Potze, Jan Hendrik
    Dorrius, Monique D.
    Sijens, Paul E.
    MAGNETIC RESONANCE MATERIALS IN PHYSICS BIOLOGY AND MEDICINE, 2023, 36 (04) : 613 - 619
  • [7] Comparison of conventional and higher-resolution reduced-FOV diffusion-weighted imaging of breast tissue
    Paul Baron
    Mirjam Wielema
    Hildebrand Dijkstra
    Jan Hendrik Potze
    Monique D. Dorrius
    Paul E. Sijens
    Magnetic Resonance Materials in Physics, Biology and Medicine, 2023, 36 : 613 - 619
  • [8] Enhancing Regional Seismic Velocity Models With Higher-Resolution Local Results Using Sparse Dictionary Learning
    Zhang, Hao
    Ben-Zion, Yehuda
    JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2024, 129 (01)
  • [9] Enhancing 5G MIMO Core Spectral Efficiency with Higher-Resolution Multi-User MIMO and Multi-Beam Operation
    Onggosanusi E.
    Zhang M.
    Zhang Y.
    Kang J.
    IEEE Communications Standards Magazine, 2022, 6 (01): : 20 - 26
  • [10] SODIUM CLUSTER IONIZATION-POTENTIALS REVISITED - HIGHER-RESOLUTION MEASUREMENTS FOR NAX (X-LESS-THAN-23) AND THEIR RELATION TO BONDING MODELS
    KAPPES, MM
    SCHAR, M
    ROTHLISBERGER, U
    YERETZIAN, C
    SCHUMACHER, E
    CHEMICAL PHYSICS LETTERS, 1988, 143 (03) : 251 - 258