CRS-Diff: Controllable Remote Sensing Image Generation With Diffusion Model

被引:3
作者
Tang, Datao [1 ,2 ]
Cao, Xiangyong [1 ,2 ]
Hou, Xingsong [3 ]
Jiang, Zhongyuan [4 ]
Liu, Junmin [5 ]
Meng, Deyu [2 ,5 ,6 ]
机构
[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Key Lab Intelligent Networks & Network Secur, Minist Educ, Xian 710049, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Informat & Commun Engn, Xian 710049, Shaanxi, Peoples R China
[4] Xidian Univ, Sch Cyber Engn, Xian 710049, Shaanxi, Peoples R China
[5] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Shaanxi, Peoples R China
[6] Macau Univ Scienceand Technol, Macao Inst Syst Engn, Taipa, Macao, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Diffusion models; Image synthesis; Image resolution; Text to image; Remote sensing; Training; Task analysis; Controllable generation; deep learning; diffusion model; remote sensing (RS) image;
D O I
10.1109/TGRS.2024.3453414
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The emergence of generative models has revolutionized the field of remote sensing (RS) image generation. Despite generating high-quality images, existing methods are limited in relying mainly on text control conditions, and thus do not always generate images accurately and stably. In this article, we propose CRS-Diff, a new RS generative framework specifically tailored for RS image generation, leveraging the inherent advantages of diffusion models while integrating more advanced control mechanisms. Specifically, CRS-Diff can simultaneously support text-condition, metadata-condition, and image-condition control inputs, thus enabling more precise control to refine the generation process. To effectively integrate multiple condition control information, we introduce a new conditional control mechanism to achieve multiscale feature fusion (FF), thus enhancing the guiding effect of control conditions. To the best of our knowledge, CRS-Diff is the first multiple-condition controllable RS generative model. Experimental results in single-condition and multiple-condition cases have demonstrated the superior ability of our CRS-Diff to generate RS images both quantitatively and qualitatively compared with previous methods. Additionally, our CRS-Diff can serve as a data engine that generates high-quality training data for downstream tasks, e.g., road extraction. The code is available at https://github.com/Sonettoo/CRS-Diff.
引用
收藏
页数:14
相关论文
共 58 条
  • [21] Mou Chong, 2023, ARXIV
  • [22] Nabiyev N., 2023, ARXIV
  • [23] Nichol Alex, 2021, arXiv
  • [24] Podell Dustin, 2023, ARXIV
  • [25] Radford A, 2021, PR MACH LEARN RES, V139
  • [26] Ramesh A., 2022, arXiv
  • [27] Ramesh A, 2021, PR MACH LEARN RES, V139
  • [28] Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer
    Ranftl, Rene
    Lasinger, Katrin
    Hafner, David
    Schindler, Konrad
    Koltun, Vladlen
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1623 - 1637
  • [29] Reed S, 2016, ADV NEUR IN, V29
  • [30] Reed S, 2016, PR MACH LEARN RES, V48