DMDiff: A Dual-Branch Multimodal Conditional Guided Diffusion Model for Cloud Removal Through SAR-Optical Data Fusion

被引：3

作者：

Zhang, Wenjuan ^{[1
]}

Mei, Junlin ^{[1
,2
]}

Wang, Yuxi ^{[1
,2
,3
]}

机构：

[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China

来源：

REMOTE SENSING | 2025年 / 17卷 / 06期

基金：

中国国家自然科学基金;

关键词：

diffusion models; cloud removal; synthetic aperture radar (SAR); remote sensing image; SENTINEL-2; IMAGERY; COVER;

D O I：

10.3390/rs17060965

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Optical remote sensing images, as a significant data source for Earth observation, are often impacted by cloud cover, which severely limits their widespread application in Earth sciences. Synthetic aperture radar (SAR), with its all-weather, all-day observation capabilities, serves as a valuable auxiliary data source for cloud removal (CR) tasks. Despite substantial progress in deep learning (DL)-based CR methods utilizing SAR data in recent years, challenges remain in preserving fine texture details and maintaining image visual authenticity. To address these limitations, this study proposes a novel diffusion-based CR method called the Dual-branch Multimodal Conditional Guided Diffusion Model (DMDiff). Considering the intrinsic differences in data characteristics between SAR and optical images, we design a dual-branch feature extraction architecture to enable adaptive feature extraction based on the characteristics of the data. Then, a cross-attention mechanism is employed to achieve deep fusion of the multimodal feature extracted above, effectively guiding the progressive diffusion process to restore cloud-covered regions in optical images. Furthermore, we propose an image adaptive prediction (IAP) strategy within the diffusion model, specifically tailored to the characteristics of remote sensing data, which achieves a nearly 20 dB improvement in PSNR compared to the traditional noise prediction (NP) strategy. Extensive experiments on the airborne, WHU-OPT-SAR, and LuojiaSET-OSFCR datasets demonstrate that DMDiff outperforms SOTA methods in terms of both signal fidelity and visual perceptual quality. Specifically, on the LuojiaSET-OSFCR dataset, our method achieves a remarkable 17% reduction in the FID metric over the second-best method, while also yielding significant enhancements in quality assessment metrics such as PSNR and SSIM.

引用

页数：33

共 52 条

[1] Cloud cover in Landsat observations of the Brazilian Amazon [J].

Asner, GP .

INTERNATIONAL JOURNAL OF REMOTE SENSING, 2001, 22 (18) :3855-3862

[2] Conditional Diffusion for SAR to Optical Image Translation [J].

Bai, Xinyu ;

Pu, Xinyang ;

Xu, Feng .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 :1-5

[3] SAR TO OPTICAL IMAGE SYNTHESIS FOR CLOUD REMOVAL WITH GENERATIVE ADVERSARIAL NETWORKS [J].

Bermudez, J. D. ;

Happ, P. N. ;

Oliveira, D. A. B. ;

Feitosa, R. Q. .

ISPRS TC I MID-TERM SYMPOSIUM INNOVATIVE SENSING - FROM SENSORS TO METHODS AND APPLICATIONS, 2018, 4-1 :5-11

[4] Dual Aggregation Transformer for Image Super-Resolution [J].

Chen, Zheng ;

Zhang, Yulun ;

Gu, Jinjin ;

Kong, Linghe ;

Yang, Xiaokang ;

Yu, Fisher .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :12278-12287

[5]

Dhariwal P, 2021, ADV NEUR IN, V34

[6]

Ding H., 2022, P AS C COMP VIS, P469

[7] Structure and Content-Guided Video Synthesis with Diffusion Models [J].

Esser, Patrick ;

Chiu, Johnathan ;

Atighehchian, Parmida ;

Granskog, Jonathan ;

Germanidis, Anastasis .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, :7312-7322

[8] Cloud Removal with Fusion of High Resolution Optical and SAR Images Using Generative Adversarial Networks [J].

Gao, Jianhao ;

Yuan, Qiangqiang ;

Li, Jie ;

Zhang, Hai ;

Su, Xin .

REMOTE SENSING, 2020, 12 (01)

[9] Implicit Diffusion Models for Continuous Super-Resolution [J].

Gao, Sicheng ;

Liu, Xuhui ;

Zeng, Bohan ;

Xu, Sheng ;

Li, Yanjing ;

Luo, Xiaoyan ;

Liu, Jianzhuang ;

Zhen, Xiantong ;

Zhang, Baochang .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :10021-10030

[10]

Grohnfeldt C, 2018, INT GEOSCI REMOTE SE, P1726, DOI 10.1109/IGARSS.2018.8519215

← 1 2 3 4 5 6 →