DMDiff: A Dual-Branch Multimodal Conditional Guided Diffusion Model for Cloud Removal Through SAR-Optical Data Fusion

被引:3
作者
Zhang, Wenjuan [1 ]
Mei, Junlin [1 ,2 ]
Wang, Yuxi [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China
基金
中国国家自然科学基金;
关键词
diffusion models; cloud removal; synthetic aperture radar (SAR); remote sensing image; SENTINEL-2; IMAGERY; COVER;
D O I
10.3390/rs17060965
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Optical remote sensing images, as a significant data source for Earth observation, are often impacted by cloud cover, which severely limits their widespread application in Earth sciences. Synthetic aperture radar (SAR), with its all-weather, all-day observation capabilities, serves as a valuable auxiliary data source for cloud removal (CR) tasks. Despite substantial progress in deep learning (DL)-based CR methods utilizing SAR data in recent years, challenges remain in preserving fine texture details and maintaining image visual authenticity. To address these limitations, this study proposes a novel diffusion-based CR method called the Dual-branch Multimodal Conditional Guided Diffusion Model (DMDiff). Considering the intrinsic differences in data characteristics between SAR and optical images, we design a dual-branch feature extraction architecture to enable adaptive feature extraction based on the characteristics of the data. Then, a cross-attention mechanism is employed to achieve deep fusion of the multimodal feature extracted above, effectively guiding the progressive diffusion process to restore cloud-covered regions in optical images. Furthermore, we propose an image adaptive prediction (IAP) strategy within the diffusion model, specifically tailored to the characteristics of remote sensing data, which achieves a nearly 20 dB improvement in PSNR compared to the traditional noise prediction (NP) strategy. Extensive experiments on the airborne, WHU-OPT-SAR, and LuojiaSET-OSFCR datasets demonstrate that DMDiff outperforms SOTA methods in terms of both signal fidelity and visual perceptual quality. Specifically, on the LuojiaSET-OSFCR dataset, our method achieves a remarkable 17% reduction in the FID metric over the second-best method, while also yielding significant enhancements in quality assessment metrics such as PSNR and SSIM.
引用
收藏
页数:33
相关论文
共 52 条
[1]   Cloud cover in Landsat observations of the Brazilian Amazon [J].
Asner, GP .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2001, 22 (18) :3855-3862
[2]   Conditional Diffusion for SAR to Optical Image Translation [J].
Bai, Xinyu ;
Pu, Xinyang ;
Xu, Feng .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 :1-5
[3]   SAR TO OPTICAL IMAGE SYNTHESIS FOR CLOUD REMOVAL WITH GENERATIVE ADVERSARIAL NETWORKS [J].
Bermudez, J. D. ;
Happ, P. N. ;
Oliveira, D. A. B. ;
Feitosa, R. Q. .
ISPRS TC I MID-TERM SYMPOSIUM INNOVATIVE SENSING - FROM SENSORS TO METHODS AND APPLICATIONS, 2018, 4-1 :5-11
[4]   Dual Aggregation Transformer for Image Super-Resolution [J].
Chen, Zheng ;
Zhang, Yulun ;
Gu, Jinjin ;
Kong, Linghe ;
Yang, Xiaokang ;
Yu, Fisher .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :12278-12287
[5]  
Dhariwal P, 2021, ADV NEUR IN, V34
[6]  
Ding H., 2022, P AS C COMP VIS, P469
[7]   Structure and Content-Guided Video Synthesis with Diffusion Models [J].
Esser, Patrick ;
Chiu, Johnathan ;
Atighehchian, Parmida ;
Granskog, Jonathan ;
Germanidis, Anastasis .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, :7312-7322
[8]   Cloud Removal with Fusion of High Resolution Optical and SAR Images Using Generative Adversarial Networks [J].
Gao, Jianhao ;
Yuan, Qiangqiang ;
Li, Jie ;
Zhang, Hai ;
Su, Xin .
REMOTE SENSING, 2020, 12 (01)
[9]   Implicit Diffusion Models for Continuous Super-Resolution [J].
Gao, Sicheng ;
Liu, Xuhui ;
Zeng, Bohan ;
Xu, Sheng ;
Li, Yanjing ;
Luo, Xiaoyan ;
Liu, Jianzhuang ;
Zhen, Xiantong ;
Zhang, Baochang .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :10021-10030
[10]  
Grohnfeldt C, 2018, INT GEOSCI REMOTE SE, P1726, DOI 10.1109/IGARSS.2018.8519215