Masked Image Training for Generalizable Deep Image Denoising

被引:60
作者
Chen, Haoyu [1 ]
Gu, Jinjin [2 ,3 ]
Liu, Yihao [2 ,4 ,5 ]
Magid, Salma Abdel [7 ]
Dong, Chao [2 ,4 ]
Wang, Qiong [6 ]
Pfister, Hanspeter [7 ]
Zhu, Lei [1 ,8 ]
机构
[1] Hong Kong Univ Sci & Technol Guangzhou, Hong Kong, Peoples R China
[2] Shanghai AI Lab, Shanghai, Peoples R China
[3] Univ Sydney, Sydney, Australia
[4] Chinese Acad Sci, Shenzhen Inst Adv Technol, ShenZhen Key Lab Comp Vis & Pattern Recognit, Beijing, Peoples R China
[5] Univ Chinese Acad Sci, Beijing, Peoples R China
[6] Chinese Acad Sci, Shenzhen Inst Adv Technol, Guangdong Prov Key Lab Comp Vision & Virtual Rea, Beijing, Peoples R China
[7] Harvard Univ, Cambridge, MA USA
[8] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年
基金
中国国家自然科学基金;
关键词
SPARSE;
D O I
10.1109/CVPR52729.2023.00169
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When capturing and storing images, devices inevitably introduce noise. Reducing this noise is a critical task called image denoising. Deep learning has become the de facto method for image denoising, especially with the emergence of Transformer-based models that have achieved notable state-of-the-art results on various image tasks. However, deep learning-based methods often suffer from a lack of generalization ability. For example, deep models trained on Gaussian noise may perform poorly when tested on other noise distributions. To address this issue, we present a novel approach to enhance the generalization performance of denoising networks, known as masked training. Our method involves masking random pixels of the input image and reconstructing the missing information during training. We also mask out the features in the self-attention layers to avoid the impact of training-testing inconsistency. Our approach exhibits better generalization ability than other deep learning models and is directly applicable to real-world scenarios. Additionally, our interpretability analysis demonstrates the superiority of our method.
引用
收藏
页码:1692 / 1703
页数:12
相关论文
共 86 条
[1]   A High-Quality Denoising Dataset for Smartphone Cameras [J].
Abdelhamed, Abdelrahman ;
Lin, Stephen ;
Brown, Michael S. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1692-1700
[2]  
[Anonymous], 2022, CVPR, DOI DOI 10.1109/CVPR52688.2022.00564
[3]  
[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00583
[4]  
[Anonymous], 2020, INT C MACH LEARN
[5]   Real Image Denoising with Feature Attention [J].
Anwar, Saeed ;
Barnes, Nick .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3155-3164
[6]  
Arbelaez P., 2010, TPAMI
[7]  
Bao Hangbo, 2021, PROC INT C LEARN REP
[8]   Unprocessing Images for Learned Raw Denoising [J].
Brooks, Tim ;
Mildenhall, Ben ;
Xue, Tianfan ;
Chen, Jiawen ;
Sharlet, Dillon ;
Barron, Jonathan T. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11028-11037
[9]  
Brown TB, 2020, ADV NEUR IN, V33
[10]   A non-local algorithm for image denoising [J].
Buades, A ;
Coll, B ;
Morel, JM .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :60-65