Multistage Curvature-Guided Network for Progressive Single Image Reflection Removal

被引:18
作者
Song, Binbin [1 ,2 ]
Zhou, Jiantao [1 ,2 ]
Wu, Haiwei [1 ,2 ]
机构
[1] Univ Macau, State Key Lab Internet Things Smart City, Fac Sci & Technol, Macau, Peoples R China
[2] Univ Macau, Dept Comp & Informat Sci, Fac Sci & Technol, Macau, Peoples R China
关键词
Decoding; Semantics; Image restoration; Feature extraction; Training data; Training; Network architecture; Single image reflection removal; multi-stage network; curvature guidance; non-local attention; PHYSICALLY-BASED APPROACH; SEPARATION;
D O I
10.1109/TCSVT.2022.3168828
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Thanks to the powerful learning capability, deep neural networks (DNNs) have acquired broad applications in single image reflection removal. The DNN-based algorithms relax the constraints of specific priors and learn to generate visually pleasant background layers from massive training data. However, most of them employ a single network structure to recover both the semantic information and local details of the background, which may lead to obvious reflection residue or even failure. To mitigate this deficiency, in this work, we propose a Multi-stage Curvature-guided De-Reflection Network (MCDRNet), which combines multiple network architectures in a unified framework to progressively reconstruct the background layer and refine the fine-grained details. Our framework consists of three stages, where the encoder-decoders are exploited in the first two stages to recover the semantic components of background layers with lower scales and a variant ResNet is applied in the last stage to refine the background details with the original input resolution. In the first two stages, to introduce the structural guidance for the reflection removal, we cascade another decoder branch to restore the curvature map of the background. In addition, at the end of the first two stages, instead of directly passing the intermediate estimates to the next stage, we propose a Non-local Attention Module (NAM) to augment and transmit the features from decoders. Extensive experimental results on several public datasets demonstrate that the proposed MCDRNet outperforms the state-of-the-art methods quantitatively and generates visually better reflection removal results. The source code and pre-trained models are available at https://github.com/NamecantbeNULL/MCDRNet.
引用
收藏
页码:6515 / 6529
页数:15
相关论文
共 55 条
[1]   Removing photography artifacts using gradient projection and flash-exposure sampling [J].
Agrawal, A ;
Raskar, R ;
Nayar, SK ;
Li, YZ .
ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03) :828-835
[2]  
[Anonymous], 2015, INT C LEARN REPR
[3]   Single Image Reflection Suppression [J].
Arvanitopoulos, Nikolaos ;
Achanta, Radhakrishna ;
Susstrunk, Sabine .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1752-1760
[4]   Graph Cuts for Curvature Based Image Denoising [J].
Bae, Egil ;
Shi, Juan ;
Tai, Xue-Cheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (05) :1199-1210
[5]   Single-Image Blind Deblurring Using Multi-Scale Latent Structure Prior [J].
Bai, Yuanchao ;
Jia, Huizhu ;
Jiang, Ming ;
Liu, Xianming ;
Xie, Xiaodong ;
Gao, Wen .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) :2033-2045
[6]   A non-local algorithm for image denoising [J].
Buades, A ;
Coll, B ;
Morel, JM .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :60-65
[8]  
Chang Y., 2020, INT J COMPUT VISION, V128, P1
[9]   Removing Object Reflections in Videos by Global Optimization [J].
Conte, Donatello ;
Foggia, Pasquale ;
Percannella, Gennaro ;
Vento, Mario .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (11) :1623-1633
[10]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893