Learning Spatially Variant Linear Representation Models for Joint Filtering

被引:29
作者
Dong, Jiangxin [1 ]
Pan, Jinshan [1 ]
Ren, Jimmy S. [2 ,3 ]
Lin, Liang [4 ]
Tang, Jinhui [1 ]
Yang, Ming-Hsuan [5 ,6 ,7 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
[3] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai 200240, Peoples R China
[4] Sun Yat Sen Univ, Guangzhou 510275, Peoples R China
[5] Univ Calif Merced, Merced, CA 95344 USA
[6] Yonsei Univ, Seoul 03722, South Korea
[7] Google, Mountain View, CA 94043 USA
基金
中国国家自然科学基金; 国家重点研发计划; 美国国家科学基金会;
关键词
Task analysis; Convolutional neural networks; Image restoration; Computational modeling; Linear programming; Optimization; Kernel; Spatially variant linear representation model; convolutional neural network; joint filtering; DEPTH ENHANCEMENT; IMAGE;
D O I
10.1109/TPAMI.2021.3102575
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Joint filtering mainly uses an additional guidance image as a prior and transfers its structures to the target image in the filtering process. Different from existing approaches that rely on local linear models or hand-designed objective functions to extract the structural information from the guidance image, we propose a new joint filtering method based on a spatially variant linear representation model (SVLRM), where the target image is linearly represented by the guidance image. However, learning SVLRMs for vision tasks is a highly ill-posed problem. To estimate the spatially variant linear representation coefficients, we develop an effective approach based on a deep convolutional neural network (CNN). As such, the proposed deep CNN (constrained by the SVLRM) is able to model the structural information of both the guidance and input images. We show that the proposed approach can be effectively applied to a variety of applications, including depth/RGB image upsampling and restoration, flash deblurring, natural image denoising, and scale-aware filtering. In addition, we show that the linear representation model can be extended to high-order representation models (e.g., quadratic and cubic polynomial representations). Extensive experimental results demonstrate that the proposed method performs favorably against the state-of-the-art methods that have been specifically designed for each task.
引用
收藏
页码:8355 / 8370
页数:16
相关论文
共 79 条
[71]   Dual Adversarial Network: Toward Real-World Noise Removal and Noise Generation [J].
Yue, Zongsheng ;
Zhao, Qian ;
Zhang, Lei ;
Meng, Deyu .
COMPUTER VISION - ECCV 2020, PT X, 2020, 12355 :41-58
[72]   FFDNet: Toward a Fast and Flexible Solution for CNN-Based Image Denoising [J].
Zhang, Kai ;
Zuo, Wangmeng ;
Zhang, Lei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (09) :4608-4622
[73]   Learning Deep CNN Denoiser Prior for Image Restoration [J].
Zhang, Kai ;
Zuo, Wangmeng ;
Gu, Shuhang ;
Zhang, Lei .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2808-2817
[74]   Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising [J].
Zhang, Kai ;
Zuo, Wangmeng ;
Chen, Yunjin ;
Meng, Deyu ;
Zhang, Lei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (07) :3142-3155
[75]   100+Times FasterWeighted Median Filter (WMF) [J].
Zhang, Qi ;
Xu, Li ;
Jia, Jiaya .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2830-2837
[76]  
Zhang Q, 2014, LECT NOTES COMPUT SC, V8691, P815, DOI 10.1007/978-3-319-10578-9_53
[77]  
Zhu J, 2017, IEEE IMAGE PROC, P4068, DOI 10.1109/ICIP.2017.8297047
[78]   Robust Flash Deblurring [J].
Zhuo, Shaojie ;
Guo, Dong ;
Sim, Terence .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :2440-2447
[79]  
Zoran D, 2011, IEEE I CONF COMP VIS, P479, DOI 10.1109/ICCV.2011.6126278