Controllable Image Processing via Adaptive FilterBank Pyramid

被引:14
作者
Chen, Dongdong [1 ]
Fan, Qingnan [2 ]
Liao, Jing [3 ]
Aviles-Rivero, Angelica [4 ,5 ]
Yuan, Lu [6 ]
Yu, Nenghai [1 ]
Hua, Gang [7 ]
机构
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230026, Peoples R China
[2] Stanford Univ, Comp Sci Dept, Stanford, CA 94305 USA
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[4] Univ Cambridge, DAMTP, Cambridge CB2 1TN, England
[5] Univ Cambridge, DPMMS, Cambridge CB2 1TN, England
[6] Microsoft Cloud AI, Redmond, WA 98052 USA
[7] Wormpex AI Res LLC, Bellevue, WA 98004 USA
基金
英国工程与自然科学研究理事会;
关键词
Tuning; Task analysis; Image processing; Filter banks; Smoothing methods; Adaptation models; Adaptive systems; Controllable image processing; image restoration; adaptive filterbank;
D O I
10.1109/TIP.2020.3009844
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional image processing operators often provide some control parameters to tweak the final results. Recently, different convolutional neural networks have been used to approximate or improve these operators. However, in those methods, one single model can only handle one operator of a specific parameter value and does not support parameter tuning. In this paper, we propose a new plugin module, "Adaptive Filterbank Pyramid", which can be inserted into a backbone network to support multiple operators and continuous parameter tuning. Our module explicitly represents one operator with one filterbank pyramid. To generate the results of a specific operator, the corresponding filterbank pyramid is convolved with the intermediate feature pyramid produced by the backbone network. The weights of the filterbank pyramid are directly regressed by another sub-network, which is jointly trained with the backbone network and adapted to the input parameter, thus enabling continuous parameter tuning. We applied the proposed module for a large variety of image processing tasks, including image smoothing, image denoising, image deblocking, image enhancement and neural style transfer. Experiments show that our method is generalized to different types of image processing tasks and different backbone network structures. Compared to the single-operator-single-parameter baseline, our method can produce comparable results but is significantly more efficient in both training and testing.
引用
收藏
页码:8043 / 8054
页数:12
相关论文
共 32 条
[1]   A non-local algorithm for image denoising [J].
Buades, A ;
Coll, B ;
Morel, JM .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :60-65
[2]   Gated Context Aggregation Network for Image Dehazing and Deraining [J].
Chen, Dongdong ;
He, Mingming ;
Fan, Qingnan ;
Liao, Jing ;
Zhang, Liheng ;
Hou, Dongdong ;
Yuan, Lu ;
Hua, Gang .
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1375-1383
[3]   Coherent Online Video Style Transfer [J].
Chen, Dongdong ;
Liao, Jing ;
Yuan, Lu ;
Yu, Nenghai ;
Hua, Gang .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1114-1123
[4]   StyleBank: An Explicit Representation for Neural Image Style Transfer [J].
Chen, Dongdong ;
Yuan, Lu ;
Liao, Jing ;
Yu, Nenghai ;
Hua, Gang .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2770-2779
[5]   Fast Image Processing with Fully-Convolutional Networks [J].
Chen, Qifeng ;
Xu, Jia ;
Koltun, Vladlen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2516-2525
[6]   Trainable Nonlinear Reaction Diffusion: A Flexible Framework for Fast and Effective Image Restoration [J].
Chen, Yunjin ;
Pock, Thomas .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1256-1272
[7]   Image denoising by sparse 3-D transform-domain collaborative filtering [J].
Dabov, Kostadin ;
Foi, Alessandro ;
Katkovnik, Vladimir ;
Egiazarian, Karen .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (08) :2080-2095
[8]  
Doersch C., 2016, ARXIV
[9]   Compression Artifacts Reduction by a Deep Convolutional Network [J].
Dong, Chao ;
Deng, Yubin ;
Loy, Chen Change ;
Tang, Xiaoou .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :576-584
[10]  
Durand F, 2002, ACM T GRAPHIC, V21, P257, DOI 10.1145/566570.566574