Deep Bilateral Learning for Real-Time Image Enhancement

被引:609
作者
Gharbi, Michael [1 ]
Chen, Jiawen [2 ]
Barron, Jonathan T. [2 ]
Hasinoff, Samuel W. [2 ]
Durand, Fredo [1 ,3 ,4 ]
机构
[1] MIT CSAIL, Cambridge, MA 02139 USA
[2] Google Res, Mountain View, CA USA
[3] INRIA, Le Chesnay, France
[4] Univ Cote dAzur, Nice, France
来源
ACM TRANSACTIONS ON GRAPHICS | 2017年 / 36卷 / 04期
关键词
real-time image processing; deep learning; data-driven methods; convolutional neural networks;
D O I
10.1145/3072959.3073592
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Performance is a critical challenge in mobile image processing. Given a reference imaging pipeline, or even human-adjusted pairs of images, we seek to reproduce the enhancements and enable real-time evaluation. For this, we introduce a new neural network architecture inspired by bilateral grid processing and local affine color transforms. Using pairs of input/output images, we train a convolutional neural network to predict the coefficients of a locally-affine model in bilateral space. Our architecture learns to make local, global, and content-dependent decisions to approximate the desired image transformation. At runtime, the neural network consumes a low-resolution version of the input image, produces a set of affine transformations in bilateral space, upsamples those transformations in an edge-preserving fashion using a new slicing node, and then applies those upsampled transformations to the full-resolution image. Our algorithm processes high-resolution images on a smartphone in milliseconds, provides a real-time viewfinder at 1080p resolution, and matches the quality of state-of-the-art approximation techniques on a large class of image operators. Unlike previous work, our model is trained off-line from data and therefore does not require access to the original operator at runtime. This allows our model to learn complex, scene-dependent transformations for which no reference implementation is available, such as the photographic edits of a human retoucher.
引用
收藏
页数:12
相关论文
共 46 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2016, CORR
[3]  
[Anonymous], 2010, COMPUTER GRAPHICS FO
[4]  
[Anonymous], 2016, ACM TOG
[5]  
[Anonymous], ACM TOG
[6]  
[Anonymous], ACM TOG
[7]  
[Anonymous], 2015, ICLR
[8]  
[Anonymous], 2016, EUR C COMP VIS
[9]  
[Anonymous], TPAMI
[10]  
[Anonymous], 1998, BILATERAL FILTERING