Multi-Grid Back-Projection Networks

被引:2
作者
Michelini, Pablo Navarrete [1 ]
Chen, Wenbin [1 ]
Liu, Hanwen [1 ]
Zhu, Dan [1 ]
Jiang, Xingqun [1 ]
机构
[1] BOE Technol Grp Co Ltd, Dept Artificial Intelligence, Beijing, Peoples R China
关键词
Image resolution; Videos; Standards; Distortion; Training; Technological innovation; Image reconstruction; Multigrid; iterative backprojection; super resolution; convolutional networks;
D O I
10.1109/JSTSP.2021.3049641
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multi-grid back-projection (MGBP) is a fully-convolutional network architecture that can learn to restore images and videos with upscaling artifacts. Using the same strategy of multi-grid partial differential equation (PDE) solvers this multiscale architecture scales computational complexity efficiently with increasing output resolutions. The basic processing block is inspired in the iterative back-projection (IBP) algorithm and constitutes a type of cross-scale residual block with feedback from low resolution references. The architecture performs in par with state-of-the-arts alternatives for regression targets that aim to recover an exact copy of a high resolution image or video from which only a downscale image is known. A perceptual quality target aims to create more realistic outputs by introducing artificial changes that can be different from a high resolution original content as long as they are consistent with the low resolution input. For this target we propose a strategy using noise inputs in different resolution scales to control the amount of artificial details generated in the output. The noise input controls the amount of innovation that the network uses to create artificial realistic details. The effectiveness of this strategy is shown in benchmarks and it is explained as a particular strategy to traverse the perception-distortion plane.
引用
收藏
页码:279 / 294
页数:16
相关论文
共 76 条
[51]   NTIRE 2019 Challenge on Video Deblurring and Super-Resolution: Dataset and Study [J].
Nah, Seungjun ;
Baik, Sungyong ;
Hong, Seokil ;
Moon, Gyeongsik ;
Son, Sanghyun ;
Timofte, Radu ;
Lee, Kyoung Mu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :1996-2005
[52]  
Park SC, 2003, IEEE SIGNAL PROC MAG, V20, P21, DOI 10.1109/MSP.2003.1203207
[53]  
Proakis J.G, 2007, DIGIT SIGNAL PROCESS, V4th ed.
[54]  
Purohit K, 2018, P EUR C COMP VIS, P132
[55]   THE STATISTICS OF NATURAL IMAGES [J].
RUDERMAN, DL .
NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1994, 5 (04) :517-548
[56]   Deep learning [J].
Rusk, Nicole .
NATURE METHODS, 2016, 13 (01) :35-35
[57]   EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis [J].
Sajjadi, Mehdi S. M. ;
Schoelkopf, Bernhard ;
Hirsch, Michael .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4501-4510
[58]   Study of Subjective and Objective Quality Assessment of Video [J].
Seshadrinathan, Kalpana ;
Soundararajan, Rajiv ;
Bovik, Alan Conrad ;
Cormack, Lawrence K. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (06) :1427-1441
[59]   Image information and visual quality [J].
Sheikh, HR ;
Bovik, AC .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (02) :430-444
[60]  
Shuhang, 2019, IEEE CVF INT C COMP, P3467