A Progressive Architecture for Learned Fractional Downsampling

被引:3
作者
Chen, Li-Heng [1 ]
Bampis, Christos G. [2 ]
Li, Zhi [2 ]
Sole, Joel [2 ]
Bovik, Alan C. [1 ]
机构
[1] Univ Texas Austin, Lab Image & Video Engn, Austin, TX 78712 USA
[2] Netflix Inc, Los Gatos, CA USA
来源
2021 PICTURE CODING SYMPOSIUM (PCS) | 2021年
关键词
downsampling; convolutional neural networks; adaptive video streaming; VIDEO QUALITY ASSESSMENT; IMAGE;
D O I
10.1109/PCS50896.2021.9477407
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In many image and video processing applications, the ability to resize by a fractional factor, such as from 1080p to 720p, is essential. However, conventional CNN layers can only be used to alter the resolution of their inputs with integer scale factors. In this paper, we propose a downsampling network architecture that progressively reconstructs residuals at different scales. In particular, the aforementioned problem is solved by combining an upsampling sub-network and a downsampling sub-network, both with integer scale factor. As an application, we apply the proposed downsampling network to an adaptive bitrate video streaming scenario. We extensively evaluate with different video codecs and upsampling algorithms to show the generality of our model. Our experimental results show that improvements in coding efficiency over the conventional Lanczos downsampling and state-of-the-art methods are attained, measured in different perceptual video quality models on large-resolution test videos.
引用
收藏
页码:46 / 50
页数:5
相关论文
共 44 条
[1]   Video Compression Based on Spatio-Temporal Resolution Adaptation [J].
Afonso, Mariana ;
Zhang, Fan ;
Bull, David R. .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (01) :275-280
[2]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[3]  
[Anonymous], 2016, ACM T GRAPHIC
[4]  
Balle J., 2017, P INT C LEARN REPR, P1
[5]   SpEED-QA: Spatial Efficient Entropic Differencing for Image and Video Quality [J].
Bampis, Christos G. ;
Gupta, Praful ;
Soundararajan, Rajiv ;
Bovik, Alan C. .
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (09) :1333-1337
[6]  
Bhat M., 2020, 2020 IEEE INT C MULT, P1
[7]  
Bjontegaard G., 2001, VCEGM33 ITUT VID COD
[8]   Super-resolution through neighbor embedding [J].
Chang, H ;
Yeung, DY ;
Xiong, Y .
PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, 2004, :275-282
[9]  
Chen C, 2018, IEEE IMAGE PROC, P3269, DOI 10.1109/ICIP.2018.8451307
[10]   Image Super-Resolution Using Deep Convolutional Networks [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307