A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding

被引:269
作者
Dai, Yuanying [1 ]
Liu, Dong [1 ]
Wu, Feng [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei, Anhui, Peoples R China
来源
MULTIMEDIA MODELING (MMM 2017), PT I | 2017年 / 10132卷
关键词
Artifact reduction; Convolutional neural network (CNN); High Efficiency Video Coding (EVC); Intra coding; Post-processing;
D O I
10.1007/978-3-319-51811-4_3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Lossy image and video compression algorithms yield visually annoying artifacts including blocking, blurring, and ringing, especially at low bit-rates. To reduce these artifacts, post-processing techniques have been extensively studied. Recently, inspired by the great success of convolutional neural network (CNN) in computer vision, some researches were performed on adopting CNN in post-processing, mostly for JPEG compressed images. In this paper, we present a CNN-based post-processing algorithm for High Efficiency Video Coding (HEVC), the state-of-the-art video coding standard. We redesign a Variable-filter-size Residuelearning CNN (VRCNN) to improve the performance and to accelerate network training. Experimental results show that using our VRCNN as post-processing leads to on average 4.6% bit-rate reduction compared to HEVC baseline. The VRCNN outperforms previously studied networks in achieving higher bit-rate reduction, lower memory cost, and multiplied computational speedup.
引用
收藏
页码:28 / 39
页数:12
相关论文
共 18 条
[1]  
Bjontegaard G, 2001, document VCEG-M33
[2]   Compression Artifacts Reduction by a Deep Convolutional Network [J].
Dong, Chao ;
Deng, Yubin ;
Loy, Chen Change ;
Tang, Xiaoou .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :576-584
[3]   Learning a Deep Convolutional Network for Image Super-Resolution [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 :184-199
[4]   Sample Adaptive Offset in the HEVC Standard [J].
Fu, Chih-Ming ;
Alshina, Elena ;
Alshin, Alexander ;
Huang, Yu-Wen ;
Chen, Ching-Yeh ;
Tsai, Chia-Yang ;
Hsu, Chih-Wei ;
Lei, Shaw-Min ;
Park, Jeong-Hoon ;
Han, Woo-Jin .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (12) :1755-1764
[5]  
Girshick R., 2014, IEEE C COMP VIS PATT, DOI [DOI 10.1109/CVPR.2014.81, 10.1109/CVPR.2014.81]
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]  
JIA Y, 2014, P 22 ACM INT C MULT, DOI [DOI 10.1145/2647868.2654889, 10.1145/2647868.2654889]
[8]  
Kim J, 2016, PROC CVPR IEEE, P1637, DOI [10.1109/CVPR.2016.181, 10.1109/CVPR.2016.182]
[9]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[10]  
Nair V., 2010, PROC INT C MACH LEAR, P807, DOI DOI 10.5555/3104322.3104425