Video Compression Artifact Reduction via Spatio-Temporal Multi-Hypothesis Prediction

被引:31
作者
Zhang, Xinfeng [1 ]
Xiong, Ruiqin [2 ]
Lin, Weisi [1 ]
Ma, Siwei [2 ]
Liu, Jiaying [3 ]
Gao, Wen [2 ]
机构
[1] Nanyang Technol Univ, Rapid Rich Object Search Lab, Singapore 639798, Singapore
[2] Peking Univ, Inst Digital Media, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
[3] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金; 北京市自然科学基金;
关键词
Compression artifacts; block transform coding; auto-regressive; non-local estimation; multiple hypotheses; DEBLOCKING; ALGORITHM; SPARSE; DCT;
D O I
10.1109/TIP.2015.2485780
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Annoying compression artifacts exist in most of lossy coded videos at low bit rates, which are caused by coarse quantization of transform coefficients or motion compensation from distorted frames. In this paper, we propose a compression artifact reduction approach that utilizes both the spatial and the temporal correlation to form multi-hypothesis predictions from spatio-temporal similar blocks. For each transform block, three predictions with their reliabilities are estimated, respectively. The first prediction is constructed by inversely quantizing transform coefficients directly, and its reliability is determined by the variance of quantization noise. The second prediction is derived by representing each transform block with a temporal auto-regressive (TAR) model along its motion trajectory, and its corresponding reliability is estimated from local prediction errors of the TAR model. The last prediction infers the original coefficients from similar blocks in non-local regions, and its reliability is estimated based on the distribution of coefficients in these similar blocks. Finally, all the predictions are adaptively fused according to their reliabilities to restore high-quality videos. The experimental results show that the proposed method can efficiently reduce most of the compression artifacts and improve both subjective and objective quality of block transform coded videos.
引用
收藏
页码:6048 / 6061
页数:14
相关论文
共 32 条
[11]   NONLINEAR SPACE-VARIANT POSTPROCESSING OF BLOCK CODED IMAGES [J].
RAMAMURTHI, B ;
GERSHO, A .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (05) :1258-1268
[12]  
Reeve H. C. III, 1983, Proceedings of ICASSP 83. IEEE International Conference on Acoustics, Speech and Signal Processing, P1212
[13]   Overview of the High Efficiency Video Coding (HEVC) Standard [J].
Sullivan, Gary J. ;
Ohm, Jens-Rainer ;
Han, Woo-Jin ;
Wiegand, Thomas .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (12) :1649-1668
[14]   Rate-distortion optimization for video compression [J].
Sullivan, GJ ;
Wiegand, T .
IEEE SIGNAL PROCESSING MAGAZINE, 1998, 15 (06) :74-90
[15]   Postprocessing of low bit-rate block DCT coded images based on a fields of experts prior [J].
Sun, Deqing ;
Cham, Wai-Kuen .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (11) :2743-2751
[16]   Kernel regression for image processing and reconstruction [J].
Takeda, Hiroyuki ;
Farsiu, Sina ;
Milanfar, Peyman .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (02) :349-366
[17]   Adaptive Loop Filtering for Video Coding [J].
Tsai, Chia-Yang ;
Chen, Ching-Yeh ;
Yamakage, Tomoo ;
Chong, In Suk ;
Huang, Yu-Wen ;
Fu, Chih-Ming ;
Itoh, Takayuki ;
Watanabe, Takashi ;
Chujoh, Takeshi ;
Karczewicz, Marta ;
Lei, Shaw-Min .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2013, 7 (06) :934-945
[18]   Image quality assessment: From error visibility to structural similarity [J].
Wang, Z ;
Bovik, AC ;
Sheikh, HR ;
Simoncelli, EP .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (04) :600-612
[19]   Overview of the H.264/AVC video coding standard [J].
Wiegand, T ;
Sullivan, GJ ;
Bjontegaard, G ;
Luthra, A .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (07) :560-576
[20]   An efficient wavelet-based deblocking algorithm for highly compressed images [J].
Wu, SH ;
Yan, H ;
Tan, Z .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (11) :1193-1198