Sparse/DCT (S/DCT) Two-Layered Representation of Prediction Residuals for Video Coding

Cited by: 24
Authors
Kang, Je-Won [1, 2]
Gabbouj, Moncef [3]
Kuo, C.-C. Jay [4, 5]
Affiliations
[1] Qualcomm Technol Inc, Multimedia R&D, San Diego, CA 92121 USA
[2] Qualcomm Technol Inc, Standard Team, San Diego, CA 92121 USA
[3] Tampere Univ Technol, Dept Signal Proc, Tampere 33720, Finland
[4] Univ So Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90089 USA
[5] Univ So Calif, Signal & Image Proc Inst, Los Angeles, CA 90089 USA
Keywords
rho-domain rate model; discrete cosine transform (DCT); high efficiency video coding (HEVC); multilayered coding; overcomplete dictionary based video coding; residual coding; sparse representation; IMAGE; ALGORITHM
DOI
10.1109/TIP.2013.2256917
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose a cascaded sparse/DCT (S/DCT) two-layer representation of prediction residuals and implement it on top of the state-of-the-art High Efficiency Video Coding (HEVC) standard. First, a dictionary is adaptively trained to capture characteristic patterns of residual signals, so that a large portion of the energy in a structured residual can be coded efficiently via sparse coding. The sparse representation alone is observed to be less effective in rate-distortion (R-D) performance at higher bit rates because of the side-information overhead. To overcome this problem, a DCT representation is cascaded as the second stage and applied to the remaining signal to improve coding efficiency. The two representations complement each other. Experimental results demonstrate that the proposed algorithm outperforms the HEVC reference codec HM5.0 under the common test conditions.
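The cascade described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes 1-D residual vectors rather than 2-D blocks, uses plain matching pursuit over a small hand-supplied set of unit-norm atoms in place of the adaptively trained dictionary, leaves the sparse gains unquantized, and omits entropy coding and rate-distortion decisions. All function and parameter names (`matching_pursuit`, `sdct_encode`, `sdct_decode`, `atoms`, `qstep`) are hypothetical.

```python
import math

def dct_basis(n):
    """Rows of the orthonormal DCT-II basis for length-n signals."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matching_pursuit(signal, atoms, n_atoms):
    """Layer 1: greedily pick unit-norm atoms.

    Returns the chosen (atom index, gain) pairs and the remaining signal.
    """
    remainder = list(signal)
    picks = []
    for _ in range(n_atoms):
        gains = [dot(atom, remainder) for atom in atoms]
        j = max(range(len(atoms)), key=lambda idx: abs(gains[idx]))
        picks.append((j, gains[j]))
        remainder = [r - gains[j] * a for r, a in zip(remainder, atoms[j])]
    return picks, remainder

def sdct_encode(residual, atoms, n_atoms, qstep):
    """Sparse stage first, then DCT plus uniform quantization of what is left."""
    picks, remainder = matching_pursuit(residual, atoms, n_atoms)
    basis = dct_basis(len(residual))
    levels = [round(dot(row, remainder) / qstep) for row in basis]
    return picks, levels

def sdct_decode(picks, levels, atoms, qstep):
    """Sum the sparse-layer atoms and the inverse DCT of the dequantized levels."""
    n = len(levels)
    basis = dct_basis(n)
    out = [0.0] * n
    for j, gain in picks:
        out = [o + gain * a for o, a in zip(out, atoms[j])]
    for level, row in zip(levels, basis):
        out = [o + level * qstep * b for o, b in zip(out, row)]
    return out
```

In this sketch the first layer removes the energy that matches a dictionary atom, so the second layer only has to quantize the DCT coefficients of a smaller remaining signal. Because both the atoms and the DCT rows are unit-norm, the sparse stage never increases the signal energy, and the reconstruction error is bounded by the quantization step `qstep`.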
Pages: 2711-2722
Number of pages: 12