Wyner-Ziv Video Coding using Hadamard Transform and Deep Learning

被引:0
作者
Kouma, Jean-Paul [1 ]
Soderstrom, Ulrik [1 ]
机构
[1] Umea Univ, Dept Appl Phys & Elect, S-90187 Umea, Sweden
关键词
Wyner-Ziv; video coding; rate distortion; Hadamard transform; Deep learning; Expectation Maximization;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Predictive schemes are current standards of video coding. Unfortunately they do not apply well for lightweight devices such as mobile phones. The high encoding complexity is the bottleneck of the Quality of Experience (QoE) of a video conversation between mobile phones. A considerable amount of research has been conducted towards tackling that bottleneck. Most of the schemes use the so-called Wyner-Ziv Video Coding Paradigm, with results still not comparable to those of predictive coding. This paper shows a novel approach for Wyner-Ziv video compression. It is based on the Reinforcement Learning and Hadamard Transform. Our Scheme shows very promising results.
引用
收藏
页码:582 / 589
页数:8
相关论文
共 14 条
  • [1] Transform-domain Wyner-Ziv codec for video
    Aaron, A
    Rane, S
    Setton, E
    Girod, B
    [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2004, PTS 1 AND 2, 2004, 5308 : 520 - 528
  • [2] Aaron A, 2002, CONF REC ASILOMAR C, P240
  • [3] AARON A, 2004, P PICT COD S
  • [4] Cornish Christopher John, 1989, LEARNING DELAYED REW
  • [5] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [6] Fast H.264 Intra-prediction mode selection using joint spatial and transform domain features
    Kim, Changsung
    Shih, Hsuan-Huei
    Kuo, C. -C. Jay
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2006, 17 (02) : 291 - 310
  • [7] Kouma JP, 2011, NEW APPROACHES TO CHARACTERIZATION AND RECOGNITION OF FACES, P29
  • [8] Motion feature and Hadamard coefficient-based fast multiple reference frame motion estimation for H.264
    Liu, Zhenyu
    Li, Lingfeng
    Song, Yang
    Li, Shen
    Goto, Satoshi
    Ikenaga, Takeshi
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2008, 18 (05) : 620 - 632
  • [9] Mnih V., 2013, PLAYING ATARI DEEP R
  • [10] PRISM: A video coding paradigm with motion estimation at the decoder
    Puri, Rohit
    Majumdar, Abhik
    Ramchandran, Karman
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (10) : 2436 - 2448