Conditional Entropy Coding for Efficient Video Compression

被引:30
作者
Liu, Jerry [1 ]
Wang, Shenlong [1 ,2 ]
Ma, Wei-Chiu [1 ,3 ]
Shah, Meet [1 ]
Hu, Rui [1 ]
Dhawan, Pranaab [1 ]
Urtasun, Raquel [1 ,2 ]
机构
[1] Uber ATG, Pittsburgh, PA 15201 USA
[2] Univ Toronto, Toronto, ON, Canada
[3] MIT, Cambridge, MA 02139 USA
来源
COMPUTER VISION - ECCV 2020, PT XVII | 2020年 / 12362卷
关键词
D O I
10.1007/978-3-030-58520-4_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a very simple and efficient video compression framework that only focuses on modeling the conditional entropy between frames. Unlike prior learning-based approaches, we reduce complexity by not performing any form of explicit transformations between frames and assume each frame is encoded with an independent state-of-the-art deep image compressor. We first show that a simple architecture modeling the entropy between the image latent codes is as competitive as other neural video compression works and video codecs while being much faster and easier to implement. We then propose a novel internal learning extension on top of this architecture that brings an additional similar to 10% bitrate savings without trading off decoding speed. Importantly, we show that our approach outperforms H.265 and other deep learning baselines in MS-SSIM on higher bitrate UVG video, and against all video codecs on lower framerates, while being thousands of times faster in decoding than deep models utilizing an autoregressive entropy model.
引用
收藏
页码:453 / 468
页数:16
相关论文
共 39 条
[1]  
Balle J., 2017, INT C LEARNING REPRE, P1
[2]  
Balle J., 2015, arXiv
[3]  
Balle Johannes, 2018, 6 INT C LEARN REPR
[4]  
Campos J., 2019, Content adaptive optimization for neural image compression
[5]  
Carreira J., 2017, A new model and the kinetics dataset
[6]  
cdvl, Consumer digital video library
[7]   Neural Inter-Frame Compression for Video Coding [J].
Djelouah, Abdelaziz ;
Campos, Joaquim ;
Schaub-Meyer, Simone ;
Schroers, Christopher .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6430-6438
[8]  
Glasner D, 2009, IEEE I CONF COMP VIS, P349, DOI 10.1109/ICCV.2009.5459271
[9]   Video Compression With Rate-Distortion Autoencoders [J].
Habibian, Amirhossein ;
van Rozendaal, Ties ;
Tomczak, Jakub M. ;
Cohen, Taco S. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7032-7041
[10]  
Han J., 2019, Deep generative video compression