HDVC: Deep Video Compression With Hyperprior-Based Entropy Coding

被引:4
作者
Hu, Yusong [1 ]
Jung, Cheolkon [1 ]
Qin, Qipu [1 ]
Han, Jiang [1 ]
Liu, Yang [2 ]
Li, Ming [2 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Guangdong OPPO Mobile Telecommun Corp Ltd, Dongguan 523860, Peoples R China
基金
中国国家自然科学基金;
关键词
Image coding; Video compression; Optical imaging; Adaptive optics; Estimation; Entropy coding; Image reconstruction; Deep learning; Hyperprior; entropy coding; learned video compression; deep learning; end-to-end;
D O I
10.1109/ACCESS.2024.3350643
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose deep video compression with hyperprior-based entropy coding, named HDVC. The proposed method is based on the deep video compression (DVC) framework that replaces traditional block-based video compression with end-to-end video compression based on deep learning, aiming to improve compression efficiency and reduce computational complexity while maintaining visual quality. Based on the DVC framework, we introduce hyperprior-based entropy coding into motion compression and optimize motion vector estimation (i.e. optical flow estimation) using window attention and fast residual channel attention. Moreover, we introduce residual channel attention intermediate module into both encoding and decoding to enhance residuals and the quality of reconstructed frames. We adopt hyperprior-based entropy coding in residual compression to model feature distribution. Besides, we use learned image compression for intraframe coding based on fast residual channel attention network to generate reference frames. Experimental results show that the proposed method achieves better PSNR and MS-SSIM performance than both traditional block-based and recent deep learning-based video compression methods on UVG dataset.
引用
收藏
页码:17541 / 17551
页数:11
相关论文
共 32 条
[1]  
Ball‚ J, 2018, Arxiv, DOI arXiv:1802.01436
[2]  
Begaint J., 2020, arXiv
[3]   Overview of the Versatile Video Coding (VVC) Standard and its Applications [J].
Bross, Benjamin ;
Wang, Ye-Kui ;
Ye, Yan ;
Liu, Shan ;
Chen, Jianle ;
Sullivan, Gary J. ;
Ohm, Jens-Rainer .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) :3736-3764
[4]   Learning for Video Compression [J].
Chen, Zhibo ;
He, Tianyu ;
Jin, Xin ;
Wu, Feng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (02) :566-576
[5]  
Cheng ZX, 2020, PROC CVPR IEEE, P7936, DOI 10.1109/CVPR42600.2020.00796
[6]   Neural Inter-Frame Compression for Video Coding [J].
Djelouah, Abdelaziz ;
Campos, Joaquim ;
Schaub-Meyer, Simone ;
Schroers, Christopher .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6430-6438
[7]  
Han S, 2016, Arxiv, DOI arXiv:1510.00149
[8]  
Hendrycks D, 2020, Arxiv, DOI arXiv:1606.08415
[9]   FVC: A New Framework towards Deep Video Compression in Feature Space [J].
Hu, Zhihao ;
Lu, Guo ;
Xu, Dong .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1502-1511
[10]  
Hu ZH, 2020, Img Proc Comp Vis Re, V12347, P193, DOI 10.1007/978-3-030-58536-5_12