End-to-end video compression for surveillance and conference videos

被引:0
作者
Shenhao Wang
Yu Zhao
Han Gao
Mao Ye
Shuai Li
机构
[1] University of Electronic Science and Technology of China,School of Physics
[2] University of Electronic Science and Technology of China,School of Computer Science and Engineering
[3] Shandong University,School of Information Communication
来源
Multimedia Tools and Applications | 2022年 / 81卷
关键词
Deep learning; End-to-end video compression; Surveillance and conference videos; Online update;
D O I
暂无
中图分类号
学科分类号
摘要
The storage and transmission tasks of surveillance and conference videos are an important branch of video compression. Since surveillance and conference videos have strong inter-frame correlation, considerable continuity at the image level and motion level between the consecutive frames exists. However, traditional video codec networks cannot fully use the characteristics of surveillance and conference videos during compression. Therefore, based on the DVC video codec framework, we propose a “MV residual + MV optimization” coding strategy for surveillance and conference videos to further reduce the compression rate and improve the quality of compressed video frames. During the testing stage, the online update strategy is promoted, which adapts the network’s parameters to different surveillance and conference videos. Our contribution is to propose an optical flow residual coding method for videos with strong inter-frame correlation, implement optical flow optimization at decoding end and online update strategy at the encoding end. Experiments show that our method can outperform DVC framework, especially on CUHK Square surveillance video with 1.2dB improvement.
引用
收藏
页码:42713 / 42730
页数:17
相关论文
共 46 条
[1]  
Krizhevsky A(2012)Imagenet classification with deep convolutional neural networks Adv Neur Inform Process Syst 25 1097-1105
[2]  
Sutskever I(1998)Gradient-based learning applied to document recognition Proc IEEE 86 2278-2324
[3]  
Hinton GE(2018)Fully connected network-based intra prediction for image coding IEEE Trans Image Process 27 3236-3247
[4]  
LeCun Y(2003)Context-based adaptive binary arithmetic coding in the h. 264/avc video compression standard IEEE Trans Circ Syst Video Technol 13 620-636
[5]  
Bottou L(2020)Motion segmentation-based surveillance video compression using adaptive particle swarm optimization Neural Comput Applic 32 11443-11457
[6]  
Bengio Y(2001)The jpeg 2000 still image compression standard IEEE Signal Process Mag 18 36-58
[7]  
Haffner P(2020)SP-GAN: self-growing and pruning generative adversarial networks IEEE Trans Neural Netw Learn Syst 32 2458-2469
[8]  
Li J(2012)Overview of the high efficiency video coding (HEVC) standard IEEE Trans Circ Syst Video Technol 22 1649-1668
[9]  
Li B(1992)The jpeg still picture compression standard IEEE Trans Consum Electron 38 xviii-xxxiv
[10]  
Xu J(2021)Foreground-background parallel compression with residual encoding for surveillance video IEEE Trans Circuits Syst Video Technol 31 2711-2724