Deep Feature Domain Motion Estimation and Multi-Layer Multi-Hypothesis Motion Compensation Net for Video Compression Codec

被引:0
作者
Yang C. [1 ]
Lü Z. [1 ]
机构
[1] School of Electronic and Information Engineering, South China University of Technology, Guangzhou
来源
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science) | 2022年 / 50卷 / 10期
关键词
Codec network; Deep learning; Motion estimation; Multi-hypothesis prediction; Video compression;
D O I
10.12141/j.issn.1000-565X.220221
中图分类号
学科分类号
摘要
Traditional video compression coding methods are widely used. In order to further improve the compression performance, research on deep learning-based video compression coding methods has received increasing attention. Existing deep learning video compression coding methods realize motion compensation based on optical flow, which will produce artifacts during the optical flow alignment process, reducing the accuracy of prediction. This paper proposed a motion estimation idea in the deep feature domain, and designed a corresponding neural network to extract motion information in the deep feature domain. On this basis, it proposed a multi-layer multi-hypothesis prediction motion compensation network. By using the multi-hypothesis prediction module in the deep feature domain, the shallow feature domain and the pixel domain, the accuracy of motion compensation was improved, thereby improving the overall rate-distortion performance. Simulation results show that the inter-frame prediction results of the algorithm in the paper mitigate artifacts and the visual effect is significantly better than optical flow alignment. At the same time, the proposed algorithm achieves better rate-distortion performance compared with traditional H.264 and H.265 methods and single-frame reference methods DVC and DVCpro based on deep learning. Compared with the DCVC method at the forefront of research, the algorithm reduces the coding time by approximately 26.8% while the rate distortion performance is similar. Taking the H.264 encoding result as the benchmark, under the condition of the same bit rate, the decoding quality was improved by 3.73 dB, 4.76 dB and 2.65 dB on HEVC test sequences ClassB, ClassD and ClassE. The simulation experiment results show that, when compressing and coding video sequences, the algorithm proposed in the paper can improve the accuracy of motion compensation prediction frames, reduce the prediction error, shortens the residual signal compression coding code stream and improve the overall rate distortion performance. © 2022, Editorial Department, Journal of South China University of Technology. All right reserved.
引用
收藏
页码:51 / 61
页数:10
相关论文
共 24 条
[1]  
WIEGAND T, SULLIVAN G J, BJONTEGAARD G, Et al., Overview of the H.264/AVC video coding standard [J], IEEE Transactions on Circuits and Systems for Video Technology, 13, 7, pp. 560-576, (2003)
[2]  
SULLIVAN G J, OHM J R, HAN W J, Et al., Overview of the high efficiency video coding (HEVC) standard [J], IEEE Transactions on Circuits and Systems for Video Technology, 22, 12, pp. 1649-1668, (2012)
[3]  
LU G, OUYANG W, XU D, Et al., DVC:An end-to-end deep video compression framework [C], Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11006-11015, (2019)
[4]  
LU G, ZHANG X, OUYANG W, Et al., An end-to-end learning framework for video compression [J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 10, pp. 3292-3308, (2021)
[5]  
YANG X, YANG C., ImrNet:An iterative motion compensation and residual reconstruction network for video compressed sensing [C], Proceedings of ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 2350-2354, (2021)
[6]  
WEI Z, YANG C, XUAN Y., Efficient video compressed sensing reconstruction via exploiting spatial-temporal correlation with measurement constraint [C], Proceedings of 2021 IEEE International Conference on Multimedia and Expo, pp. 1-6, (2021)
[7]  
XUAN Yunyi, YANG Chunling, Two-stage recursive enhancement reconstruction based on video inter-frame group sparse representation in compressed video sensing, Acta Electronica Sinica, 49, 3, pp. 435-442, (2021)
[8]  
HU Z, CHEN Z, XU D, Et al., Improving deep video compression by resolution-adaptive flow coding [C], Proceedings of European Conference on Computer Vision, pp. 193-209, (2020)
[9]  
LU G, CAI C, ZHANG X, Et al., Content adaptive and error propagation aware deep video compression [C], Proceedings of European Conference on Computer Vision, pp. 456-472, (2020)
[10]  
LIN J, LIU D, LI H, Et al., M-LVC:Multiple frames prediction for learned video compression [C], Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3546-3554, (2020)