Multi-Scale Motion Alignment and Frame Reconstruction for Efficient Deep Video Compression

被引：0

作者：

Yang, Gongning ^{[1
]}

Wei, Xiaojie ^{[1
]}

Lin, Hongbin ^{[1
]}

机构：

[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou 350116, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2024年 / 31卷

关键词：

Convolution; Decoding; Motion compensation; Video compression; Feature extraction; Encoding; Video codecs; Deep video compression; end-to-end video codec; flexible rate adjustment; video coding;

D O I：

10.1109/LSP.2024.3443516

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

As video data continues to grow, the burden on network transmission increases significantly. Efficient video compression techniques are crucial to meet the rising demand for multimedia content. In this letter, we propose a Multi-scale Motion Alignment and Frame Reconstruction-based Video Codec (MFVC) for efficient video compression. MFVC focuses on optimizing the motion compensation and video reconstruction processes within a deep video compression framework. First, we design a Multi-Scale Motion Alignment Network (MSMA-Net) to achieve precise motion compensation, which extracts multi-scale features from video frames and utilizes flow information for deformable convolution. Second, we design a Frame Reconstruction Network (FR-Net) to recover high-quality video frames, which utilizes reference information for feature enhancement without additional bitrate consumption. Moreover, to achieve smooth rate adjustment, we introduce a feature scaling technique. Experimental results show that MFVC reduces bitrate by 7.86%/48.34% compared to VVC (VTM 13.2) at the same PSNR/MS-SSIM.

引用

页码：2125 / 2129

页数：5

共 50 条

[1] Multi-Scale Warping for Video Frame Interpolation
Choi, Whan
Koh, Yeong Jun
Kim, Chang-Su
IEEE ACCESS, 2021, 9 : 150470 - 150479
[2] An Efficient Video Coding System With an Adaptive Overfitted Multi-Scale Attention Network
He, Gang
Wu, Chang
Xu, Li
Li, Lei
Xu, Ziyao
Xie, Weiying
Li, Yunsong
IEEE ACCESS, 2021, 9 : 64022 - 64032
[3] Multi-Scale Inter-Communication Spatio-Temporal Network for Video Compression Artifacts Reduction
Zhang, Tingrong
Teng, Qizhi
He, Xiaohai
Ren, Chao
Chen, Zhengxin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (03) : 1229 - 1233
[4] Multi-Scale Attention Generative Adversarial Networks for Video Frame Interpolation
Xiao, Jian
Bi, Xiaojun
IEEE ACCESS, 2020, 8 : 94842 - 94851
[5] Learning Motion-Guided Multi-Scale Memory Features for Video Shadow Detection
Lin, Junhao
Shen, Jiaxing
Yang, Xin
Fu, Huazhu
Zhang, Qing
Li, Ping
Sheng, Bin
Wang, Liansheng
Zhu, Lei
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12288 - 12300
[6] Fast reliable multi-scale motion region detection in video processing
Lu, Jiangbo
Lafruit, Gauthier
Catthoor, Francky
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 689 - +
[7] End-to-End Learnable Multi-Scale Feature Compression for VCM
Kim, Yeongwoong
Jeong, Hyewon
Yu, Janghyun
Kim, Younhee
Lee, Jooyoung
Jeong, Se Yoon
Kim, Hui Yong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3156 - 3167
[8] Hierarchical Motion-Compensated Deep Network for Video Compression
Liu, Ying
Du, Pengli
Li, Yuzhu
BIG DATA III: LEARNING, ANALYTICS, AND APPLICATIONS, 2021, 11730
[9] Multi-Model Motion Prediction for 360-Degree Video Compression
Regensky, Andy
Herglotz, Christian
Kaup, Andre
IEEE ACCESS, 2023, 11 : 117004 - 117017
[10] Enhanced Motion Compensation for Deep Video Compression
Guo, Haifeng
Kwong, Sam
Jia, Chuanmin
Wang, Shiqi
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 673 - 677

← 1 2 3 4 5 →