Multi-Scale Motion Alignment and Frame Reconstruction for Efficient Deep Video Compression

被引:0
|
作者
Yang, Gongning [1 ]
Wei, Xiaojie [1 ]
Lin, Hongbin [1 ]
机构
[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou 350116, Peoples R China
关键词
Convolution; Decoding; Motion compensation; Video compression; Feature extraction; Encoding; Video codecs; Deep video compression; end-to-end video codec; flexible rate adjustment; video coding;
D O I
10.1109/LSP.2024.3443516
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As video data continues to grow, the burden on network transmission increases significantly. Efficient video compression techniques are crucial to meet the rising demand for multimedia content. In this letter, we propose a Multi-scale Motion Alignment and Frame Reconstruction-based Video Codec (MFVC) for efficient video compression. MFVC focuses on optimizing the motion compensation and video reconstruction processes within a deep video compression framework. First, we design a Multi-Scale Motion Alignment Network (MSMA-Net) to achieve precise motion compensation, which extracts multi-scale features from video frames and utilizes flow information for deformable convolution. Second, we design a Frame Reconstruction Network (FR-Net) to recover high-quality video frames, which utilizes reference information for feature enhancement without additional bitrate consumption. Moreover, to achieve smooth rate adjustment, we introduce a feature scaling technique. Experimental results show that MFVC reduces bitrate by 7.86%/48.34% compared to VVC (VTM 13.2) at the same PSNR/MS-SSIM.
引用
收藏
页码:2125 / 2129
页数:5
相关论文
共 50 条
  • [1] Multi-Scale Warping for Video Frame Interpolation
    Choi, Whan
    Koh, Yeong Jun
    Kim, Chang-Su
    IEEE ACCESS, 2021, 9 : 150470 - 150479
  • [2] An Efficient Video Coding System With an Adaptive Overfitted Multi-Scale Attention Network
    He, Gang
    Wu, Chang
    Xu, Li
    Li, Lei
    Xu, Ziyao
    Xie, Weiying
    Li, Yunsong
    IEEE ACCESS, 2021, 9 : 64022 - 64032
  • [3] Multi-Scale Inter-Communication Spatio-Temporal Network for Video Compression Artifacts Reduction
    Zhang, Tingrong
    Teng, Qizhi
    He, Xiaohai
    Ren, Chao
    Chen, Zhengxin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (03) : 1229 - 1233
  • [4] Multi-Scale Attention Generative Adversarial Networks for Video Frame Interpolation
    Xiao, Jian
    Bi, Xiaojun
    IEEE ACCESS, 2020, 8 : 94842 - 94851
  • [5] Learning Motion-Guided Multi-Scale Memory Features for Video Shadow Detection
    Lin, Junhao
    Shen, Jiaxing
    Yang, Xin
    Fu, Huazhu
    Zhang, Qing
    Li, Ping
    Sheng, Bin
    Wang, Liansheng
    Zhu, Lei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12288 - 12300
  • [6] Fast reliable multi-scale motion region detection in video processing
    Lu, Jiangbo
    Lafruit, Gauthier
    Catthoor, Francky
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 689 - +
  • [7] End-to-End Learnable Multi-Scale Feature Compression for VCM
    Kim, Yeongwoong
    Jeong, Hyewon
    Yu, Janghyun
    Kim, Younhee
    Lee, Jooyoung
    Jeong, Se Yoon
    Kim, Hui Yong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3156 - 3167
  • [8] Hierarchical Motion-Compensated Deep Network for Video Compression
    Liu, Ying
    Du, Pengli
    Li, Yuzhu
    BIG DATA III: LEARNING, ANALYTICS, AND APPLICATIONS, 2021, 11730
  • [9] Multi-Model Motion Prediction for 360-Degree Video Compression
    Regensky, Andy
    Herglotz, Christian
    Kaup, Andre
    IEEE ACCESS, 2023, 11 : 117004 - 117017
  • [10] Enhanced Motion Compensation for Deep Video Compression
    Guo, Haifeng
    Kwong, Sam
    Jia, Chuanmin
    Wang, Shiqi
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 673 - 677