Deep Reference Generation With Multi-Domain Hierarchical Constraints for Inter Prediction

被引:16
作者
Liu, Jiaying [1 ]
Xia, Sifeng [1 ]
Yang, Wenhan [1 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100871, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
High efficient video coding (HEVC); inter prediction; frame interpolation; deep learning; multi-domain hierarchical constraints; factorized kernel convolution; NETWORK; CNN;
D O I
10.1109/TMM.2019.2961504
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Inter prediction is an important module in video coding for temporal redundancy removal, where similar reference blocks are searched from previously coded frames and employed to predict the block to be coded. Although existing video codecs can estimate and compensate for block-level motions, their inter prediction performance is still heavily affected by the remaining inconsistent pixel-wise displacement caused by irregular rotation and deformation. In this paper, we address the problem by proposing a deep frame interpolation network to generate additional reference frames in coding scenarios. First, we summarize the previous adaptive convolutions used for frame interpolation and propose a factorized kernel convolutional network to improve the modeling capacity and simultaneously keep its compact form. Second, to better train this network, multi-domain hierarchical constraints are introduced to regularize the training of our factorized kernel convolutional network. For spatial domain, we use a gradually down-sampled and up-sampled auto-encoder to generate the factorized kernels for frame interpolation at different scales. For quality domain, considering the inconsistent quality of the input frames, the factorized kernel convolution is modulated with quality-related features to learn to exploit more information from high quality frames. For frequency domain, a sum of absolute transformed difference loss that performs frequency transformation is utilized to facilitate network optimization from the view of coding performance. With the well-designed frame interpolation network regularized by multi-domain hierarchical constraints, our method surpasses HEVC on average 3.8% BD-rate saving for the luma component under the random access configuration and also obtains on average 0.83% BD-rate saving over the upcoming VVC.
引用
收藏
页码:2497 / 2510
页数:14
相关论文
共 37 条
  • [21] Onsite Early Prediction of PGA Using CNN With Multi-Scale and Multi-Domain P-Waves as Input
    Hsu, Ting-Yu
    Huang, Chao-Wen
    FRONTIERS IN EARTH SCIENCE, 2021, 9
  • [22] mIDEEPre: Multi-Functional Enzyme Function Prediction With Hierarchical Multi-Label Deep Learning
    Zou, Zhenzhen
    Tian, Shuye
    Gao, Xin
    Li, Yu
    FRONTIERS IN GENETICS, 2019, 9
  • [23] A fusion framework based on multi-domain features and deep learning features of phonocardiogram for coronary artery disease detection
    Li, Han
    Wang, Xinpei
    Liu, Changchun
    Zeng, Qiang
    Zheng, Yansong
    Chu, Xi
    Yao, Lianke
    Wang, Jikuo
    Jiao, Yu
    Karmakar, Chandan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 120
  • [24] Multi-Domain Joint Synthetic Aperture Radar Ship Detection Method Integrating Complex Information with Deep Learning
    Tian, Chaoyang
    Lv, Zongsen
    Xue, Fengli
    Wu, Xiayi
    Liu, Dacheng
    REMOTE SENSING, 2024, 16 (19)
  • [25] Domain generalization for rotating machinery real-time remaining useful life prediction via multi-domain orthogonal degradation feature exploration
    Shang, Jie
    Xu, Danyang
    Qiu, Haobo
    Jiang, Chen
    Gao, Liang
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2025, 223
  • [26] Deep Learning-Based Multi-Domain Framework for End-to-End Services in 5G Networks
    Yanjia Tian
    Yan Dong
    Xiang Feng
    Journal of Grid Computing, 2023, 21
  • [27] Deep Learning-Based Multi-Domain Framework for End-to-End Services in 5G Networks
    Tian, Yanjia
    Dong, Yan
    Feng, Xiang
    JOURNAL OF GRID COMPUTING, 2023, 21 (04)
  • [28] Individualized prediction of multi-domain intelligence quotient in bipolar disorder patients using resting-state functional connectivity
    Li, Xiaoyu
    Wei, Wei
    Qian, Linze
    Li, Xiaojing
    Li, Mingli
    Kakkos, Ioannis
    Wang, Qiang
    Yu, Hua
    Guo, Wanjun
    Ma, Xiaohong
    Matsopoulos, George K.
    Zhao, Liansheng
    Deng, Wei
    Sun, Yu
    Li, Tao
    BRAIN RESEARCH BULLETIN, 2025, 222
  • [29] DeepMDR: A Deep-Learning-Assisted Control Plane System for Scalable, Protocol-Independent, and Multi-Domain Network Automation
    Li, Deyun
    Fang, Hongqiang
    Zhang, Xu
    Qi, Jin
    Zhu, Zuqing
    IEEE COMMUNICATIONS MAGAZINE, 2021, 59 (03) : 62 - 68
  • [30] Evaluating Deep Learning Techniques for Blind Image Super-Resolution within a High-Scale Multi-Domain Perspective
    de Santiago Junior, Valdivino Alexandre
    AI, 2023, 4 (03) : 598 - 619