Enhanced Machine Learning-based Inter Coding for VVC

被引:4
|
作者
Benjak, Martin [1 ]
Meuel, Holger [1 ]
Laude, Thorsten [1 ]
Ostermann, Jorn [1 ]
机构
[1] Leibniz Univ Hannover, Inst Informat Verarbeitung, Hannover, Germany
来源
3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (IEEE ICAIIC 2021) | 2021年
关键词
VVC; inter coding; video coding; machine learning; recurrent neural networks;
D O I
10.1109/ICAIIC51459.2021.9415184
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an enhanced machine learning-based inter coding algorithm for VVC. Conceptually, the reference pictures from the decoded picture buffer are processed using a recurrent neural network to generate an artificial reference picture at the time instance of the currently coded picture. The network is trained using a SATD cost function to minimize the bit rate cost for the prediction error rather than the pixel-wise difference. By this we achieved average weighted BD-rate gains of 0.94%. The coding time increased about 5% for the encoder and 300% for the decoder due to the use of a neural network.
引用
收藏
页码:21 / 25
页数:5
相关论文
共 50 条
  • [21] An Enhanced Machine Learning-Based Analysis of Teaching and Learning Process for Higher Education System
    Alsafyani, Majed
    ADVANCES IN INFORMATION SYSTEMS, ARTIFICIAL INTELLIGENCE AND KNOWLEDGE MANAGEMENT, ICIKS 2023, 2024, 486 : 321 - 332
  • [22] A Hardware-Friendly Lightweight Partition Decision Algorithm for VVC Intra and Inter Coding
    Zan, Zhao
    Huang, Leilei
    Chen, Shushi
    Zeng, Xiaoyang
    Fan, Yibo
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 941 - 945
  • [23] Learning-Based Complexity Reduction Scheme for VVC Intra-Frame Prediction
    Saldanha, Mario
    Sanchez, Gustavo
    Marcon, Cesar
    Agostini, Luciano
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [24] Machine Learning-Based Fast Angular Prediction Mode Decision Technique in Video Coding
    Ryu, Sookyung
    Kang, Je-Won
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5525 - 5538
  • [25] Machine Learning-Based Coding Unit Depth Decisions for Flexible Complexity Allocation in High Efficiency Video Coding
    Zhang, Yun
    Kwong, Sam
    Wang, Xu
    Yuan, Hui
    Pan, Zhaoqing
    Xu, Long
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (07) : 2225 - 2238
  • [26] Fast CTU Partition Decision Algorithm for VVC Intra and Inter Coding
    Tang, Na
    Cao, Jian
    Liang, Fan
    Wang, Jun
    Liu, Hongmei
    Wang, Xiaoyang
    Du, Xiaorong
    2019 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2019), 2019, : 361 - 364
  • [27] Reinforcement Learning based ROI Bit Allocation for Gaming Video Coding in VVC
    Ren, Guangjie
    Liu, Zizheng
    Chen, Zhenzhong
    Liu, Shan
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [28] Machine Learning-Based Lung Cancer Classification and Enhanced Accuracy on CT Images
    Gaddala, Lalitha Kumari
    Radha, Vijaya Kumar Reddy
    Buraga, Srinivasa Rao
    Narla, Venkata Lalitha
    Kodepogu, Koteswara Rao
    Yalamanchili, Surekha
    TRAITEMENT DU SIGNAL, 2024, 41 (02) : 1073 - 1078
  • [29] Machine learning based video coding optimizations: A survey
    Zhang, Yun
    Kwong, Sam
    Wang, Shiqi
    INFORMATION SCIENCES, 2020, 506 : 395 - 423
  • [30] Machine learning-based actuation orchestration for inter-/intra-data center networks
    Spadaro, Salvatore
    Pages, Albert
    Agraz, Fernando
    2023 INTERNATIONAL CONFERENCE ON PHOTONICS IN SWITCHING AND COMPUTING, PSC, 2023,