Multi-feature fusion for efficient inter prediction in versatile video coding

被引:1
作者
Wei, Xiaojie [1 ]
Zeng, Hongji [1 ]
Fang, Ying [1 ]
Lin, Liqun [1 ]
Chen, Weiling [1 ]
Xu, Yiwen [1 ]
机构
[1] Fuzhou Univ, Fuzhou Coll Town, Fujian Key Lab Intelligent Proc & Wireless Transmi, 2 North Wulong River Ave, Fuzhou, Fujian, Peoples R China
关键词
Versatile video coding; Complexity optimization; Block partition; CNN; Multi-feature fusion; CU PARTITION; OPTIMIZATION; DECISION;
D O I
10.1007/s11554-024-01564-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Versatile Video Coding (VVC) introduces various advanced coding techniques and tools, such as QuadTree with nested Multi-type Tree (QTMT) partition structure, and outperforms High Efficiency Video Coding (HEVC) in terms of coding performance. However, the improvement of coding performance leads to an increase in coding complexity. In this paper, we propose a multi-feature fusion framework that integrates the rate-distortion-complexity optimization theory with deep learning techniques to reduce the complexity of QTMT partition for VVC inter-prediction. Firstly, the proposed framework extracts features of luminance, motion, residuals, and quantization information from video frames and then performs feature fusion through a convolutional neural network to predict the minimum partition size of Coding Units (CUs). Next, a novel rate-distortion-complexity loss function is designed to balance computational complexity and compression performance. Then, through this loss function, we can adjust various distributions of rate-distortion-complexity costs. This adjustment impacts the prediction bias of the network and sets constraints on different block partition sizes to facilitate complexity adjustment. Compared to anchor VTM-\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}13.0, the proposed method saves the encoding time by 10.14% to 56.62%, with BDBR increase confined to a range of 0.31% to 6.70%. The proposed method achieves a broader range of complexity adjustments while ensuring coding performance, surpassing both traditional methods and deep learning-based methods.
引用
收藏
页数:14
相关论文
共 50 条
[31]   Multi-feature fusion RFE random forest for schizophrenia classification and treatment response prediction [J].
Wang, Chang ;
Zhang, Rui ;
Zhang, Jiyuan ;
Ren, Yaning ;
Pang, Ting ;
Chen, Xiangyu ;
Li, Xiao ;
Zhao, Zongya ;
Yang, Yongfeng ;
Ren, Wenjie ;
Yu, Yi .
SCIENTIFIC REPORTS, 2025, 15 (01)
[32]   Remote anomaly detection for underwater gliders based on multi-feature fusion [J].
Yang, Ming ;
Shen, Zhaowei ;
Wang, Yanhui ;
Chen, Jun ;
Han, Wei ;
Yang, Shaoqiong .
OCEAN ENGINEERING, 2023, 284
[33]   Person Reidentification via Multi-Feature Fusion With Adaptive Graph Learning [J].
Zhou, Runwu ;
Chang, Xiaojun ;
Shi, Lei ;
Shen, Yi-Dong ;
Yang, Yi ;
Nie, Feiping .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (05) :1592-1601
[34]   HIERARCHICAL MULTI-FEATURE FUSION FOR MULTIMODAL DATA ANALYSIS [J].
Zhang, Hong ;
Chen, Li ;
Liu, Jun ;
Yuan, Junsong .
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, :5916-5920
[35]   Adaptive Multi-Feature Fusion for Underwater Diver Classification [J].
Yang Juan ;
Xu Feng ;
Wei Zhiheng ;
An Xudong ;
Liu Jia ;
Ji Yongqiang ;
Wen Tao .
2013 IEEE/OES ACOUSTICS IN UNDERWATER GEOSCIENCES SYMPOSIUM (RIO ACOUSTICS 2013), 2013,
[36]   An adaptive KCF tracking via multi-feature fusion [J].
Guo De-quan ;
Peng Sheng ;
Ling Sheng-gui ;
Yang Hong-yu ;
Liu Hong .
2017 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV 2017), 2017, :255-260
[37]   Multi-feature fusion for snowy traffic sign detection [J].
Wang, Zhanyu ;
Liu, Lintao ;
Du, Xuejing .
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (05)
[38]   Adaptive Multi-feature Fusion for Correlation Filter Tracking [J].
Liu, Linfeng ;
Yan, Xiaole ;
Shen, Qiu .
COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 :1057-1066
[39]   Pipeline signal feature extraction with improved VMD and multi-feature fusion [J].
Zhou, Yina ;
Zhang, Yong ;
Yang, Dandi ;
Lu, Jingyi ;
Dong, Hongli ;
Li, Gongfa .
SYSTEMS SCIENCE & CONTROL ENGINEERING, 2020, 8 (01) :318-327
[40]   Survival Situation Awareness Based on Multi-feature Fusion [J].
Zhao, Jinhui ;
Shuo, Liangxun ;
Qian, Xu .
INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 :2528-+