Learning-based bypass zone search algorithm for fast motion estimation

被引:3
作者
Goncalves, Paulo [1 ]
Correa, Guilherme [1 ]
Agostini, Luciano [1 ]
Porto, Marcelo [1 ]
机构
[1] Fed Univ Pelotas UFPel, Grad Program Comp PPGC, Video Technol Res Grp ViTech, Pelotas, RS, Brazil
关键词
Motion estimation; Machine learning; Search pattern; Video coding; High efficiency video coding; MODE DECISION;
D O I
10.1007/s11042-022-13094-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video coding has been widely explored by academia and industry in recent years, mainly due to the great popularization of video applications and multimedia-capable devices. The Motion Estimation (ME) process receives special attention since it is one of the most complex steps in video coding. The Test Zone Search (TZS) is the main algorithm employed for integer ME in recent video codecs, such as those based on the High Efficiency Video Coding (HEVC), and has been used in the standardization process of the future Versatile Video Coding (VVC) standard. However, even though it is designed as a fast ME algorithm, the computational effort required by TZS is still very high, compromising the encoding process in multimedia-capable devices that operate on limited energy or computational resources. This work presents the Bypass Zone Search (BZS) algorithm, a learning-based solution for fast ME that improves TZS, aiming at a better tradeoff between compression efficiency and computational cost. First, a set of analyses on TZS is presented, which allowed the design of two strategies to reduce the ME computational cost. The first one, named as Learning-based Bypass Motion Estimation (LBME), consists of a machine learning-based approach that predicts whether the best motion vector has already been found and bypasses the remaining ME steps. The second strategy, named as Astroid Raster Pattern (ARP), is a novel search pattern developed for the most complex TZS step, the Raster Search. By combining the two proposed strategies in BZS, the ME processing time is reduced by 60.98% (Random Access) and 63.05% (Low Delay) in comparison to TZS. The overall HEVC encoding time is reduced by 14.32% (Random Access) and 17.64% (Low Delay), with a negligible loss of 0.0837% (Random Access) and 0.04% (Low Delay) in BD-rate.
引用
收藏
页码:3535 / 3560
页数:26
相关论文
共 39 条
[1]  
Afonso V, 2020, J INTEGR CIRC SYST, V11, P106, DOI 10.29292/jics.v11i2.435
[2]   Efficient algorithms for geometric optimization [J].
Agarwal, PK ;
Sharir, M .
ACM COMPUTING SURVEYS, 1998, 30 (04) :412-458
[3]  
[Anonymous], VIDEO TECHNOLOGY RES
[4]  
[Anonymous], 2014, Cisco visual networking index: Global mobile data traffic forecast update, 2014-2019
[5]  
[Anonymous], 2020, ISOIECJCT1SC29WG11
[6]  
[Anonymous], 1999, ITU-T Recommendation H.320
[7]   A new fast motion estimation algorithm using fast mode decision for high-efficiency video coding standard [J].
Belghith, Fatma ;
Kibeya, Hassan ;
Loukil, Hassen ;
Ben Ayed, Mohamed Ali ;
Masmoudi, Nouri .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2016, 11 (04) :675-691
[8]  
BJONTEGAARD G, 2001, VCEGM33
[9]  
Bross B., 2020, JVET-S2001
[10]   Novel cross-diamond-hexagonal search algorithms for fast block motion estimation [J].
Cheung, CH ;
Po, LM .
IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (01) :16-22