On Combining Wavefront and Tile Parallelism with a Novel GPU-Friendly Fast Search

被引:2
作者
Papaioannou, Georgios I. [1 ]
Koziri, Maria [2 ]
Loukopoulos, Thanasis [1 ]
Anagnostopoulos, Ioannis [1 ]
机构
[1] Univ Thessaly, Dept Comp Sci & Biomed Informat, Lamia 35131, Greece
[2] Univ Thessaly, Dept Informat & Telecommun, Lamia 35131, Greece
关键词
video coding; tile parallelism; wavefront parallelism; HEVC; VCC; WTP; motion estimation; GPU; HEVC MOTION ESTIMATION; OPTIMIZATION; DECISION; SAD;
D O I
10.3390/electronics12102223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the necessity of supporting ever-increasing demands in video resolution leads to new video coding standards, the challenge of harnessing their computational overhead becomes important. Such overhead stems not only from the increased image data due to higher resolutions but also from the coding techniques per se that are introduced by each standard to improve compression. All modern standards in the field of video coding offer high compression efficiency, but this is achieved by increasing the computational complexity of the encoding part. Ultra-High-Definition (UHD) videos, bring new encoding implementation schemes that are being recommended for CPU and GPU parallelization. Therefore, several works are published to achieve better performance and reduce encoding complexity. Following this idea, we proposed and evaluated a hybrid encoding scheme that utilizes the constant growth of the CPU power with the massive GPU popularity in parallel. Taking advantage of the encoding schemes from the leading video coding standards, such as High-Efficiency Video Coding (HEVC) and Versatile Video Coding (VVC), which support parallel processing thru Wavefront or Tiling, in our work, we combined both of them at the same time as a whole, and in addition, we introduced a GPU-friendly fast search algorithm that is highly parallel and alternative to the default non-parallel TZ-Search. Through an experimental evaluation with common test sequences, the proposed GPU Fast Motion Estimation with our previous Wavefront per Tile Parallelism (WTP) was shown to provide valid trade-off between speedup and video coding efficiency, effectively combining the best of two worlds, i.e., WTP using CPUs and parallel Motion Estimation with GPUs.
引用
收藏
页数:17
相关论文
共 38 条
[1]  
[Anonymous], 2014, CUDA C PROGRAMMING G
[2]  
[Anonymous], 2001, 13 VCEG M NO VCEGM33
[3]  
[Anonymous], 2012, P 9 M JCT VC GEN SWI
[4]   A new fast motion estimation algorithm using fast mode decision for high-efficiency video coding standard [J].
Belghith, Fatma ;
Kibeya, Hassan ;
Loukil, Hassen ;
Ben Ayed, Mohamed Ali ;
Masmoudi, Nouri .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2016, 11 (04) :675-691
[5]  
Catanzaro B., 2010, OPENCL OPTIMIZATION
[6]   Heterogeneous CPU plus GPU approaches for HEVC [J].
Cebrian-Marquez, Gabriel ;
Galiano, Vicente ;
Migallon, Hector ;
Luis Martinez, Jose ;
Cuenca, Pedro ;
Lopez-Granado, Otoniel .
JOURNAL OF SUPERCOMPUTING, 2019, 75 (03) :1215-1226
[7]  
Chen W.N., 2008, P 2008 IEEE INT C MU
[8]  
Ezzahra F., 2019, RECENT ADV ELECT ELE, V12, P317
[9]   An Efficient VLSI Architecture for Fast Motion Estimation Exploiting Zero Motion Prejudgment Technique and a New Quadrant-Based Search Algorithm in HEVC [J].
Francis, Shajin H. ;
Rajesh, P. ;
Raja, M. Ramkumar .
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (03) :1751-1774
[10]  
Gogoi S., 2019, P IEEE INT S SMART E