Performance Comparison of Speculative Taskloop and OpenMP-for-Loop Thread-Level Speculation on Hardware Transactional Memory

被引:1
作者
Salamanca, Juan [1 ]
机构
[1] Sao Paulo State Univ Unesp, DEMAC, IGCE, Sao Paulo, Brazil
来源
2022 21ST INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC 2022) | 2022年
基金
巴西圣保罗研究基金会;
关键词
OpenMP; taskloop; Thread-Level Speculation; parallel for; Hardware Transactional Memory;
D O I
10.1109/ISPDC55340.2022.00021
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Speculative Taskloop (STL) is a loop parallelization technique that takes the best of Task-based Parallelism and Thread-Level Speculation to speed up loops with may loop-carried dependencies that were previously difficult for compilers to parallelize. Previous studies show the efficiency of STL when implemented using Hardware Transactional Memory and the advantages it offers compared to a typical DOACROSS technique such as OpenMP ordered. This paper presents a performance comparison between STL and a previously proposed technique that implements Thread-Level Speculation (TLS) in the for worksharing construct (FOR-TLS) over a set of loops from cbench and SPEC2006 benchmarks. The results show interesting insights on how each technique can be more appropriate depending on the characteristics of the evaluated loop. Experimental results reveal that by implementing both techniques on top of HTM, speed-ups of up to 2.41x can be obtained for STL and up to 2x for FOR-TLS.
引用
收藏
页码:83 / 90
页数:8
相关论文
共 21 条
[1]  
[Anonymous], 1997, Advances in computers
[2]   Toward efficient and robust software speculative parallelization on multiprocessors [J].
Cintra, M ;
Llanos, DR .
ACM SIGPLAN NOTICES, 2003, 38 (10) :13-24
[3]   OpenMP Loop Scheduling Revisited: Making a Case for More Schedules [J].
Ciorba, Florina M. ;
Iwainsky, Christian ;
Buder, Patrick .
EVOLVING OPENMP FOR EVOLVING ARCHITECTURES, 2018, 11128 :21-36
[4]  
cTuning Foundation, 2016, CBENCH COLL BENCHM
[5]  
Cytron R., 1986, Proceedings of the 1986 International Conference on Parallel Processing (Cat. No.86CH2355-6), P836
[6]  
Henning JL, 2006, ACM SIGARCH Computer Architecture News, V34, P1, DOI DOI 10.1145/1186736.1186737
[7]  
Odaira R, 2014, I S WORKL CHAR PROC, P212, DOI 10.1109/IISWC.2014.6983060
[8]  
OpenMP-ARB, 2015, OpenMP application program interface version 4.5
[9]   The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization [J].
Rauchwerger, L ;
Padua, DA .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1999, 10 (02) :160-180
[10]  
Rauchwerger L, 2011, SPECULATIVE PARALLEL, P1901