Using hardware-transactional-memory support to implement speculative task execution

被引:0
|
作者
Salamanca, Juan [1 ]
Baldassin, Alexandro [2 ]
机构
[1] Univ Campinas UNICAMP, Campinas, Brazil
[2] Sao Paulo State Univ Unesp, Dept Stat Appl Math & Comp DEMAC IGCE, Sao Paulo, Brazil
关键词
Speculative task execution; Hardware transactional memory; Speculative taskloop; LEVEL SPECULATION; PRIVATIZATION;
D O I
10.1016/j.jpdc.2024.104939
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Loops take up most of the time of computer programs, so optimizing them so that they run in the shortest time possible is a continuous task. However, this task is not negligible; on the contrary, it is an open area of research since many irregular loops are hard to parallelize. Generally, these loops have loop-carried (DOACROSS) dependencies and the appearance of dependencies could depend on the context. Many techniques have been studied to be able to parallelize these loops efficiently; however, for example in the OpenMP standard there is no efficient way to parallelize them. This article presents Speculative Task Execution (STE), a technique that enables the execution of OpenMP tasks in a speculative way to accelerate certain hot -code regions (such as loops) marked by OpenMP directives. It also presents a detailed analysis of the application of Hardware Transactional Memory (HTM) support for executing tasks speculatively and describes a careful evaluation of the implementation of STE using HTM on modern machines. In particular, we consider the scenario in which speculative tasks are generated by the OpenMP taskloop construct ( Speculative Taskloop (STL) ). As a result, it provides evidence to support several important claims about the performance of STE over HTM in modern processor architectures. Experimental results reveal that: (a) by implementing STL on top of HTM for hot -code regions, speed-ups of up to 5.39x can be obtained in IBM POWER8 and of up to 2.41x in Intel processors using 4 cores; and (b) STL-ROT, a variant of STL using rollback-only transactions (ROTs), achieves speed-ups of up to 17 .70x in IBM POWER9 processor using 20 cores.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Using value prediction to increase the power of speculative execution hardware
    Gabbay, F
    Mendelson, A
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1998, 16 (03): : 234 - 270
  • [22] Using Hardware Transactional Memory for Data Race Detection
    Gupta, Shantanu
    Sultan, Florin
    Cadambi, Srihari
    Ivancic, Franjo
    Roetteler, Martin
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 267 - +
  • [23] Software Support and Evaluation of Hardware Transactional Memory on Blue Gene/Q
    Wang, Amy
    Gaudet, Matthew
    Wu, Peng
    Ohmacht, Martin
    Amaral, Jose Nelson
    Barton, Christopher
    Silvera, Raul
    Michael, Maged M.
    IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (01) : 233 - 246
  • [24] Protecting Private Keys of Dilithium Using Hardware Transactional Memory
    Meng, Lingjia
    Fu, Yu
    Zheng, Fangyu
    Ma, Ziqiang
    Wang, Mingyu
    Ye, Dingfeng
    Lin, Jingqiang
    INFORMATION SECURITY, ISC 2023, 2023, 14411 : 288 - 306
  • [25] FaulTM: Error Detection and Recovery Using Hardware Transactional Memory
    Yalcin, Gulay
    Unsal, Osman
    Cristal, Adrian
    DESIGN, AUTOMATION & TEST IN EUROPE, 2013, : 220 - 225
  • [26] Using A Runtime to Overcome The Pathologies in Hardware Transactional Memory Systems
    Yan, Zhichao
    Feng, Dan
    Tan, Yujuan
    JOURNAL OF COMPUTERS, 2013, 8 (01) : 18 - 26
  • [27] Performance Comparison of Speculative Taskloop and OpenMP-for-Loop Thread-Level Speculation on Hardware Transactional Memory
    Salamanca, Juan
    2022 21ST INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC 2022), 2022, : 83 - 90
  • [28] Protecting Private Keys against Memory Disclosure Attacks using Hardware Transactional Memory
    Guan, Le
    Lin, Jingqiang
    Luo, Bo
    Jing, Jiwu
    Wang, Jing
    2015 IEEE SYMPOSIUM ON SECURITY AND PRIVACY SP 2015, 2015, : 3 - 19
  • [29] Transactional Memory with Strong Atomicity Using Off-the-Shelf Memory Protection Hardware
    Abadi, Martin
    Harris, Tim
    Mehrara, Mojtaba
    ACM SIGPLAN NOTICES, 2009, 44 (04) : 185 - 195
  • [30] Increasing the degree of parallelism using speculative execution in task-based runtime systems
    Bramas, Berenger
    PEERJ COMPUTER SCIENCE, 2019, 2019 (03)