Efficient Use of Hardware Transactional Memory for Parallel Mesh Generation

被引:0
作者
Kobayashi, Tetsu [1 ]
Sato, Shigeyuki [1 ]
Iwasaki, Hideya [1 ]
机构
[1] Univ Electrocommun, Dept Commun Engn & Informat, Tokyo, Japan
来源
2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP) | 2015年
关键词
hardware transactional memory; Delaunay mesh refinement; Restricted Transactional Memory;
D O I
10.1109/ICPP.2015.69
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient transactional executions are desirable for parallel implementations of algorithms with graph refinements. Hardware transactional memory (HTM) is promising for easy yet efficient transactional executions. Long HTM transactions, however, abort with high probability because of hardware limitations. Unfortunately, Delaunay mesh refinement (DMR), which is an algorithm with graph refinements for mesh generation, causes long transactions. Its parallel implementation naively based on HTM therefore leads to poor performance. To utilize HTM efficiently for parallel implementation of DMR, we present an approach to shortening transactions. Our HTM-based implementations of DMR achieved significantly higher throughput and better scalability than a naive HTM-based one and lock-based ones. On a quad-core Haswell processor, the absolute speedup of one of our implementations was up to 2.64 with 16 threads.
引用
收藏
页码:600 / 609
页数:10
相关论文
共 50 条
  • [31] Protecting Private Keys of Dilithium Using Hardware Transactional Memory
    Meng, Lingjia
    Fu, Yu
    Zheng, Fangyu
    Ma, Ziqiang
    Wang, Mingyu
    Ye, Dingfeng
    Lin, Jingqiang
    INFORMATION SECURITY, ISC 2023, 2023, 14411 : 288 - 306
  • [32] Analysis of the Interactions Between ILP and TLP With Hardware Transactional Memory
    Nicolas-Conesa, Victor
    Titos-Gil, Ruben
    Fernandez-Pascual, Ricardo
    Ros, Alberto
    Acacio, Manuel E.
    30TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2022), 2022, : 157 - 164
  • [33] Hardware Transactional Memory on Multi-processor FPGA Platform
    Sirkunan, Jeevan
    Ooi, Chia Yee
    Shaikh-Husin, N.
    Hau, Yuan Wen
    Marsono, M. N.
    2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2014, : 2744 - 2747
  • [34] Using Hardware Transactional Memory to Enable Speculative Trace Optimization
    Salamanca, Juan
    Amaral, Jose Nelson
    Araujo, Guido
    2015 INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOP (SBAC-PADW), 2015, : 1 - 6
  • [35] Optimised memory allocation for less false abortion and better performance in hardware transactional memory
    Li, Xiuhong
    Gulila, Altenbek
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2020, 35 (04) : 483 - 491
  • [36] The Scalability of Disjoint Data Structures on a New Hardware Transactional Memory System
    Su, Gong
    Heisig, Stephen
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2015, 43 (06) : 1192 - 1217
  • [37] Enhancing scalability in best-effort hardware transactional memory systems
    Quislant, Ricardo
    Gutierrez, Eladio
    Zapata, Emilio L.
    Plata, Oscar
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 104 : 73 - 87
  • [38] Techniques to Improve Performance in Requester-Wins Hardware Transactional Memory
    Armejach, Adria
    Titos-Gil, Ruben
    Negi, Anurag
    Unsal, Osman S.
    Cristal, Adrian
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2013, 10 (04)
  • [39] Software Support and Evaluation of Hardware Transactional Memory on Blue Gene/Q
    Wang, Amy
    Gaudet, Matthew
    Wu, Peng
    Ohmacht, Martin
    Amaral, Jose Nelson
    Barton, Christopher
    Silvera, Raul
    Michael, Maged M.
    IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (01) : 233 - 246
  • [40] The Scalability of Disjoint Data Structures on a New Hardware Transactional Memory System
    Gong Su
    Stephen Heisig
    International Journal of Parallel Programming, 2015, 43 : 1192 - 1217