Efficient Use of Hardware Transactional Memory for Parallel Mesh Generation

被引:0
作者
Kobayashi, Tetsu [1 ]
Sato, Shigeyuki [1 ]
Iwasaki, Hideya [1 ]
机构
[1] Univ Electrocommun, Dept Commun Engn & Informat, Tokyo, Japan
来源
2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP) | 2015年
关键词
hardware transactional memory; Delaunay mesh refinement; Restricted Transactional Memory;
D O I
10.1109/ICPP.2015.69
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient transactional executions are desirable for parallel implementations of algorithms with graph refinements. Hardware transactional memory (HTM) is promising for easy yet efficient transactional executions. Long HTM transactions, however, abort with high probability because of hardware limitations. Unfortunately, Delaunay mesh refinement (DMR), which is an algorithm with graph refinements for mesh generation, causes long transactions. Its parallel implementation naively based on HTM therefore leads to poor performance. To utilize HTM efficiently for parallel implementation of DMR, we present an approach to shortening transactions. Our HTM-based implementations of DMR achieved significantly higher throughput and better scalability than a naive HTM-based one and lock-based ones. On a quad-core Haswell processor, the absolute speedup of one of our implementations was up to 2.64 with 16 threads.
引用
收藏
页码:600 / 609
页数:10
相关论文
共 50 条
[41]   Privatizing transactions for Lee's algorithm in commercial hardware transactional memory [J].
Quislant, Ricardo ;
Gutierrez, Eladio ;
Zapata, Emilio L. ;
Plata, Oscar .
JOURNAL OF SUPERCOMPUTING, 2018, 74 (04) :1676-1694
[42]   Experiences with Disjoint Data Structures in a New Hardware Transactional Memory System [J].
Su, Gong ;
Heisig, Stephen .
2013 25TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 2013, :9-16
[43]   Parallelizing Stochastic Gradient Descent with Hardware Transactional Memory for Matrix Factorization [J].
Wu, Zhenwei ;
Luo, Yingqi ;
Lu, Kai ;
Wang, Xiaoping .
2018 3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS ENGINEERING (ICISE), 2018, :118-121
[44]   Privatizing transactions for Lee’s algorithm in commercial hardware transactional memory [J].
Ricardo Quislant ;
Eladio Gutierrez ;
Emilio L. Zapata ;
Oscar Plata .
The Journal of Supercomputing, 2018, 74 :1676-1694
[45]   Eliminating Global Interpreter Locks in Ruby through Hardware Transactional Memory [J].
Odaira, Rei ;
Castanos, Jose G. ;
Tomari, Hisanobu .
ACM SIGPLAN NOTICES, 2014, 49 (08) :131-142
[46]   An Effective Conflict Management for Large Transactions in Hardware Transactional Memory System [J].
Fu, Chen ;
Wen, Dongxin ;
Wang, Xiaoqun ;
Yang, Xiaozong .
INFORMATION AND AUTOMATION, 2011, 86 :390-396
[47]   Using hardware-transactional-memory support to implement speculative task execution [J].
Salamanca, Juan ;
Baldassin, Alexandro .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 192
[48]   Selective dynamic serialization for reducing energy consumption in hardware transactional memory systems [J].
Epifanio Gaona ;
J. Rubén Titos-Gil ;
Juan Fernández ;
Manuel E. Acacio .
The Journal of Supercomputing, 2014, 68 :914-934
[49]   Selective dynamic serialization for reducing energy consumption in hardware transactional memory systems [J].
Gaona, Epifanio ;
Titos-Gil, J. Ruben ;
Fernandez, Juan ;
Acacio, Manuel E. .
JOURNAL OF SUPERCOMPUTING, 2014, 68 (02) :914-934
[50]   LockillerTM: Enhancing Performance Lower Bounds in Best-Effort Hardware Transactional Memory [J].
Wan, Li ;
Chao, Fu ;
Li, Qiang ;
Han, Jun .
PROCEEDINGS 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS 2024, 2024, :865-875