The Performance Implication of Task Size for Applications on the HPX Runtime System

被引:17
作者
Grubel, Patricia [1 ,4 ]
Kaiser, Hartmut [2 ,4 ]
Cook, Jeanine [1 ,3 ,4 ]
Serio, Adrian [2 ,4 ]
机构
[1] New Mexico State Univ, Klipsch Sch Elect & Comp Engn, Las Cruces, NM 88003 USA
[2] Louisiana State Univ, Ctr Computat & Technol, Baton Rouge, LA 70803 USA
[3] Sandia Natl Labs, Livermore, CA 94550 USA
[4] STE AR Grp Stellar Grp Org, Jacksonville, FL USA
来源
2015 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING - CLUSTER 2015 | 2015年
关键词
Task Granularity; Task Scheduling; Task Parallelism; HPX; ParalleX;
D O I
10.1109/CLUSTER.2015.119
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As High Performance Computing moves toward Exascale, where parallel applications will be expected to run on millions of cores concurrently, every component of the computational model must perform optimally. One such component, the task scheduler, can potentially be optimized to runtime application requirements. We focus our study using a task-based runtime system, one possible solution towards Exascale computation. Based on task size and scheduler, the overheads associated with task scheduling vary. Therefore, to minimize overheads and optimize performance, either the task size or the scheduler must adapt. In this paper, we focus on adapting the task size, which can be easily done statically and potentially done dynamically. To this end, we first show how scheduling overheads change with task size or granularity. We then propose and execute a methodology to characterize these overheads and dynamically measure the effects of task granularity. The HPX runtime system [1] employs asynchronous fine-grained task scheduling and incorporates a dynamic performance modeling capability, providing an ideal experimental platform. Using the performance counter capabilities in HPX, we characterize task scheduling overheads and show metrics to determine optimal task size. This is the first step toward the goal of dynamically adapting task size to optimize parallel performance.
引用
收藏
页码:682 / 689
页数:8
相关论文
共 20 条
  • [1] [Anonymous], TR200813 U NOTR DAM
  • [2] [Anonymous], 2014, THE QTHREAD LIB
  • [3] Contreras G, 2008, I S WORKL CHAR PROC, P53
  • [4] Dongarra J., 2001, INT C LIN CLUST HPC
  • [5] Gustafsson Niklas, 2014, N3857 IMPROVEMENTS S
  • [6] Huck K., 2013, ROSS 13
  • [7] *INT, 2010, INT THREAD BUILD BLO
  • [8] Kaiser H, 2015, HPX V0 9 10 GEN PURP
  • [9] Kaiser H., 2014, P 8 INT C PART GLOB, P1
  • [10] Leiserson CE, 2009, DES AUT CON, P522