Processor Allocation for Optimistic Parallelization of Irregular Programs

Cited by: 0
Authors
Versaci, Francesco [1 ]
Pingali, Keshav [2 ]
Affiliations
[1] Univ Padua, TU Wien, I-35100AOGJ Padua, Italy
[2] Univ Texas Austin, Austin, TX USA
Source
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT I | 2012 / Vol. 7333
Keywords
Irregular algorithms; Optimistic parallelization; Automatic parallelization; Amorphous data-parallelism; Processor allocation; Unfriendly seating; Turan's theorem; SEATING ARRANGEMENT; IMPLEMENTATION;
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Optimistic parallelization is a promising approach for the parallelization of irregular algorithms: potentially interfering tasks are launched dynamically, and the runtime system detects conflicts between concurrent activities, aborting and rolling back conflicting tasks. However, parallelism in irregular algorithms is very complex. In a regular algorithm like dense matrix multiplication, the amount of parallelism can usually be expressed as a function of the problem size, so it is reasonably straightforward to determine how many processors should be allocated to execute a regular algorithm of a certain size (this is called the processor allocation problem). In contrast, parallelism in irregular algorithms can be a function of input parameters, and the amount of parallelism can vary dramatically during the execution of the irregular algorithm. Therefore, the processor allocation problem for irregular algorithms is very difficult. In this paper, we describe the first systematic strategy for addressing this problem. Our approach is based on a construct called the conflict graph, which (i) provides insight into the amount of parallelism that can be extracted from an irregular algorithm, and (ii) can be used to address the processor allocation problem for irregular algorithms. We show that this problem is related to a generalization of the unfriendly seating problem and, by extending Turan's theorem, we obtain a worst-case class of problems for optimistic parallelization, which we use to derive a lower bound on the exploitable parallelism. Finally, using some theoretically derived properties and some experimental facts, we design a quick and stable control strategy for solving the processor allocation problem heuristically.
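The role of the conflict graph can be illustrated with a minimal sketch. It is not the authors' algorithm or runtime: it assumes a conflict graph given as a dictionary of adjacency sets, models optimistic execution as tasks committing one by one in some order (a task aborts if it conflicts with an already committed task), and compares the number of commits with the classical independent-set form of Turán's theorem, which guarantees an independent set of size at least n / (d + 1) in a graph with n vertices and average degree d. The function names `greedy_commit`, `turan_bound`, and `disjoint_cliques` are hypothetical and chosen only for this illustration.

```python
import random


def greedy_commit(conflict_graph, order):
    """Process tasks in `order`; a task commits unless it conflicts
    with a task that has already committed (otherwise it aborts)."""
    committed = set()
    for task in order:
        if not (conflict_graph[task] & committed):
            committed.add(task)
    return committed


def turan_bound(conflict_graph):
    """Lower bound n / (d + 1) on the largest set of non-conflicting
    tasks, where d is the average degree of the conflict graph."""
    n = len(conflict_graph)
    avg_degree = sum(len(nbrs) for nbrs in conflict_graph.values()) / n
    return n / (avg_degree + 1)


def disjoint_cliques(num_cliques, clique_size):
    """Build a union of disjoint cliques, the graph family for which
    the classical Turan bound is tight (illustrative assumption)."""
    graph = {}
    for c in range(num_cliques):
        members = set(range(c * clique_size, (c + 1) * clique_size))
        for v in members:
            graph[v] = members - {v}
    return graph


graph = disjoint_cliques(3, 4)      # 12 tasks, every task has degree 3
order = list(graph)
random.Random(0).shuffle(order)     # arbitrary commit order
committed = greedy_commit(graph, order)
print(len(committed), turan_bound(graph))
```

On a union of disjoint cliques exactly one task per clique can commit regardless of the order, so the number of commits equals the bound. On other conflict graphs the greedy count depends on the order, which is the kind of variability a processor allocation strategy has to cope with.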
Pages: 1-14
Number of pages: 14
Related papers
23 in total
  • [1] Adaptive work-stealing with parallelism feedback
    Agrawal, Kunal
    Leiserson, Charles E.
    He, Yuxiong
    Hsu, Wen Jing
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2008, 26 (03):
  • [2] Alon N., 2015, PROBABILISTIC METHOD
  • [3] STAPL: An adaptive, generic parallel C++ library
    An, P
    Jula, A
    Rus, S
    Saunders, S
    Smith, T
    Tanase, G
    Thomas, N
    Amato, N
    Rauchwerger, L
    [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2003, 2624 : 193 - 208
  • [4] Blackford L., 1997, ScaLAPACK Users Guide
  • [5] Survey propagation: An algorithm for satisfiability
    Braunstein, A
    Mézard, M
    Zecchina, R
    [J]. RANDOM STRUCTURES & ALGORITHMS, 2005, 27 (02) : 201 - 226
  • [6] Eppstein D, 2000, HANDBOOK OF COMPUTATIONAL GEOMETRY, P425, DOI 10.1016/B978-044482537-7/50010-3
  • [7] Freedman D., 1962, SIAM REV, V4, P150, DOI DOI 10.1137/1004037
  • [8] FRIEDMAN HD, 1964, SIAM REV, V6, P91
  • [9] The implementation of the Cilk-5 multithreaded language
    Frigo, M
    Leiserson, CE
    Randall, KH
    [J]. ACM SIGPLAN NOTICES, 1998, 33 (05) : 212 - 223
  • [10] The design and implementation of FFTW3
    Frigo, M
    Johnson, SG
    [J]. PROCEEDINGS OF THE IEEE, 2005, 93 (02) : 216 - 231