Adaptively scheduling parallel loops in distributed shared-memory systems

被引:33
|
作者
Yan, Y [1 ]
Jin, CM [1 ]
Zhang, XD [1 ]
机构
[1] INTERVOICE INC,DALLAS,TX
基金
美国国家科学基金会;
关键词
adaptive scheduling algorithms; dynamic information; load balancing; parallel loops; processor affinity; shared-memory systems;
D O I
10.1109/71.569656
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Using runtime information of load distributions and processor affinity, we propose an adaptive scheduling algorithm and its variations from different control mechanisms. The proposed algorithm applies different degrees of aggressiveness to adjust loop scheduling granularities, aiming at improving the execution performance of parallel loops by making scheduling decisions that match the real workload distributions at runtime. We experimentally compared the performance of our algorithm and its variations with several existing scheduling algorithms on two parallel machines: the KSR-1 and the Convex Exemplar. The kernel application programs we used for performance evaluation were carefully selected for different classes of parallel loops. Our results show that using runtime information to adaptively adjust scheduling granularity is an effective way to handle loops with a wide range of load distributions when no prior knowledge of the execution can be used. The overhead caused by collecting runtime information is insignificant in comparison with the performance improvement. Our experiments show that the adaptive algorithm and its five variations outperformed the existing scheduling algorithms.
引用
收藏
页码:70 / 81
页数:12
相关论文
共 50 条
  • [31] ENSURING CORRECT ROLLBACK RECOVERY IN DISTRIBUTED SHARED-MEMORY SYSTEMS
    JANSSENS, B
    FUCHS, WK
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1995, 29 (02) : 211 - 218
  • [32] IMPLEMENTATION OF A PARALLEL UNSTRUCTURED EULER SOLVER ON SHARED-MEMORY AND DISTRIBUTED-MEMORY ARCHITECTURES
    MAVRIPLIS, DJ
    DAS, R
    SALTZ, J
    VERMELAND, RE
    JOURNAL OF SUPERCOMPUTING, 1995, 8 (04): : 329 - 344
  • [33] A SCALABLE DISTRIBUTED SHARED-MEMORY ARCHITECTURE
    KRISHNAMOORTHY, S
    CHOUDHARY, A
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1994, 22 (03) : 547 - 554
  • [34] Implementing the Open Community Runtime for Shared-Memory and Distributed-Memory Systems
    Dokulil, Jiri
    Sandrieser, Martin
    Benkner, Siegfried
    2016 24TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP), 2016, : 364 - 368
  • [35] KNOWLEDGE IN SHARED-MEMORY SYSTEMS
    MERRITT, M
    TAUBENFELD, G
    DISTRIBUTED COMPUTING, 1993, 7 (02) : 99 - 109
  • [36] UNAMBIGUOUS SHARED-MEMORY SYSTEMS
    Morin, Remi
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2010, 21 (04) : 665 - 685
  • [37] Parallelization of While Loops in Nested Loop Programs for Shared-Memory Multiprocessor Systems
    Geuns, Stefan J.
    Bekooij, Marco J. G.
    Bijlsma, Tjerk
    Corporaal, Henk
    2011 DESIGN, AUTOMATION & TEST IN EUROPE (DATE), 2011, : 697 - 702
  • [38] Distributed Shared Arrays: Portable Shared-Memory Programming Interface for Multiple Computer Systems
    Akira Nomoto
    Yasuo Watanabe
    Wataru Kaneko
    Shugo Nakamura
    Kentaro Shimizu
    Cluster Computing, 2004, 7 (1) : 65 - 72
  • [39] Performance analysis tools for parallel Java applications on shared-memory systems
    European Center for Parallelism of Barcelona , Computer Architecture Department, Technical University of Catalonia, Campus Nord UPC, C/ Jordi Girona 1-3, Mòdul C6, Barcelona
    E-08034, Spain
    不详
    EH9 3JZ, United Kingdom
    Proc. Int. Conf. Parallel Process., 1600, (357-364):
  • [40] Dynamic scheduling strategies for shared-memory multiprocessors
    Hamidzadeh, B
    Lilja, DJ
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 1996, : 208 - 215