A Compile-time Cost Model for Automatic OpenMP Decoupled Software Pipelining Parallelization

被引:1
|
作者
Liu, Xiaoxian [1 ]
Zhao, Rongcai [1 ]
Han, Lin [1 ]
机构
[1] State Key Lab Math Engn & Adv Comp, Zhengzhou, Peoples R China
来源
2013 14TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2013) | 2013年
关键词
Automatic Parallelization; OpenMP; Decoupled Software Pipelining; Cost Model;
D O I
10.1109/SNPD.2013.8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The prevalence of control flow, recursive data structures, and general pointer accesses in ordinary programs renders the traditional automatic parallelization techniques unsuitable. OpenMP Decoupled Software Pipelining (DSWP) is proposed to exploit pipeline parallelism lurking in ordinary programs, which cannot be dealt with by traditional techniques. While cost model is important in helping evaluate compiler transformations, guiding the compiler in its optimization process and helping achieve load balancing, existing cost models are too simple to be sufficient for the profit evaluation of OpenMP, especially for DSWPed loops. We propose a compile-time cost model for automatic parallelization profit estimate by extending the existing cost model in Open64 loop nest optimizer (LNO) phase in this paper. Moreover, we improve the OpenMP DSWP algorithm based on our cost model, which increases execution efficiency of automatic parallelization. We evaluate our cost model with loops containing complex memory access patterns and control flow structure, which cannot be dealt with by traditional techniques, and NAS Parallel Benchmarks (NPB) 3.3.1. As a result, evident performance improvement for generated DSWPed loops and programs are obtained by using our model.
引用
收藏
页码:253 / 260
页数:8
相关论文
共 45 条
  • [1] An Automatic Parallel-Stage Decoupled Software Pipelining Parallelization Algorithm Based on OpenMP
    Liu, Xiaoxian
    Zhao, Rongcai
    Han, Lin
    Liu, Peng
    2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 1825 - 1831
  • [2] Automatic compile-time parallelization of prolog programs for dependent and-parallelism
    Pontelli, E
    Gupta, G
    Pulvirenti, F
    Ferro, A
    LOGIC PROGRAMMING: PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON LOGIC PROGRAMMING, 1997, : 108 - 122
  • [3] A cost model for automatic OpenMP parallelization
    Li, Yan-Bing (mr.liyanbing@gmail.com), 1600, Chinese Academy of Sciences (25):
  • [4] Compile-time Parallelization of Subscripted Subscript Patterns
    Bhosale, Akshay
    Eigenmann, Rudolf
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 317 - 325
  • [5] Decoupled Software Pipelining Creates Parallelization Opportunities
    Huang, Jialu
    Raman, Arun
    Jablin, Thomas B.
    Zhang, Yun
    Hung, Tzu-Han
    August, David I.
    CGO 2010: THE EIGHTH INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, PROCEEDINGS, 2010, : 121 - +
  • [6] Automatic compile-time parallelization of logic programs for restricted, goal level, independent and parallelism
    Muthukumar, K
    Bueno, F
    de la Banda, MG
    Hermenegildo, M
    JOURNAL OF LOGIC PROGRAMMING, 1999, 38 (02): : 165 - 218
  • [7] Improving the sparse parallelization using semantical information at compile-time
    Bandera, G
    Zapata, EL
    EURO-PAR 2000 PARALLEL PROCESSING, PROCEEDINGS, 2000, 1900 : 331 - 339
  • [8] Compile-time synchronization optimizations for software DSMs
    Han, HS
    Tseng, CW
    FIRST MERGED INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, 1998, : 662 - 669
  • [9] Automatic thread extraction with decoupled software pipelining
    Ottoni, G
    Rangan, R
    Stoler, A
    August, DI
    MICRO-38: PROCEEDINGS OF THE 38TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUMN ON MICROARCHITECTURE, 2005, : 105 - 116
  • [10] Automatic Security Bug Classification: A Compile-time Approach
    Du, Tie
    Zheng, Long
    Chen, Shaopeng
    Jin, Hai
    2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 398 - 405