A Compile-time Cost Model for Automatic OpenMP Decoupled Software Pipelining Parallelization

Cited by: 1
Authors
Liu, Xiaoxian [1]
Zhao, Rongcai [1]
Han, Lin [1]
Affiliation
[1] State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou, People's Republic of China
Source
2013 14TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2013) | 2013
Keywords
Automatic Parallelization; OpenMP; Decoupled Software Pipelining; Cost Model
DOI
10.1109/SNPD.2013.8
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The prevalence of control flow, recursive data structures, and general pointer accesses in ordinary programs renders traditional automatic parallelization techniques unsuitable. OpenMP Decoupled Software Pipelining (DSWP) has been proposed to exploit the pipeline parallelism latent in such programs, which traditional techniques cannot handle. A cost model is important for evaluating compiler transformations, guiding the compiler's optimization process, and achieving load balance, yet existing cost models are too simple to evaluate the profit of OpenMP parallelization, especially for DSWPed loops. In this paper, we propose a compile-time cost model for estimating the profit of automatic parallelization by extending the existing cost model in the loop nest optimizer (LNO) phase of Open64. Moreover, we improve the OpenMP DSWP algorithm based on our cost model, which increases the execution efficiency of automatically parallelized code. We evaluate our cost model on loops containing complex memory access patterns and control flow structures, which traditional techniques cannot handle, and on NAS Parallel Benchmarks (NPB) 3.3.1. Using our model yields evident performance improvements for the generated DSWPed loops and programs.
Pages: 253-260
Page count: 8