Automatic computation and data decomposition algorithm with no communication

被引:0
作者
Han Lin [1 ]
Zhao Rongcai [1 ]
Pang Jianmin [1 ]
Dong Chunli [1 ]
机构
[1] Natl Digital Switching Syst Engn & Technol R&D Ct, Zhengzhou, Peoples R China
来源
DCABES 2006 PROCEEDINGS, VOLS 1 AND 2 | 2006年
关键词
parallel recognition compiler; computation partition; data distribution;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For distributed memory machines the speed that processor accesses local memories is much faster than its speed to access remote memories. Thus how to decompose data and computation properly to achieve maximum parallelism and minimum communication is a key issue of automatic parallel compilation. In this paper, the authors present an automatic decomposition algorithm based on constraint equations. Using the algorithm, a data and computation decomposition result with no communication can be achieved. By releasing these constraint equations, the authors get decomposition result with more parallelism. As the result may have communications, so an improved method to eliminate some communications by data replication is also presented.
引用
收藏
页码:270 / 273
页数:4
相关论文
共 8 条
  • [1] ANDERSON J, 1993, P SIGPLAN 93 C PROGR, P112
  • [2] HUANG CH, 1992, LECT NOTES COMPUT SC, V589, P186, DOI 10.1007/BFb0038665
  • [3] KENNEDY K, 1995, P SUPERCOMPUTER C
  • [4] DATA OPTIMIZATION - ALLOCATION OF ARRAYS TO REDUCE COMMUNICATION ON SIMD-MACHINES
    KNOBE, K
    LUKAS, JD
    STEELE, GL
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1990, 8 (02) : 102 - 118
  • [5] KULKARNI D, 1991, P 1991 ACM INT C SUP, P206
  • [6] KUMAR KG, 1992, P 1992 ACM INT C SUP, P82
  • [7] LAM MS, 1992, P DARPA SOFTW TECHN, P150
  • [8] WOLF ME, 1992, CSLTR92538 STANF U