An automatic mapping technique for OpenACC kernel code based on deeply fused and heterogeneous many-core architecture

被引:0
作者
Zhang, Libo [1 ]
Mao, Xingquan [1 ]
You, Hongtao [1 ]
Gu, Long [1 ]
Jiang, Xiaocheng [1 ]
机构
[1] Wuxi Jiangnan Inst Comp Technol, Wuxi, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Supercomputer; Heterogeneous; Many-core; Fused; OpenACC; Data layout; Automatic mapping;
D O I
10.1007/s42514-020-00050-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Now the OpenACC has become a popular programming interface for many-core application programming. Internationally, a lot of research have been done on OpenACC for CPU + GPU heterogeneous many-core architecture. Among them, the PGI OpenACC compiler developed by NVIDIA is the most advanced one. But there are few research on OpenACC related to the Home Grown Heterogeneous Many-Core (HGHM) Architecture that is different from GPU. This paper proposes an automatic mapping technique for OpenACC kernel code based on the OpenACC compiler to a heterogeneous and deeply fused many-core architecture. Our approach uses the static analysis and feedback dynamic analysis of the compiler to perform the automatic mapping of the program parallel kernel code to many-core devices, and it greatly improves the transformation quality of the compiler. Experimental results show that this technique can greatly improve the efficiency of using OpenACC to port applications to heterogeneous and fused many-core system without impacting program acceleration performance.
引用
收藏
页码:323 / 331
页数:9
相关论文
共 21 条