Threaded MPI programming model for the Epiphany RISC array processor

被引:14
作者
Richie, David [1 ]
Ross, James [2 ]
Park, Song [3 ]
Shires, Dale [3 ]
机构
[1] Brown Deer Technol, Rockville, MD 20850 USA
[2] Engility Corp, Adelphi, MD USA
[3] US Army Res Lab, Adelphi, MD USA
关键词
2D RISC array; Threaded MPI; Adapteva Epiphany; Parallella; Energy efficiency;
D O I
10.1016/j.jocs.2015.04.023
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The low-power Adapteva Epiphany RISC array processor offers high computational energy-efficiency and parallel scalability. However, extracting performance with a standard parallel programming model remains a great challenge. We present an effective programming model for the Epiphany architecture based on the Message Passing Interface (MPI) standard adapted for coprocessor offload. Using MPI exploits the similarities between the Epiphany architecture and a networked parallel distributed cluster. Furthermore, our approach enables codes written with MPI to execute on the RISC array processor with little modification. We present experimental results for matrix-matrix multiplication using MPI and highlight the importance of fast inter-core data transfers. Using MPI we demonstrate an on-chip performance of 9.1 GFLOPS with an efficiency of 15.3 GFLOPS/W. Threaded MPI exhibits the highest performance reported for the Epiphany architecture using a standard parallel programming model. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:94 / 100
页数:7
相关论文
共 20 条
[1]  
[Anonymous], E16G301 EP 16 COR MI
[2]  
[Anonymous], [No title captured]
[3]  
[Anonymous], 2006, Tech. rep.
[4]  
Bergman K., 2008, DEF ADV RES, V15
[5]   Thousand core chips-a technology perspective [J].
Borkar, Shekhar .
2007 44TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2007, :746-749
[6]  
Cannon L. E, 1969, THESIS MONTANA STATE
[7]  
Demaine E. D., 1997, Conference Record of HPCS'97: 11th Annual International Symposium on High Performance Computing Systems. Papers Presented at the Symposium, P153
[8]   Realizing Efficient Execution of Dataflow Actors on Manycores [J].
Gebrewahid, Essayas ;
Yang, Mingkun ;
Cedersj, Gustav ;
Ul-Abdin, Zain ;
Gaspes, Veronica ;
Janneck, Jorn W. ;
Svensson, Bertil .
2014 12TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2014), 2014, :321-328
[9]  
Gropp W. D., 2001, High Performance Computing - HiPC 2001. 8th International Conference. Proceedings (Lecture Notes in Computer Science Vol.2238), P81
[10]  
Held J., 2006, Technical report