Toward FPGA-Based HPC: Advancing Interconnect Technologies

Cited by: 20
Authors
Lant, Joshua [1]
Navaridas, Javier [2]
Lujan, Mikel [3]
Goodacre, John [4]
Affiliations
[1] Univ Manchester, APT Grp, Manchester, Lancs, England
[2] Univ Manchester, Adv Processors Technol Grp, Manchester, Lancs, England
[3] Univ Manchester, Dept Comp Sci, Manchester, Lancs, England
[4] Univ Manchester, Manchester, Lancs, England
Keywords
FPGA; HPC; Interconnect; Transport Layer
DOI
10.1109/MM.2019.2950655
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
HPC architects currently face myriad challenges from ever-tighter power constraints and changing workload characteristics. In this article, we discuss the current state of FPGAs within HPC systems. Recent technological advances show that they are well placed to penetrate the HPC market. However, a number of research problems remain; we address the requirements that system architectures and interconnects must meet to exploit FPGAs properly, highlighting the necessity of allowing FPGAs to act as full-fledged peers within a distributed system rather than as devices attached to a CPU. We argue that this model requires a reliable, connectionless, hardware-offloaded transport supporting a global memory space. Our results show that our full hardware implementation gives latency improvements of up to 25% versus a software-based transport, and demonstrate that our solution can outperform the state of the art in HPC workloads such as matrix-matrix multiplication, achieving 10% higher computing throughput.
Pages: 25-34
Number of pages: 10