Automated Toolchain for Enhanced Productivity in Reconfigurable Multi-accelerator Systems

被引:0
作者
Ortiz, Alberto [1 ]
Zamacola, Rafael [1 ]
Rodriguez, Alfonso [1 ]
Otero, Andres [1 ]
de la Torre, Eduardo [1 ]
机构
[1] Univ Politecn Madrid, Madrid, Spain
来源
APPLIED RECONFIGURABLE COMPUTING. ARCHITECTURES, TOOLS, AND APPLICATIONS, ARC 2020 | 2020年 / 12083卷
关键词
FPGAs; Dynamic and Partial Reconfiguration; Relocation; Productivity; Linux;
D O I
10.1007/978-3-030-44534-8_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ease-of-use and faster implementation times are key challenges that the community has to face to extend the use of FPGAs to non-hardware experts. In this paper, these challenges are tackled by integrating ARTICo(3) and IMPRESS tools to provide the users with a transparent way to build reconfigurable multi-accelerator systems. ARTICo(3) is an integrated framework that provides an automated toolchain to generate a hardware-based processing architecture to transparently manage custom-made accelerators at runtime. IMPRESS is a reconfiguration tool for building highly-flexible reconfigurable systems. The integration of both tools results in an efficient reconfigurable design flow that decouples the implementation of reconfigurable accelerators from the implementation of an ARTICo(3) static architecture that transparently distributes data to the accelerators. This static architecture is generated only once and reused in consecutive kernel implementations. This way, the user only needs to design the accelerators that are automatically implemented using interfaces compatible with the static architecture. From the user point of view, the reconfigurable fabric is a set of slots where accelerators can be transparently offloaded to decrease the workload on the processor. The integration of ARTICo(3) and IMPRESS also allows building relocatable accelerators, thus reducing the overall memory footprint required for the partial bitstreams. Moreover, model-based design of accelerators using Simulink has also been included as an additional option for users with no hardware background to further simplify the use of reconfigurable systems. Experimental results show that the implementation time is improved by up to 2.96x for a 4-slot reconfigurable system implementation with a memory footprint reduction of 4.54x.
引用
收藏
页码:45 / 60
页数:16
相关论文
共 19 条
[1]   RECONOS: AN OPERATING SYSTEM APPROACH FOR RECONFIGURABLE COMPUTING [J].
Agne, Andreas ;
Happe, Markus ;
Keller, Ariane ;
Luebbers, Enno ;
Plattner, Bernhard ;
Platzner, Marco ;
Plessl, Christian .
IEEE MICRO, 2014, 34 (01) :60-71
[2]  
[Anonymous], 2018, Technical report UG909
[3]  
[Anonymous], 2019, Technical report s105 v4
[4]  
[Anonymous], 2019, Technical report UG1023
[5]   LegUp: An Open-Source High-Level Synthesis Tool for FPGA-Based Processor/Accelerator Systems [J].
Canis, Andrew ;
Choi, Jongsok ;
Aldham, Mark ;
Zhang, Victor ;
Kammoona, Ahmed ;
Czajkowski, Tomasz ;
Brown, Stephen D. ;
Anderson, Jason H. .
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2013, 13 (02)
[6]  
Kintali Kiran., 2012, Model-based design with simulink, hdl coder, and xilinx system generator for dsp
[7]  
Koch D., 2012, ARCS 2012, V8146, P1
[8]  
Koch D., 2012, Partial Reconfiguration on FPGAs: Architectures, Tools and Applications, V153, DOI [10.1007/978-1-4614-1225-0, DOI 10.1007/978-1-4614-1225-0]
[9]   A Survey and Evaluation of FPGA High-Level Synthesis Tools [J].
Nane, Razvan ;
Sima, Vlad-Mihai ;
Pilato, Christian ;
Choi, Jongsok ;
Fort, Blair ;
Canis, Andrew ;
Chen, Yu Ting ;
Hsiao, Hsuan ;
Brown, Stephen ;
Ferrandi, Fabrizio ;
Anderson, Jason ;
Bertels, Koen .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2016, 35 (10) :1591-1604
[10]  
Putnam A, 2014, CONF PROC INT SYMP C, P13, DOI 10.1109/ISCA.2014.6853195