Program Balancing in Compilation for Buffered Hybrid Dataflow Processors

被引:1
作者
Bhagyanath, Anoop [1 ]
Schneider, Klaus [1 ]
机构
[1] RPTU Kaiserslautern Landau, Dept Comp Sci, Kaiserslautern, Germany
来源
2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC | 2023年
关键词
D O I
10.1109/COMPSAC57700.2023.00018
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In traditional von Neumann processors, the central register file is an inherent limiting factor in exploiting the instruction-level parallelism (ILP) of programs. To alleviate this problem, many processors follow a hybrid von Neumann/dataflow computing model in which specific instruction sequences are executed in dataflow order by communicating intermediate values directly from producer processing units (PUs) to consumer PUs without using a central register file. However, the intermediate values often reside in local registers of the PUs, which requires a synchronization of the data transports that still limits the exploitation of the ILP. To avoid the use of a central register file and the need for any synchronization between PUs, some newer architectures suggest first-in-first-out (FIFO) buffers instead of local registers at the input and output ports of the PUs. Since values are produced and consumed, and are thus never overwritten (as in registers), the compiler must determine the required number of copies of each value. Furthermore, it is necessary to control the number of copies of values to develop buffer size aware compilation methods. However, the number of variable uses in a sequential program may depend on the future execution. This paper presents transformations for 'balancing' a given program, i.e., transforming the program so that for all points in the program, the number of future uses of all variables can be accurately determined in order to allocate the required buffer sizes in the later compilation phases. The classical space-time trade-off is demonstrated by the experimental results which show an improvement of the processor performance with increasing buffer sizes and vice versa. More importantly, the experimental results demonstrate the potential of buffered hybrid dataflow architectures for a scalable use of ILP.
引用
收藏
页码:57 / 66
页数:10
相关论文
共 38 条
[1]  
Agarwal V, 2000, PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, P248, DOI [10.1145/342001.339691, 10.1109/ISCA.2000.854395]
[2]  
Ananian C, 1999, MITLCSTR801
[3]  
[Anonymous], 2004, Principles and Practices of Interconnection Networks
[4]   Building Timing Predictable Embedded Systems [J].
Axer, Philip ;
Ernst, Rolf ;
Falk, Heiko ;
Girault, Alain ;
Grund, Daniel ;
Guan, Nan ;
Jonsson, Bengt ;
Marwedel, Peter ;
Reineke, Jan ;
Rochange, Christine ;
Sebastian, Maurice ;
Von Hanxleden, Reinhard ;
Wilhelm, Reinhard ;
Yi, Wang .
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13 (04)
[5]  
Bhagyanath A., 2022, INT S EMBEDDED MULTI
[6]  
Bhagyanath A., 2016, FORMAL METHODS MODEL
[7]  
Bhagyanath A., 2020, Ph. D. Dissertation
[8]  
Bhagyanath A, 2017, INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION (SAMOS 2017), P1, DOI 10.1109/SAMOS.2017.8344605
[9]   FLOW DIAGRAMS TURING MACHINES AND LANGUAGES WITH ONLY 2 FORMATION RULES [J].
BOHM, C ;
JACOPINI, G .
COMMUNICATIONS OF THE ACM, 1966, 9 (05) :366-&
[10]   SSI Properties Revisited [J].
Boissinot, Benoit ;
Brisk, Philip ;
Darte, Alain ;
Rastello, Fabrice .
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2012, 11 (01)