Improving Multibank Memory Access Parallelism with Lattice-Based Partitioning

被引：15

作者：

Cilardo, Alessandro ^{[1
]}

Gallo, Luca ^{[1
]}

机构：

[1] Univ Naples Federico II, Dept Elect Engn & Informat Technol, I-80125 Naples, Italy

来源：

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION | 2014年 / 11卷 / 04期

关键词：

Design; Languages; Theory; Memory partitioning; polyhedral model; fine-grained distributed shared memory; field-programmable gate arrays;

D O I：

10.1145/2675359

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Emerging architectures, such as reconfigurable hardware platforms, provide the unprecedented opportunity of customizing the memory infrastructure based on application access patterns. This work addresses the problem of automated memory partitioning for such architectures, taking into account potentially parallel data accesses to physically independent banks. Targeted at affine static control parts (SCoPs), the technique relies on the Z-polyhedral model for program analysis and adopts a partitioning scheme based on integer lattices. The approach enables the definition of a solution space including previous works as particular cases. The problem of minimizing the total amount of memory required across the partitioned banks, referred to as storage minimization throughout the article, is tackled by an optimal approach yielding asymptotically zero memory waste or, as an alternative, an efficient approach ensuring arbitrarily small waste. The article also presents a prototype toolchain and a detailed step-by-step case study demonstrating the impact of the proposed technique along with extensive comparisons with alternative approaches in the literature.

引用

页数：25

共 43 条

[1]

Alias C, 2013, DES AUT TEST EUROPE, P575

[2]

[Anonymous], 2011, Encyclopedia of Parallel Computing

[3]

[Anonymous], 2014, PROC ACM SIGDA INT S

[4]

Barvinok A., 2002, A Course in Convexity

[5] Code generation in the polyhedral model is easier than you think [J].

Bastoul, C .

13TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION TECHNIQUES, PROCEEDINGS, 2004, :7-16

[6]

Bayliss S, 2012, FPGA 12: PROCEEDINGS OF THE 2012 ACM-SIGDA INTERNATIONAL SYMPOSIUM ON FIELD PROGRAMMABLE GATE ARRAYS, P195

[7]

Bondhugula U., 2007, PLUTO PRACTICAL FULL

[8] Graphics processing unit (GPU) programming strategies and trends in GPU computing [J].

Brodtkorb, Andre R. ;

Hagen, Trond R. ;

Saetra, Martin L. .

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (01) :4-13

[9]

Chatterjee Siddhartha, 1995, ACM SIGPLAN NOTICES, V28, P149

[10] Synthesis of custom interleaved memory systems [J].

Chen, S ;

Postula, A .

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2000, 8 (01) :74-83

← 1 2 3 4 5 →