Area-Efficient Near-Associative Memories on FPGAs

被引:7
作者
Dhawan, Udit [1 ]
Dehon, Andre [1 ]
机构
[1] Univ Penn, Dept Elect & Syst Engn, Philadelphia, PA 19104 USA
关键词
Algorithms; Design; Performance; FPGA; BRAM; associative memory; CAM; cache; hashing; MANAGEMENT;
D O I
10.1145/2629471
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Associative memories can map sparsely used keys to values with low latency but can incur heavy area overheads. The lack of customized hardware for associative memories in today's mainstream FPGAs exacerbates the overhead cost of building these memories using the fixed address match BRAMs. In this article, we develop a new, FPGA-friendly, memory system architecture based on a multiple hash scheme that is able to achieve near-associative performance without the area-delay overheads of a fully associative memory on FPGAs. At the same time, we develop a novel memory management algorithm that allows us to statistically mimic an associative memory. Using the proposed architecture as a 64KB L1 data cache, we show that it is able to achieve near-associative miss rates while consuming 3-13 x fewer FPGA memory resources for a set of benchmark programs from the SPEC CPU2006 suite than fully associative memories generated by the Xilinx Coregen tool. Benefits for our architecture increase with key width, allowing area reduction up to 100x. Mapping delay is also reduced to 3.7ns for a 1,024-entry flat version or 6.1ns for an area-efficient version compared to 17.6ns for a fully associative memory for a 64-bit key on a Xilinx Virtex 6 device.
引用
收藏
页数:22
相关论文
共 28 条
[1]  
[Anonymous], 2000, IEEE ACM T NETWORK
[2]  
[Anonymous], 2012, P SASO WORKSH AD HOS
[3]  
[Anonymous], 2011, VIRT 6 FPGA DAT SHEE
[4]   Balanced allocations [J].
Azar, Y ;
Broder, AZ ;
Karlin, AR ;
Upfal, E .
SIAM JOURNAL ON COMPUTING, 1999, 29 (01) :180-200
[5]  
Battle S, 2012, INT S HIGH PERF COMP, P273
[6]   SPACE/TIME TRADE/OFFS IN HASH CODING WITH ALLOWABLE ERRORS [J].
BLOOM, BH .
COMMUNICATIONS OF THE ACM, 1970, 13 (07) :422-&
[7]  
Bluespec Inc, 2012, BLUESP SYSTEMVERILOG
[8]   PRACTICAL DICTIONARY MANAGEMENT FOR HARDWARE DATA-COMPRESSION [J].
BUNTON, S ;
BORRIELLO, G .
COMMUNICATIONS OF THE ACM, 1992, 35 (01) :95-104
[9]  
Chazelle B., 2004, P 15 ANN ACM SIAM S, P30
[10]   AN OPTIMAL ALGORITHM FOR GENERATING MINIMAL PERFECT HASH FUNCTIONS [J].
CZECH, ZJ ;
HAVAS, G ;
MAJEWSKI, BS .
INFORMATION PROCESSING LETTERS, 1992, 43 (05) :257-264