Crossbar-Constrained Technology Mapping for ReRAM Based In-Memory Computing

被引:12
作者
Bhattacharjee, Debjyoti [1 ]
Tavva, Yaswanth [2 ]
Easwaran, Arvind [1 ]
Chattopadhyay, Anupam [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[2] NUS, Singapore 119077, Singapore
关键词
Memristor; ReRAM; memristive systems; logic design; technology mapping; design automation; EXPRESSIONS;
D O I
10.1109/TC.2020.2964671
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In-memory computing has gained significant attention due to the potential for dramatic improvement in speed and energy. Redox-based resistive RAMs (ReRAMs), capable of non-volatile storage and logic operations simultaneously have been used for logic-in-memory computing approaches. To this effect, we propose ReRAM based VLIW Architecture for in-Memory comPuting (ReVAMP), supported by a detailed device-accurate simulation setup with peripheral circuitry. We present theoretical bounds on the minimum area required for in-memory computation of arbitrary Boolean functions specified using structural representation (And-Inverter Graph and Majority-Inverter Graph) and two-level representation (Exclusive-Sum-of-Product). To support the ReVAMP architecture, we present two technology mapping flows that fully exploit the bit-level parallelism offered by the execution of logic using ReRAM crossbar array. The area-constrained mapping (ArC) generates feasible mapping for a variety of crossbar dimensions while the delay-constrained mapping (DeC) focuses primarily on minimizing the latency of mapping. We evaluate the proposed mappings against two state-of-the-art technology in-memory computing architectures, PLiM and MAGIC along with their automation flows (SIMPLE and COMPACT). ArC and DeC outperform state-of-the-art PLiM architecture by 1:46x and 4:3x on average in latency. ArC offers significantly lower area (on average 25:27x and 6:57x), while improving the area-delay product by 1:37x and 1:12x against two mapping approaches for MAGIC respectively. In contrast, DeC achieves average area (1:45x and 3:06x) and area-delay product (1:12x and 6:36x) improvements over the mapping approaches for MAGIC architecture respectively. The proposed mapping techniques allow a variety of runtime efficiency trade-offs.
引用
收藏
页码:734 / 748
页数:15
相关论文
共 56 条
[1]   A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing [J].
Ahn, Junwhan ;
Hong, Sungpack ;
Yoo, Sungjoo ;
Mutlu, Onur ;
Choi, Kiyoung .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :105-117
[2]   PIM-Enabled Instructions: A Low-Overhead, Locality-Aware Processing-in-Memory Architecture [J].
Ahn, Junwhan ;
Yoo, Sungjoo ;
Mutlu, Onur ;
Choi, Kiyoung .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :336-348
[3]   Majority-Inverter Graph: A New Paradigm for Logic Optimization [J].
Amaru, Luca ;
Gaillardon, Pierre-Emmanuel ;
De Micheli, Giovanni .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2016, 35 (05) :806-819
[4]   Majority-Inverter Graph: A Novel Data-Structure and Algorithms for Efficient Logic Optimization [J].
Amaru, Luca ;
Gaillardon, Pierre-Emmanuel ;
De Micheli, Giovanni .
2014 51ST ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2014,
[5]  
[Anonymous], [No title captured]
[6]  
[Anonymous], [No title captured]
[7]   JAFAR: Near-Data Processing for Databases [J].
Babarinsa, Oreoluwa ;
Idreos, Stratos .
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, :2069-2070
[8]   Study of Multi-level Characteristics for 3D Vertical Resistive Switching Memory [J].
Bai, Yue ;
Wu, Huaqiang ;
Wu, Riga ;
Zhang, Ye ;
Deng, Ning ;
Yu, Zhiping ;
Qian, He .
SCIENTIFIC REPORTS, 2014, 4
[9]  
Ben Hur R, 2017, ICCAD-IEEE ACM INT, P225, DOI 10.1109/ICCAD.2017.8203782
[10]  
Berkeley Logic Synthesis and Verification Group, ABC SYST SEQ SYNTH V