Energy-efficient computing-in-memory architecture for AI processor: device, circuit, architecture perspective

被引:14
作者
Chang, Liang [1 ]
Li, Chenglong [1 ]
Zhang, Zhaomin [1 ]
Xiao, Jianbiao [1 ]
Liu, Qingsong [1 ]
Zhu, Zhen [1 ]
Li, Weihang [1 ]
Zhu, Zixuan [1 ]
Yang, Siqi [1 ]
Zhou, Jun [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
基金
国家重点研发计划;
关键词
energy efficiency; computing-in-memory; non-volatile memory; test demonstrators; AI processor; SRAM MACRO; UNIT-MACRO; ACCELERATOR; COMPUTATION;
D O I
10.1007/s11432-021-3234-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An artificial intelligence (AI) processor is a promising solution for energy-efficient data processing, including health monitoring and image/voice recognition. However, data movements between compute part and memory induce memory wall and power wall challenges to the conventional computing architecture. Recently, the memory-centric architecture has been revised to solve the data movement issue, where the memory is equipped with the compute-capable memory technique, namely, computing-in-memory (CIM). In this paper, we analyze the requirement of AI algorithms on the data movement and low power requirement of AI processors. In addition, we introduce the story of CIM and implementation methodologies of CIM architecture. Furthermore, we present several novel solutions beyond traditional analog-digital mixed static random-access memory (SRAM)-based CIM architecture. Finally, recent CIM tape-out studies are listed and discussed.
引用
收藏
页数:15
相关论文
共 94 条
[31]   Practical Near-Data Processing for In-memory Analytics Frameworks [J].
Gao, Mingyu ;
Ayers, Grant ;
Kozyrakis, Christos .
2015 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION (PACT), 2015, :113-124
[32]   PROCESSING IN MEMORY - THE TERASYS MASSIVELY-PARALLEL PIM ARRAY [J].
GOKHALE, M ;
HOLMES, B ;
IOBST, K .
COMPUTER, 1995, 28 (04) :23-31
[33]  
Gonugondla SK, 2018, ISSCC DIG TECH PAP I, P490, DOI 10.1109/ISSCC.2018.8310398
[34]  
JEDEC, JESD235A2015
[35]   A 28 nm Configurable Memory (TCAM/BCAM/SRAM) Using Push-Rule 6T Bit Cell Enabling Logic-in-Memory [J].
Jeloka, Supreet ;
Akesh, Naveen Bharathwaj ;
Sylvester, Dennis ;
Blaauw, David .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2016, 51 (04) :1009-1021
[36]   A Programmable Heterogeneous Microprocessor Based on Bit-Scalable In-Memory Computing [J].
Jia, Hongyang ;
Valavi, Hossein ;
Tang, Yinqi ;
Zhang, Jintao ;
Verma, Naveen .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2020, 55 (09) :2609-2621
[37]   C3SRAM: An In-Memory-Computing SRAM Macro Based on Robust Capacitive Coupling Computing Mechanism [J].
Jiang, Zhewei ;
Yin, Shihui ;
Seo, Jae-Sun ;
Seok, Mingoo .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2020, 55 (07) :1888-1897
[38]   Accurate deep neural network inference using computational phase-change memory [J].
Joshi, Vinay ;
Le Gallo, Manuel ;
Haefeli, Simon ;
Boybat, Irem ;
Nandakumar, S. R. ;
Piveteau, Christophe ;
Dazzi, Martino ;
Rajendran, Bipin ;
Sebastian, Abu ;
Eleftheriou, Evangelos .
NATURE COMMUNICATIONS, 2020, 11 (01)
[39]   In-Datacenter Performance Analysis of a Tensor Processing Unit [J].
Jouppi, Norman P. ;
Young, Cliff ;
Patil, Nishant ;
Patterson, David ;
Agrawal, Gaurav ;
Bajwa, Raminder ;
Bates, Sarah ;
Bhatia, Suresh ;
Boden, Nan ;
Borchers, Al ;
Boyle, Rick ;
Cantin, Pierre-luc ;
Chao, Clifford ;
Clark, Chris ;
Coriell, Jeremy ;
Daley, Mike ;
Dau, Matt ;
Dean, Jeffrey ;
Gelb, Ben ;
Ghaemmaghami, Tara Vazir ;
Gottipati, Rajendra ;
Gulland, William ;
Hagmann, Robert ;
Ho, C. Richard ;
Hogberg, Doug ;
Hu, John ;
Hundt, Robert ;
Hurt, Dan ;
Ibarz, Julian ;
Jaffey, Aaron ;
Jaworski, Alek ;
Kaplan, Alexander ;
Khaitan, Harshit ;
Killebrew, Daniel ;
Koch, Andy ;
Kumar, Naveen ;
Lacy, Steve ;
Laudon, James ;
Law, James ;
Le, Diemthu ;
Leary, Chris ;
Liu, Zhuyuan ;
Lucke, Kyle ;
Lundin, Alan ;
MacKean, Gordon ;
Maggiore, Adriana ;
Mahony, Maire ;
Miller, Kieran ;
Nagarajan, Rahul ;
Narayanaswami, Ravi .
44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, :1-12
[40]   A Multi-Functional In-Memory Inference Processor Using a Standard 6T SRAM Array [J].
Kang, Mingu ;
Gonugondla, Sujan K. ;
Patil, Ameya ;
Shanbhag, Naresh R. .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2018, 53 (02) :642-655