Energy-efficient computing-in-memory architecture for AI processor: device, circuit, architecture perspective

被引:13
作者
Chang, Liang [1 ]
Li, Chenglong [1 ]
Zhang, Zhaomin [1 ]
Xiao, Jianbiao [1 ]
Liu, Qingsong [1 ]
Zhu, Zhen [1 ]
Li, Weihang [1 ]
Zhu, Zixuan [1 ]
Yang, Siqi [1 ]
Zhou, Jun [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
基金
国家重点研发计划;
关键词
energy efficiency; computing-in-memory; non-volatile memory; test demonstrators; AI processor; SRAM MACRO; UNIT-MACRO; ACCELERATOR; COMPUTATION;
D O I
10.1007/s11432-021-3234-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An artificial intelligence (AI) processor is a promising solution for energy-efficient data processing, including health monitoring and image/voice recognition. However, data movements between compute part and memory induce memory wall and power wall challenges to the conventional computing architecture. Recently, the memory-centric architecture has been revised to solve the data movement issue, where the memory is equipped with the compute-capable memory technique, namely, computing-in-memory (CIM). In this paper, we analyze the requirement of AI algorithms on the data movement and low power requirement of AI processors. In addition, we introduce the story of CIM and implementation methodologies of CIM architecture. Furthermore, we present several novel solutions beyond traditional analog-digital mixed static random-access memory (SRAM)-based CIM architecture. Finally, recent CIM tape-out studies are listed and discussed.
引用
收藏
页数:15
相关论文
共 94 条
  • [1] A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing
    Ahn, Junwhan
    Hong, Sungpack
    Yoo, Sungjoo
    Mutlu, Onur
    Choi, Kiyoung
    [J]. 2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, : 105 - 117
  • [2] A 55-nm, 1.0-0.4V, 1.25-pJ/MAC Time-Domain Mixed-Signal Neuromorphic Accelerator With Stochastic Synapses for Reinforcement Learning in Autonomous Mobile Robots
    Amaravati, Anvesha
    Bin Nasir, Saad
    Ting, Justin
    Yoon, Insik
    Raychowdhury, Arijit
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2019, 54 (01) : 75 - 87
  • [3] Amravati A, 2018, ISSCC DIG TECH PAP I ISSCC DIG TECH PAP I
  • [4] Ando K, 2017, SYMP VLSI CIRCUITS, pC24, DOI 10.23919/VLSIC.2017.8008533
  • [5] CONV-SRAM: An Energy-Efficient SRAM With In-Memory Dot-Product Computation for Low-Power Convolutional Neural Networks
    Biswas, Avishek
    Chandrakasan, Anantha P.
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2019, 54 (01) : 217 - 230
  • [6] Biswas A, 2018, ISSCC DIG TECH PAP I, P488, DOI 10.1109/ISSCC.2018.8310397
  • [7] Chang Chang L L, DES AUT TEST EUROPE, P384
  • [8] PXNOR-BNN: In/With Spin-Orbit Torque MRAM Preset-XNOR Operation-Based Binary Neural Networks
    Chang, Liang
    Ma, Xin
    Wang, Zhaohao
    Zhang, Youguang
    Xie, Yuan
    Zhao, Weisheng
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (11) : 2668 - 2679
  • [9] DASM: Data-Streaming-Based Computing in Nonvolatile Memory Architecture for Embedded System
    Chang, Liang
    Ma, Xin
    Wang, Zhaohao
    Zhang, Youguang
    Ding, Yufei
    Zhao, Weisheng
    Xie, Yuan
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (09) : 2046 - 2059
  • [10] Multi-Port 1R1W Transpose Magnetic Random Access Memory by Hierarchical Bit-Line Switching
    Chang, Liang
    Wang, Zhaohao
    Zhang, Youguang
    Zhao, Weisheng
    [J]. IEEE ACCESS, 2019, 7 : 110463 - 110471