Approximate Adder Tree Design with Sparsity-Aware Encoding and In-Memory Swapping for SRAM-based Digital Compute-In-Memory Macros

被引:1
|
作者
Lin, Ming-Guang [1 ]
Wang, Jiing-Ping [1 ]
Chang, Cheng-Yang [1 ]
Wu, An-Yeu [1 ]
机构
[1] Natl Taiwan Univ, Grad Inst Elect Engn, Taipei, Taiwan
来源
2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024 | 2024年
关键词
Digital Compute-in-memory; deep learning accelerator; vision transformer; approximate adder tree;
D O I
10.1109/AICAS59952.2024.10595967
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
SRAM-based digital compute-in-memory (DCIM) is a promising solution to accelerate the extensive multiply-and-accumulate (MAC) operations in deep neural networks (DNNs). Despite its remarkable potential to enhance performance and efficiency, the adder tree circuits dominate the power and area. While directly replacing the exact adder tree with approximate circuits can greatly reduce the overheads, it also induces severe accuracy drop without a laborious retraining process. This work adopts an OR-based approximate adder tree as a baseline and proposes two techniques to recover the accuracy drop without retraining the model: sparsity-aware encoding to introduce bit sparsity in the most significant bits (MSBs) and an in-memory swapping mechanism for error mitigation. Validated on large-scale datasets with advanced vision transformer (ViT) models, our methods effectively restore the accuracy by 72.08% to 78.56% across various ViT models, eliminating the need for expert-driven retraining.
引用
收藏
页码:362 / 366
页数:5
相关论文
共 35 条
  • [1] A review of SRAM-based compute-in-memory circuits
    Yoshioka, Kentaro
    Ando, Shimpei
    Miyagi, Satomi
    Chen, Yung-Chin
    Zhang, Wenlun
    JAPANESE JOURNAL OF APPLIED PHYSICS, 2024, 63 (12)
  • [2] Variation-aware Design Methodology For SRAM-based Multi-bit Analog Compute-In-Memory Architecture
    Kushwaha, Dinesh
    Joshi, Rajiv, V
    Bulusu, Anand
    Dasgupta, Sudeb
    2024 22ND IEEE INTERREGIONAL NEWCAS CONFERENCE, NEWCAS 2024, 2024, : 243 - 247
  • [3] From macro to microarchitecture: reviews and trends of SRAM-based compute-in-memory circuits
    Zhaoyang ZHANG
    Jinwu CHEN
    Xi CHEN
    An GUO
    Bo WANG
    Tianzhu XIONG
    Yuyao KONG
    Xingyu PU
    Shengnan HE
    Xin SI
    Jun YANG
    Science China(Information Sciences), 2023, 66 (10) : 42 - 60
  • [4] From macro to microarchitecture: reviews and trends of SRAM-based compute-in-memory circuits
    Zhang, Zhaoyang
    Chen, Jinwu
    Chen, Xi
    Guo, An
    Wang, Bo
    Xiong, Tianzhu
    Kong, Yuyao
    Pu, Xingyu
    He, Shengnan
    Si, Xin
    Yang, Jun
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (10)
  • [5] A Novel CNFET SRAM-Based Compute-In-Memory for BNN Considering Chirality and Nanotubes
    Kim, Youngbae
    Alnatsheh, Nader
    Yadav, Nandakishor
    Cho, Jaeik
    Jo, Heeyoung
    Choi, Kyuwon Ken
    ELECTRONICS, 2024, 13 (11)
  • [6] SRAM-Based Hybrid Analog Compute-In-memory Architecture to Enhance the Signal Margin
    Kushwaha, Dinesh
    Joshi, Rajiv V.
    Dasgupta, Sudeb
    Bulusu, Anand
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [7] A 35.5-127.2 TOPS/W Dynamic Sparsity-Aware Reconfigurable-Precision Compute-in-Memory SRAM Macro for Machine Learning
    Ali, Mustafa
    Chakraborty, Indranil
    Saxena, Utkarsh
    Agrawal, Amogh
    Ankit, Aayush
    Roy, Kaushik
    IEEE SOLID-STATE CIRCUITS LETTERS, 2021, 4 : 129 - 132
  • [8] Design and Analysis of an Energy-efficient Duo-Core SRAM-based Compute-in-Memory Accelerator
    Chiou, Lih-Yih
    Shih, Hong-Ming
    Hsu, Shun-Hsiu
    Sheng, Zu-Cheng
    Chang, Soon-Jyh
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [9] Design Challenges and Methodology of High-Performance SRAM-Based Compute-in-Memory for AI Edge Devices
    Wang, Yufei
    Zhou, Yongliang
    Wang, Bo
    Xiong, Tianzhu
    Kong, Yuyao
    Si, Xin
    2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 47 - 52
  • [10] An Energy-Efficient SRAM-based Charge Domain Compute In-Memory Architecture
    Singla, Anmol
    Kushwaha, Dinesh
    Aman, Gulyawar
    Chakraborty, Shubhradip
    Goel, Abhishek
    Bulusu, Anand
    Dasgupta, Sudeb
    2024 22ND IEEE INTERREGIONAL NEWCAS CONFERENCE, NEWCAS 2024, 2024, : 233 - 237