Memory System Designed for Multiply-Accumulate (MAC) Engine Based on Stochastic Computing

被引:1
|
作者
Zhang, Xinyue
Wang, Yuan [1 ]
Zhang, Yawen
Song, Jiahao
Zhang, Zuodong
Cheng, Kaili
Wang, Runsheng
Huang, Ru
机构
[1] Peking Univ, Inst Microelect, Beijing 100871, Peoples R China
来源
17TH IEEE INTERNATIONAL CONFERENCE ON IC DESIGN AND TECHNOLOGY (ICICDT 2019) | 2019年
基金
中国国家自然科学基金;
关键词
convolutional neural networks (CNN); stochastic computing (SC); multiply-accumulate (MAC) engine; memory system;
D O I
10.1109/icicdt.2019.8790878
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural network (CNN) achieves excellent performance on fascinating tasks such as image recognition and natural language processing at the cost of high power consumption. Stochastic computing (SC) is an attractive paradigm implemented in low power applications which performs arithmetic operations with simple logic and low hardware cost. However, conventional memory structure designed and optimized for binary computing leads to extra data conversion costs, which significantly decreases the energy efficiency. Therefore, a new memory system designed for SC-based multiply-accumulate (MAC) engine applied in CNN which is compatible with conventional memory system is proposed in this paper. As a result, the overall energy consumption of our new computing structure is 0.91pJ, which is reduced by 82.1% compared with the conventional structure, and the energy efficiency achieves 164.8 TOPS/W.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Silicon-Based Metastructure Optical Scattering Multiply-Accumulate Computation Chip
    Liu, Xu
    Zhu, Xudong
    Wang, Chunqing
    Cao, Yifan
    Wang, Baihang
    Ou, Hanwen
    Wu, Yizheng
    Mei, Qixun
    Zhang, Jialong
    Cong, Zhe
    Liu, Rentao
    NANOMATERIALS, 2022, 12 (13)
  • [22] A Novel Low Power and High Speed Multiply-Accumulate (MAC) Unit Design for Floating-Point Numbers
    Babu, N. Jithendra
    Sarma, Rajkumar
    2015 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES AND MANAGEMENT FOR COMPUTING, COMMUNICATION, CONTROLS, ENERGY AND MATERIALS (ICSTM), 2015, : 411 - 417
  • [23] A fully integrated reprogrammable memristor-CMOS system for efficient multiply-accumulate operations
    Cai, Fuxi
    Correll, Justin M.
    Lee, Seung Hwan
    Lim, Yong
    Bothra, Vishishtha
    Zhang, Zhengya
    Flynn, Michael P.
    Lu, Wei D.
    NATURE ELECTRONICS, 2019, 2 (07) : 290 - 299
  • [24] A Posit Based Multiply-accumulate Unit with Small Quire Size for Deep Neural Networks
    Nakahara Y.
    Masuda Y.
    Kiyama M.
    Amagasaki M.
    Iida M.
    IPSJ Transactions on System LSI Design Methodology, 2022, 15 : 16 - 19
  • [25] RETRACTED: A nano-scale design of a multiply-accumulate unit for digital signal processing based on quantum computing (Retracted Article)
    Ahmadpour, Seyed-Sajad
    Navimipour, Nima Jafari
    Yalcin, Senay
    Bakhshayeshi Avval, Danial
    Ul Ain, Noor
    OPTICAL AND QUANTUM ELECTRONICS, 2024, 56 (01)
  • [26] High-Performance Digital Filtering on Truncated Multiply-Accumulate Units in the Residue Number System
    Lyakhov, Pavel
    Valueva, Maria
    Valuev, Georgii
    Nagornov, Nikolai
    IEEE ACCESS, 2020, 8 (08): : 209181 - 209190
  • [27] A High-Accuracy Hardware-Efficient Multiply-Accumulate (MAC) Unit Based on Dual-Mode Truncation Error Compensation for CNNs
    Tang, Song-Nien
    Han, Yu-Shin
    IEEE ACCESS, 2020, 8 : 214716 - 214731
  • [28] Ferroelectric FET-Based Time-Mode Multiply-Accumulate Accelerator: Design and Analysis
    Rafiq, Musaib
    Kaur, Tanveer
    Gaidhane, Amol
    Chauhan, Yogesh Singh
    Sahay, Shubham
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2023, 70 (12) : 6613 - 6621
  • [29] Design and Analysis of High-Throughput Two-Cycle Multiply-Accumulate (MAC) Architectures for Fixed-Point Arithmetic
    Bhadra, Arijit
    Samui, Suman
    2022 IEEE CALCUTTA CONFERENCE, CALCON, 2022, : 267 - 272
  • [30] Architecture-Aware Design of a Decimation Filter Based on a Dual Wordlength Multiply-Accumulate Unit
    Lindahl, Erik
    Gustafsson, Oscar
    2008 42ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-4, 2008, : 1897 - 1901