Memory System Designed for Multiply-Accumulate (MAC) Engine Based on Stochastic Computing

Cited by: 1

Authors:

Zhang, Xinyue

Wang, Yuan [1]

Zhang, Yawen

Song, Jiahao

Zhang, Zuodong

Cheng, Kaili

Wang, Runsheng

Huang, Ru

Affiliation:

[1] Peking Univ, Inst Microelect, Beijing 100871, Peoples R China

Source:

17TH IEEE INTERNATIONAL CONFERENCE ON IC DESIGN AND TECHNOLOGY (ICICDT 2019) | 2019

Funding:

National Natural Science Foundation of China;

Keywords:

convolutional neural networks (CNN); stochastic computing (SC); multiply-accumulate (MAC) engine; memory system;

DOI:

10.1109/icicdt.2019.8790878

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Convolutional neural networks (CNNs) achieve excellent performance on tasks such as image recognition and natural language processing, at the cost of high power consumption. Stochastic computing (SC) is an attractive paradigm for low-power applications: it performs arithmetic operations with simple logic and low hardware cost. However, conventional memory structures, designed and optimized for binary computing, incur extra data conversion costs, which significantly decrease energy efficiency. This paper therefore proposes a new memory system for an SC-based multiply-accumulate (MAC) engine used in CNNs, one that remains compatible with conventional memory systems. The overall energy consumption of the new computing structure is 0.91 pJ, which is 82.1% lower than that of the conventional structure, and the energy efficiency reaches 164.8 TOPS/W.
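As a rough illustration of why SC arithmetic needs only simple logic, the following Python sketch models a unipolar SC multiply-accumulate in software. It is a generic textbook-style model and not the memory system or MAC engine described in the paper: the unipolar encoding, the stream length of 4096, the pseudo-random source, and all function names (`to_stream`, `sc_multiply`, `sc_mac`) are assumptions made for this example. The `to_stream` step stands in for the binary-to-stochastic data conversion that the abstract identifies as a cost with conventional memory.

```python
import random


def to_stream(p, length, rng):
    """Encode a value p in [0, 1] as a unipolar stochastic bitstream.

    Each bit is 1 with probability p (assumed encoding, for illustration).
    """
    return [1 if rng.random() < p else 0 for _ in range(length)]


def from_stream(bits):
    """Decode a unipolar bitstream: the fraction of 1s estimates the value."""
    return sum(bits) / len(bits)


def sc_multiply(a_bits, b_bits):
    """Multiply two independent unipolar streams with a bitwise AND."""
    return [a & b for a, b in zip(a_bits, b_bits)]


def sc_mac(weights, activations, length=4096, seed=0):
    """Approximate sum(w * x) using AND-gate products of stochastic streams.

    The accumulation here is done on decoded values, which corresponds to
    counting 1s per product; this is a software model, not a hardware design.
    """
    rng = random.Random(seed)
    total = 0.0
    for w, x in zip(weights, activations):
        w_bits = to_stream(w, length, rng)
        x_bits = to_stream(x, length, rng)
        total += from_stream(sc_multiply(w_bits, x_bits))
    return total


if __name__ == "__main__":
    approx = sc_mac([0.5, 0.25], [0.5, 0.8])
    exact = 0.5 * 0.5 + 0.25 * 0.8
    print(f"stochastic MAC ~ {approx:.4f}, exact = {exact:.4f}")
```

The result is an estimate whose error shrinks as the stream length grows, so longer streams trade latency for accuracy.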


Pages: 4
