Architecture of neural processing unit for deep neural networks

Cited by: 15

Authors:

Lee, Kyuho J. [1]

Affiliations:

[1] Ulsan Natl Inst Sci & Technol, Artificial Intelligence Grad Sch, Sch Elect & Comp Engn, Ulsan, South Korea

Source:

HARDWARE ACCELERATOR SYSTEMS FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING | 2021 / Vol. 122

Funding:

National Research Foundation, Singapore;

Keywords:

ACCELERATOR;

DOI:

10.1016/bs.adcom.2020.11.001

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Deep Neural Networks (DNNs) have become a promising way to bring AI into our daily lives through self-driving cars, smartphones, games, drones, and similar applications. In most cases, DNNs have been accelerated by servers equipped with numerous computing engines, e.g., GPUs, but recent technological advances demand energy-efficient DNN acceleration as modern applications move down to mobile computing nodes. Therefore, Neural Processing Unit (NPU) architectures dedicated to energy-efficient DNN acceleration have become essential. Although the training phase of a DNN requires precise number representations, many researchers have shown that a smaller bit-precision is sufficient for inference at low power consumption. This has led hardware architects to investigate energy-efficient NPU architectures with diverse HW-SW co-optimization schemes for inference. This chapter reviews several design examples of the latest NPU architectures for DNNs, focusing mainly on inference engines. It also discusses new architectural research on neuromorphic computers and processing-in-memory architectures, and offers perspectives on future research directions.

Pages: 217-245

Page count: 29

50 records in total

[21] NeuroUnlock: Unlocking the Architecture of Obfuscated Deep Neural Networks
Ahmadi, Mahya Morid
Alrahis, Lilas
Colucci, Alessio
Sinanoglu, Ozgur
Shafique, Muhammad
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022
[22] Efficient Softmax Hardware Architecture for Deep Neural Networks
Du, Gaoming
Tian, Chao
Li, Zhenmin
Zhang, Duoli
Yin, Yongsheng
Ouyang, Yiming
GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 75 - 80
[23] Deep generative neural networks for spectral image processing
Mishra, Puneet
ANALYTICA CHIMICA ACTA, 2022, 1191
[24] Sensory processing and categorization in cortical and deep neural networks
Pinotsis, Dimitris A.
Siegel, Markus
Miller, Earl K.
NEUROIMAGE, 2019, 202
[25] Efficient Processing of Deep Neural Networks: A Tutorial and Survey
Sze, Vivienne
Chen, Yu-Hsin
Yang, Tien-Ju
Emer, Joel S.
PROCEEDINGS OF THE IEEE, 2017, 105 (12) : 2295 - 2329
[26] SYNAPTIC DEPRESSION IN DEEP NEURAL NETWORKS FOR SPEECH PROCESSING
Zhang, Wenhao
Li, Hanyu
Yang, Minda
Mesgarani, Nima
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5865 - 5869
[27] A General-Purpose Neural Architecture Search Algorithm for Building Deep Neural Networks
Zito, Francesco
Cutello, Vincenzo
Pavone, Mario
METAHEURISTICS, MIC 2024, PT II, 2024, 14754 : 126 - 141
[28] Neural Architecture Search Using Deep Neural Networks and Monte Carlo Tree Search
Wang, Linnan
Zhao, Yiyang
Jinnai, Yuu
Tian, Yuandong
Fonseca, Rodrigo
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9983 - 9991
[29] Cellular Neural Networks Simulation on a Parallel Graphics Processing Unit
Fernandez, Andres
Martin, Ruben San
Farguell, Enric
Pazienza, Giovanni Egidio
2008 11TH INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS, 2008, : 208 - +
[30] Memory-Centric Architecture of Neural Processing Unit for Edge Device
Lee, Eunchong
Sung, Minyong
Jang, Sung-Joon
Park, Jonghee
Lee, Sang-Seol
18TH INTERNATIONAL SOC DESIGN CONFERENCE 2021 (ISOCC 2021), 2021, : 240 - 241