Deep learning accelerators: a case study with MAESTRO

被引：5

作者：

Bolhasani, Hamidreza ^{[1
]}

Jassbi, Somayyeh Jafarali ^{[1
]}

机构：

[1] Islamic Azad Univ, Sci & Res Branch, Dept Comp Engn, Tehran, Iran

来源：

JOURNAL OF BIG DATA | 2020年 / 7卷 / 01期

关键词：

Deep learning; Convolutional neural networks; Deep neural networks; Hardware accelerator; Deep learning accelerator;

D O I：

10.1186/s40537-020-00377-8

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In recent years, deep learning has become one of the most important topics in computer sciences. Deep learning is a growing trend in the edge of technology and its applications are now seen in many aspects of our life such as object detection, speech recognition, natural language processing, etc. Currently, almost all major sciences and technologies are benefiting from the advantages of deep learning such as high accuracy, speed and flexibility. Therefore, any efforts in improving performance of related techniques is valuable. Deep learning accelerators are considered as hardware architecture, which are designed and optimized for increasing speed, efficiency and accuracy of computers that are running deep learning algorithms. In this paper, after reviewing some backgrounds on deep learning, a well-known accelerator architecture named MAERI (Multiply-Accumulate Engine with Reconfigurable interconnects) is investigated. Performance of a deep learning task is measured and compared in two different data flow strategies: NLR (No Local Reuse) and NVDLA (NVIDIA Deep Learning Accelerator), using an open source tool called MAESTRO (Modeling Accelerator Efficiency via Spatio-Temporal Resource Occupancy). Measured performance indicators of novel optimized architecture, NVDLA shows higher L1 and L2 computation reuse, and lower total runtime (cycles) in comparison to the other one.

引用

页数：11

共 50 条

[1] Deep learning accelerators: a case study with MAESTRO
Hamidreza Bolhasani
Somayyeh Jafarali Jassbi
Journal of Big Data, 7
[2] FPGA-Based Accelerators of Deep Learning Networks for Learning and Classification: A Review
Shawahna, Ahmad
Sait, Sadiq M.
El-Maleh, Aiman
IEEE ACCESS, 2019, 7 : 7823 - 7859
[3] Exploiting deep learning accelerators for neuromorphic workloads
Sun, Pao-Sheng Vincent
Titterton, Alexander
Gopiani, Anjlee
Santos, Tim
Basu, Arindam
Lu, Wei D.
Eshraghian, Jason K.
NEUROMORPHIC COMPUTING AND ENGINEERING, 2024, 4 (01):
[4] Deep Learning Accelerators' Configuration Space Exploration Effect on Performance and Resource Utilization: A Gemmini Case Study
Gookyi, Dennis Agyemanh Nana
Lee, Eunchong
Kim, Kyungho
Jang, Sung-Joon
Lee, Sang-Seol
SENSORS, 2023, 23 (05)
[5] AdequateDL: Approximating Deep Learning Accelerators
Sentieys, Olivier
Filip, Silviu
Briand, David
Novo, David
Dupuis, Etienne
O'Connor, Ian
Bosio, Alberto
2021 24TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2021, : 37 - 40
[6] The Progress and Trends of FPGA-Based Accelerators in Deep Learning
Wu Y.-X.
Liang K.
Liu Y.
Cui H.-M.
Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (11): : 2461 - 2480
[7] Assembly language and assembler for deep learning accelerators
Lan H.
Wu L.
Han D.
Du Z.
High Technology Letters, 2019, 25 (04): : 386 - 394
[8] Deep Learning Case Study for Automatic Bird Identification
Niemi, Juha
Tanttu, Juha T.
APPLIED SCIENCES-BASEL, 2018, 8 (11):
[9] Assembly language and assembler for deep learning accelerators
兰慧盈
Wu Linyang
Han Dong
Du Zidong
HighTechnologyLetters, 2019, 25 (04) : 386 - 394
[10] EDLAB: A Benchmark for Edge Deep Learning Accelerators
Kong, Hao
Huai, Shuo
Liu, Di
Zhang, Lei
Chen, Hui
Zhu, Shien
Li, Shiqing
Liu, Weichen
Rastogi, Manu
Subramaniam, Ravi
Athreya, Madhu
Lewis, M. Anthony
IEEE DESIGN & TEST, 2022, 39 (03) : 8 - 17

← 1 2 3 4 5 →