A High-Level Modeling Framework for Estimating Hardware Metrics of CNN Accelerators

Cited by: 9
Authors
Juracy, Leonardo Rezende [1]
Moreira, Matheus Trevisan [2]
Amory, Alexandre de Morais [3]
Hampel, Alexandre F. [1]
Moraes, Fernando Gehm [1]
Affiliations
[1] Pontifical Catholic Univ Rio Grande Sul PUCRS, Sch Technol, BR-90619900 Porto Alegre, RS, Brazil
[2] Chronos Tech, San Diego, CA 92122 USA
[3] TeCIP Inst, Scuola Super SantAnna, I-56124 Pisa, Italy
Keywords
Convolutional neural networks; Space exploration; Estimation; Computer architecture; Training; Hardware acceleration; Convolution; CNN; convolution hardware accelerator; system simulator; PPA; design space exploration;
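DOI
10.1109/TCSI.2021.3104644
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification codes
0808 ; 0809 ;
Abstract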
GPUs became the reference platform for both the training and inference phases of Convolutional Neural Networks (CNNs) because their architecture is well suited to CNN operators. However, GPUs are power-hungry architectures. One path to deploying CNNs in energy-constrained devices is to adopt hardware accelerators for the inference phase. Design space exploration of CNNs using standard approaches, such as RTL, is limited due to their complexity. Thus, designers need frameworks that enable design space exploration and deliver accurate hardware estimation metrics for deploying CNNs. This work proposes a framework to explore the CNN design space, providing power, performance, and area (PPA) estimations. The heart of the framework is a system simulator. The system simulator front-end is TensorFlow, and the back-end consists of performance estimations obtained from the physical synthesis of hardware accelerators, not only from components like multipliers and adders. The first set of results evaluates the CNN accuracy using integer quantization, the accelerators' PPA after physical synthesis, and the benefits of using a system simulator. These results allow a rich design space exploration, enabling the selection of the best set of CNN parameters to meet the design constraints.
Pages: 4783-4795
Page count: 13
Related papers
39 in total
  • [31] Hybrid Graph Representation and Learning Framework for High-Level Synthesis Design Space Exploration
    Taghipour, Pouya
    Granger, Eric
    Blaquiere, Yves
    IEEE ACCESS, 2024, 12 : 189574 - 189589
  • [32] AxHLS: Design Space Exploration and High-Level Synthesis of Approximate Accelerators using Approximate Functional Units and Analytical Models
    Castro-Godinez, Jorge
    Mateus-Vargas, Julian
    Shafique, Muhammad
    Henkel, Joerg
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020
  • [33] High-level modeling of communication-centric applications: Extensions to a system-level design and virtual prototyping tool
    Genius, Daniela
    Apvrille, Ludovic
    Li, Letitia W.
    MICROPROCESSORS AND MICROSYSTEMS, 2019, 67 : 117 - 130
  • [34] LP-HLS: Automatic power-intent generation for high-level synthesis based hardware implementation flow
    Qamar, Affaq
    Bin Muslim, Fahad
    Iqbal, Javed
    Lavagno, Luciano
    MICROPROCESSORS AND MICROSYSTEMS, 2017, 50 : 26 - 38
  • [35] Optimum Network/Framework Selection from High-Level Specifications in Embedded Deep Learning Vision Applications
    Velasco-Montero, Delia
    Fernandez-Berni, Jorge
    Carmona-Galan, Ricardo
    Rodriguez-Vazquez, Angel
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2018, 2018, 11182 : 369 - 379
  • [36] Teaching Hardware Implementation of Neural Networks using High-Level Synthesis in Less Than Four Hours for Engineering Education of Intelligent Embedded Computing
    Huang, Nan-Sheng
    Braun, Jan-Matthias
    Larsen, Jorgen Christian
    Manoonpong, Poramate
    2019 20TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2019 : 185 - 191
  • [37] Enhanced Reinforcement Learning Method Combining One-Hot Encoding-Based Vectors for CNN-Based Alternative High-Level Decisions
    Gu, Bonwoo
    Sung, Yunsick
    APPLIED SCIENCES-BASEL, 2021, 11 (03) : 1 - 15
  • [38] An Exploration Framework for Efficient High-Level Synthesis of Support Vector Machines: Case Study on ECG Arrhythmia Detection for Xilinx Zynq SoC
    Tsoutsouras, Vasileios
    Koliogeorgi, Konstantina
    Xydis, Sotirios
    Soudris, Dimitrios
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, 88 (02) : 127 - 147