A High-Level Modeling Framework for Estimating Hardware Metrics of CNN Accelerators

被引:9
作者
Juracy, Leonardo Rezende [1 ]
Moreira, Matheus Trevisan [2 ]
Amory, Alexandre de Morais [3 ]
Hampel, Alexandre F. [1 ]
Moraes, Fernando Gehm [1 ]
机构
[1] Pontifical Catholic Univ Rio Grande Sul PUCRS, Sch Technol, BR-90619900 Porto Alegre, RS, Brazil
[2] Chronos Tech, San Diego, CA 92122 USA
[3] TeCIP Inst, Scuola Super SantAnna, I-56124 Pisa, Italy
关键词
Convolutional neural networks; Space exploration; Estimation; Computer architecture; Training; Hardware acceleration; Convolution; CNN; convolution hardware accelerator; system simulator; PPA; design space exploration;
D O I
10.1109/TCSI.2021.3104644
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
GPUs became the reference platform for both training and inference phases of Convolutional Neural Networks (CNN) due to their tailored architecture to the CNN operators. However, GPUs are power-hungry architectures. A path to enable the deployment of CNNs in energy-constrained devices is adopting hardware accelerators for the inference phase. The design space exploration of CNNs using standard approaches, such as RTL, is limited due to their complexity. Thus, designers need frameworks enabling design space exploration that delivers accurate hardware estimation metrics to deploy CNNs. This work proposes a framework to explore CNNs design space, providing power, performance, and area (PPA) estimations. The heart of the framework is a system simulator. The system simulator front-end is TensorFlow, and the back-end is performance estimations obtained from the physical synthesis of hardware accelerators, not only from components like multipliers and adders. The first set of results evaluate the CNN accuracy using integer quantization, the accelerators PPA after physical synthesis, and the benefits of using a system simulator. These results allow a rich design space exploration, enabling selecting the best set of CNN parameters to meet the design constraints.
引用
收藏
页码:4783 / 4795
页数:13
相关论文
共 39 条
  • [21] USING CNN-BASED HIGH-LEVEL FEATURES FOR REMOTE SENSING SCENE CLASSIFICATION
    Fang, Zhengzheng
    Li, Wei
    Zou, Jinyi
    Du, Qian
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 2610 - 2613
  • [22] Improving Power & Latency Metrics for Hardware Trojan Detection during High Level Synthesis
    Shathanaa, R.
    Ramasubramanian, N.
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [23] Hardware-based nonlinear filtering and segmentation using high-level shading languages
    Viola, I
    Kanitsar, A
    Gröller, ME
    IEEE VISUALIZATION 2003, PROCEEDINGS, 2003, : 309 - 316
  • [24] High-level customization framework for application-specific NoC architectures
    Anagnostopoulos, Iraklis
    Bartzas, Alexandros
    Filippopoulos, Iason
    Soudris, Dimitrios
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2012, 16 (04) : 339 - 361
  • [25] ScaleHLS: A New Scalable High-Level Synthesis Framework on Multi-Level Intermediate Representation
    Ye, Hanchen
    Hao, Cong
    Cheng, Jianyi
    Jeong, Hyunmin
    Huang, Jack
    Neuendorffer, Stephen
    Chen, Deming
    2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 741 - 755
  • [26] Combining on-hardware prototyping and high-level simulation for DSE of multi-ASIP systems
    Meloni, Paolo
    Pomata, Sebastiano
    Raffo, Luigi
    Piscitelli, Roberta
    Pimentel, Andy D.
    2012 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS (SAMOS): ARCHITECTURES, MODELING AND SIMULATION, 2012, : 310 - 317
  • [27] Generation of Efficient Self-adaptive Hardware Polar Decoders Using High-Level Synthesis
    Delomier, Yann
    Le Gal, Bertrand
    Crenne, Jeremie
    Jego, Christophe
    PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2019), 2019, : 242 - 247
  • [28] A framework for high-level simulation and optimization of fine-grained reconfigurable architectures
    Pasha, Muhammad Adeel
    Farooq, Umer
    Siddiqui, Bilal
    SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2019, 95 (08): : 737 - 751
  • [29] High-Level Synthesis for Hardware/Software Co-Design of Distributed Smart Camera Systems
    Streit, Franz-Josef
    Letras, Martin
    Schid, Matthias
    Falk, Joachim
    Wildermann, Stefan
    Teich, Juergen
    11TH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS (ICDSC 2017), 2017, : 174 - 179
  • [30] A Semi-Supervised High-Level Feature Selection Framework for Road Centerline Extraction
    Liu, Ruyi
    Miao, Qiguang
    Zhang, Yi
    Gong, Maoguo
    Xu, Pengfei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (05) : 894 - 898