A Design Space Exploration Framework for Deployment of Resource-Constrained Deep Neural Networks

Authors
Zhang, Yan [1]
Pan, Lei [1]
Berkowitz, Phillip [2]
Lee, Mun Wai [2]
Riggan, Benjamin [3]
Bhattacharyya, Shuvra S. [1]
Affiliations
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Intelligent Automat, Rockville, MD 20855 USA
[3] Univ Nebraska, Lincoln, NE 68588 USA
Source
REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2024 | 2024 / Vol. 13034
Keywords
Design space exploration; Deep neural networks; Dataflow modeling; Resource-constrained deployment; Particle swarm optimization
DOI
10.1117/12.3014043
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recent years have witnessed great progress in the development of deep neural networks (DNNs), which has led to growing interest in deploying DNNs in resource-constrained environments such as network-edge and edge-cloud environments. To address objectives of efficient DNN inference, numerous approaches as well as specialized platforms have been designed for inference acceleration. The flexibility and diverse capabilities offered by these approaches and platforms result in large design spaces with complex trade-offs for DNN deployment. Relevant objectives involved in these trade-offs include inference accuracy, latency, throughput, memory requirements, and energy consumption. Tools that can effectively assist designers in deriving efficient DNN configurations for specific deployment scenarios are therefore needed. In this work, we present a design space exploration framework for this purpose. In the proposed framework, DNNs are represented as dataflow graphs using a lightweight-dataflow-based modeling tool, and schedules (strategies for managing processing resources across different DNN tasks) are modeled in a formal, abstract form using dataflow methods as well. The dataflow-based application and schedule representations are integrated systematically with a multiobjective particle swarm optimization (PSO) strategy, which enables efficient evaluation of implementation trade-offs and derivation of Pareto fronts involving alternative deployment configurations. Experimental results using different DNN architectures demonstrate the effectiveness of our proposed framework in exploring design spaces for DNN deployment.
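To make the notion of a Pareto front over deployment configurations concrete, the following is a minimal sketch of non-dominated filtering. It is not the authors' framework and omits the PSO search entirely; the configuration names, the three objectives (latency, memory, error rate), and all numeric values are hypothetical and chosen only for illustration. Every objective is assumed to be minimized.

```python
from typing import Dict, List, Tuple

# One tuple of objective values per configuration; all objectives are minimized.
Objectives = Tuple[float, ...]


def dominates(a: Objectives, b: Objectives) -> bool:
    """True if a is no worse than b in every objective and strictly better in one."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(candidates: Dict[str, Objectives]) -> List[str]:
    """Return the names of the non-dominated candidates, in insertion order."""
    front = []
    for name, objs in candidates.items():
        dominated = any(
            dominates(other, objs)
            for other_name, other in candidates.items()
            if other_name != name
        )
        if not dominated:
            front.append(name)
    return front


# Hypothetical configurations: (latency in ms, memory in MB, error rate).
configs = {
    "fp32_single_thread": (40.0, 100.0, 0.08),
    "int8_single_thread": (25.0, 30.0, 0.10),
    "fp32_two_threads": (22.0, 110.0, 0.08),
    "int8_slow_schedule": (30.0, 30.0, 0.10),
}

front = pareto_front(configs)
print(front)
```

In this made-up data, `int8_slow_schedule` is dropped because `int8_single_thread` matches it on memory and error rate while having lower latency; the remaining three configurations each win on at least one objective and so represent genuine trade-offs.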
Pages: 12