Heterogeneous Accelerator Design for Multi-DNN Workloads via Heuristic Optimization

被引:0
|
作者
Balaskas, Konstantinos [1 ]
Khdr, Heba [1 ]
Sikal, Mohammed Bakr [1 ]
Kreb, Fabian [1 ]
Siozios, Kostas [2 ]
Becker, Jurgen [1 ]
Henkel, Jorg [1 ]
机构
[1] Karlsruhe Inst Technol, Chair Embedded Syst, G-76131 Karlsruhe, Germany
[2] Aristotle Univ Thessaloniki, Dept Phys, Thessaloniki 54124, Greece
关键词
Runtime; Annealing; Accuracy; Artificial neural networks; Simulated annealing; Structural engineering; Artificial intelligence; Optimization; Arithmetic; AI accelerators; deep learning; electronic design automation; neural network hardware; simulated annealing;
D O I
10.1109/LES.2024.3443628
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The significant advancements of deep neural networks (DNNs) in a wide range of application domains have spawned the need for more specialized, sophisticated solutions in the form of multi-DNN workloads. Heterogeneous DNN accelerators have emerged as an elegant solution to tackle the workloads' inherent diversity, achieving significant improvements compared to homogeneous solutions. However, utilizing off-the-shelf architectures provides suboptimal adaptability to given workloads, whereas custom design approaches offer limited heterogeneity, and thus reduced gains. In this letter, we combat these shortcomings and propose an exploration-based framework to holistically design heterogeneous accelerators, tailored for multi-DNN workloads. Our framework is workload-agnostic and leverages architectural heterogeneity to its full potential, by integrating low-precision arithmetic and custom structural parameters. We explore the formed design space, targeting to minimize the system's energy-delay product (EDP) via heuristic techniques. Our proposed accelerators achieve, on average, a significant 5.5x reduction in EDP compared to the state of the art across various multi-DNN workloads.
引用
收藏
页码:317 / 320
页数:4
相关论文
共 50 条
  • [11] Generalized MultiAmdahl: Optimization of Heterogeneous Multi-Accelerator SoC
    Morad, Amir
    Morad, Tomer Y.
    Yavits, Leonid
    Ginosar, Ran
    Weiser, Uri
    IEEE COMPUTER ARCHITECTURE LETTERS, 2014, 13 (01) : 37 - 40
  • [12] Design Space Exploration of Heterogeneous-Accelerator SoCs with Hyperparameter Optimization
    Cong, Thanh
    Charot, Francois
    2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2021, : 338 - 343
  • [13] A Novel DNN Training Framework via Data Sampling and Multi-Task Optimization
    Zhang, Boyu
    Qin, A. K.
    Pan, Hong
    Sellis, Timos
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [14] A meta-heuristic based multi objective optimization for load distribution in cloud data center under varying workloads
    Shashank Kumar Mishra
    R. Manjula
    Cluster Computing, 2020, 23 : 3079 - 3093
  • [15] A meta-heuristic based multi objective optimization for load distribution in cloud data center under varying workloads
    Mishra, Shashank Kumar
    Manjula, R.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (04): : 3079 - 3093
  • [16] A Case for Efficient Accelerator Design Space Exploration via Bayesian Optimization
    Reagen, Brandon
    Hernandez-Lobato, Jose Miguel
    Adolf, Robert
    Gelbart, Michael
    Whatmough, Paul
    Wei, Gu-Yeon
    Brooks, David
    2017 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2017,
  • [17] Multi-Accelerator Neural Network Inference via TensorRT in Heterogeneous Embedded Systems
    Zhou, Yuxiao
    Guo, Zhishan
    Dong, Zheng
    Yang, Kecheng
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 463 - 472
  • [18] Heterogeneous Scalable Multi-languages Optimization via Simulation
    Cordasco, Gennaro
    D'Auria, Matteo
    Spagnuolo, Carmine
    Scarano, Vittorio
    METHODS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, 2018, 946 : 151 - 167
  • [19] HEURISTIC APPROACH FOR HETEROGENEOUS REDUNDANCY OPTIMIZATION IN MULTI-STATE SERIES-PARALLEL SYSTEM
    Gupta, Rashika
    Agarwal, Manju
    INTERNATIONAL JOURNAL OF RELIABILITY QUALITY & SAFETY ENGINEERING, 2007, 14 (04): : 327 - 359
  • [20] Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning
    Russo, Enrico
    Blanco, Francesco Giulio
    Palesi, Maurizio
    Ascia, Giuseppe
    Patti, Davide
    Catania, Vincenzo
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,