Heterogeneous Accelerator Design for Multi-DNN Workloads via Heuristic Optimization

被引:0
|
作者
Balaskas, Konstantinos [1 ]
Khdr, Heba [1 ]
Sikal, Mohammed Bakr [1 ]
Kreb, Fabian [1 ]
Siozios, Kostas [2 ]
Becker, Jurgen [1 ]
Henkel, Jorg [1 ]
机构
[1] Karlsruhe Inst Technol, Chair Embedded Syst, G-76131 Karlsruhe, Germany
[2] Aristotle Univ Thessaloniki, Dept Phys, Thessaloniki 54124, Greece
关键词
Runtime; Annealing; Accuracy; Artificial neural networks; Simulated annealing; Structural engineering; Artificial intelligence; Optimization; Arithmetic; AI accelerators; deep learning; electronic design automation; neural network hardware; simulated annealing;
D O I
10.1109/LES.2024.3443628
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The significant advancements of deep neural networks (DNNs) in a wide range of application domains have spawned the need for more specialized, sophisticated solutions in the form of multi-DNN workloads. Heterogeneous DNN accelerators have emerged as an elegant solution to tackle the workloads' inherent diversity, achieving significant improvements compared to homogeneous solutions. However, utilizing off-the-shelf architectures provides suboptimal adaptability to given workloads, whereas custom design approaches offer limited heterogeneity, and thus reduced gains. In this letter, we combat these shortcomings and propose an exploration-based framework to holistically design heterogeneous accelerators, tailored for multi-DNN workloads. Our framework is workload-agnostic and leverages architectural heterogeneity to its full potential, by integrating low-precision arithmetic and custom structural parameters. We explore the formed design space, targeting to minimize the system's energy-delay product (EDP) via heuristic techniques. Our proposed accelerators achieve, on average, a significant 5.5x reduction in EDP compared to the state of the art across various multi-DNN workloads.
引用
收藏
页码:317 / 320
页数:4
相关论文
共 50 条