Design Space Exploration of Heterogeneous-Accelerator SoCs with Hyperparameter Optimization

被引:1
作者
Cong, Thanh [1 ]
Charot, Francois [1 ]
机构
[1] Univ Rennes, IRISA, CNRS, INRIA, Rennes, France
来源
2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC) | 2021年
关键词
Heterogeneous architecture design; System-on-chip; Hardware accelerators; Hyperparmeter optimization; Simulation;
D O I
10.1145/3394885.3431415
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Modern SoC systems consist of general-purpose processor cores augmented with large numbers of specialized accelerators. Building such systems requires a design flow allowing the design space to be explored at the system level with an appropriate strategy. In this paper, we describe a methodology allowing to explore the design space of power-performance heterogeneous SoCs by combining an architecture simulator (gem5-Aladdin) and a hyperparameter optimization method (Hyperopt). This methodology allows different types of parallelism with loop unrolling strategies and memory coherency interfaces to be swept. The flow has been applied to a convolutional neural network algorithm. We show that the most energy efficient architecture achieves a 2x to 4x improvement in energy-delay-product compared to an architecture without parallelism. Furthermore, the obtained solution is more efficient than commonly implemented architectures (Systolic, 2D-mapping, and Tiling). We also applied the methodology to find the optimal architecture including its coherency interface for a complex SoC made up of six accelerated-workloads. We show that a hybrid interface appears to be the most efficient; it reaches 22% and 12% improvement in energy-delay-product compared to just only using non-coherent and only LLC-coherent models, respectively.
引用
收藏
页码:338 / 343
页数:6
相关论文
共 24 条
[1]  
[Anonymous], 2009, 2009 4 INT DESIGN TE
[2]  
Bergstra J., 2013, PMLR, P115, DOI DOI 10.5555/3042817.3042832
[3]  
Bergstra J., 2011, P 25 INT C NEUR INF, P24, DOI DOI 10.5555/2986459.2986743
[4]   Determining Optimal Coherency Interface for Many-Accelerator SoCs Using Bayesian Optimization [J].
Bhardwaj, Kshitij ;
Havasi, Marton ;
Yao, Yuan ;
Brooks, David M. ;
Hernandez Lobato, Jose Miguel ;
Wei, Gu-Yeon .
IEEE COMPUTER ARCHITECTURE LETTERS, 2019, 18 (02) :119-123
[5]  
Binkert Nathan, 2011, Computer Architecture News, V39, P1, DOI 10.1145/2024716.2024718
[6]  
Chakradhar S, 2010, CONF PROC INT SYMP C, P247, DOI 10.1145/1816038.1815993
[7]   DianNao: A Small-Footprint High-Throughput Accelerator for Ubiquitous Machine-Learning [J].
Chen, Tianshi ;
Du, Zidong ;
Sun, Ninghui ;
Wang, Jia ;
Wu, Chengyong ;
Chen, Yunji ;
Temam, Olivier .
ACM SIGPLAN NOTICES, 2014, 49 (04) :269-283
[8]  
Cong JS, 2015, ICCAD-IEEE ACM INT, P380, DOI 10.1109/ICCAD.2015.7372595
[9]   Designing Application-Specific Heterogeneous Architectures from Performance Models [J].
Cong, Thanh ;
Charot, Francois .
2019 IEEE 13TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC 2019), 2019, :265-272
[10]   ShiDianNao: Shifting Vision Processing Closer to the Sensor [J].
Du, Zidong ;
Fasthuber, Robert ;
Chen, Tianshi ;
Ienne, Paolo ;
Li, Ling ;
Luo, Tao ;
Feng, Xiaobing ;
Chen, Yunji ;
Temam, Olivier .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :92-104