IntelliNoC: A Holistic Design Framework for Energy-Efficient and Reliable On-Chip Communication for Manycores

被引:24
作者
Wang, Ke [1 ]
Louri, Ahmed [1 ]
Karanth, Avinash [2 ]
Bunescu, Razvan [2 ]
机构
[1] George Washington Univ, Dept Elect & Comp Engn, Washington, DC 20037 USA
[2] Ohio Univ, Sch Elect Engn & Comp Sci, Athens, OH 45701 USA
来源
PROCEEDINGS OF THE 2019 46TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '19) | 2019年
关键词
Network-on-Chip (NoC); Reinforcement Learning; NoC Performance; Reliability; Energy-Efficiency; ROUTER ARCHITECTURE; POWER; MODEL; NETWORK;
D O I
10.1145/3307650.3322274
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As technology scales, Network-on-Chips (NoCs), currently being used for on-chip communication in manycore architectures, face several problems including high network latency, excessive power consumption, and low reliability. Simultaneously addressing these problems is proving to be difficult due to the explosion of the design space and the complexity of handling many trade-offs. In this paper, we propose IntelliNoC, an intelligent NoC design framework which introduces architectural innovations and uses reinforcement learning to manage the design complexity and simultaneously optimize performance, energy-efficiency, and reliability in a holistic manner. IntelliNoC integrates three NoC architectural techniques: (1) multi-function adaptive channels (MFACs) to improve energy-efficiency; (2) adaptive error detection/correction and re-transmission control to enhance reliability; and (3) a stress-relaxing bypass feature which dynamically powers off NoC components to prevent overheating and fatigue. To handle the complex dynamic interactions induced by these techniques, we train a dynamic control policy using Q-learning, with the goal of providing improved fault-tolerance and performance while reducing power consumption and area overhead. Simulation using PARSEC benchmarks shows that our proposed IntelliNoC design improves energy-efficiency by 67% and mean-time-to-failure (MTTF) by 77%, and decreases end-to-end packet latency by 32% and area requirements by 25% over baseline NoC architecture.
引用
收藏
页码:589 / 600
页数:12
相关论文
共 45 条
[1]  
Aisopos K., 2011, Proceedings 2011 International Conference on Parallel Architectures and Compilation Techniques (PACT), P298, DOI 10.1109/PACT.2011.61
[2]   A comprehensive model of PMOS NBTI degradation [J].
Alam, MA ;
Mahapatra, S .
MICROELECTRONICS RELIABILITY, 2005, 45 (01) :71-81
[3]  
[Anonymous], 2018, REINFORCEMENT LEARNI
[4]  
[Anonymous], IEEE T COMPUTERS
[5]  
[Anonymous], P 49 ANN IEEE ACM IN
[6]  
[Anonymous], P DES AUT TEST EUR C
[7]  
[Anonymous], 2011, P 5 ANN WORKSH MOD B
[8]  
[Anonymous], 2013, ACM SIGARCH COMPUT A, DOI DOI 10.1145/2508148.2485950
[9]  
[Anonymous], 56 DES AUT C DAC
[10]  
Bai Y., 2017, Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, P825