RQNoC: A Resilient Quality-of-Service Network-on-Chip with Service Redirection

被引:1
作者
Malek, Alirad [1 ]
Sourdis, Ioannis [1 ]
Tzilis, Stavros [1 ]
He, Yifan [2 ]
Rauwerda, Gerard [2 ]
机构
[1] Chalmers Univ Technol, Dept Comp Sci & Engn, SE-41296 Gothenburg, Sweden
[2] Recore Syst, POB 77, NL-7500 AB Enschede, Netherlands
关键词
Design; Reliability; Performance; Fault tolerance; network-on-chip; quality-of-service; ROUTING ALGORITHM; FAULT-TOLERANCE; ARCHITECTURE; CHALLENGES; NOC;
D O I
10.1145/2846097
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we describe RQNoC, a service-oriented Network-on-Chip (NoC) resilient to permanent faults. We characterize the network resources based on the particular service that they support and, when faulty, bypass them, allowing the respective traffic class to be redirected. We propose two alternatives for service redirection, each having different advantages and disadvantages. The first one, Service Detour, uses longer alternative paths through resources of the same service to bypass faulty network parts, keeping traffic classes isolated. The second approach, Service Merge, uses resources of other services providing shorter paths but allowing traffic classes to interfere with each other. The remaining network resources that are common for all services employ additional mechanisms for tolerating faults. Links tolerate faults using additional spare wires combined with a flit-shifting mechanism, and the router control is protected with Triple-Modular-Redundancy (TMR). The proposed RQNoC network designs are implemented in 65nm technology and evaluated in terms of performance, area, power consumption, and fault tolerance. Service Detour requires 9% more area and consumes 7.3% more power compared to a baseline network, not tolerant to faults. Its packet latency and throughput is close to the fault-free performance at low-fault densities, but fault tolerance and performance drop substantially for 8 or more network faults. Service Merge requires 22% more area and 27% more power than the baseline and has a 9% slower clock. Compared to a fault-free network, a Service Merge RQNoC with up to 32 faults has increased packet latency up to 1.5 to 2.4x and reduced throughput to 70% or 50%. However, it delivers substantially better fault tolerance, having a mean network connectivity above 90% even with 32 network faults versus 41% of a Service Detour network. Combining Serve Merge and Service Detour improves fault tolerance, further sustaining a higher number of network faults and reduced packet latency.
引用
收藏
页数:25
相关论文
共 32 条
[1]  
Ali M, 2007, INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, P1027
[2]  
[Anonymous], 2011, 2011 DESIGN AUTOMATI, DOI DOI 10.1109/DATE.2011.5763112
[3]   QNoC: QoS architecture and design process for network on chip [J].
Bolotin, E ;
Cidon, I ;
Ginosar, R ;
Kolodny, A .
JOURNAL OF SYSTEMS ARCHITECTURE, 2004, 50 (2-3) :105-128
[4]   Designing reliable systems from unreliable components: The challenges of transistor variability and degradation [J].
Borkar, S .
IEEE MICRO, 2005, 25 (06) :10-16
[5]  
Chaochao Feng, 2010, Proceedings 2010 IEEE International SOC Conference (SOCC 2010), P441, DOI 10.1109/SOCC.2010.5784672
[6]  
Cheng Liu, 2011, 2011 16th Asia and South Pacific Design Automation Conference, ASP-DAC 2011, P437, DOI 10.1109/ASPDAC.2011.5722230
[7]   Trends and challenges in VLSI circuit reliability [J].
Constantinescu, C .
IEEE MICRO, 2003, 23 (04) :14-19
[8]   BulletProof: A defect-tolerant CMP switch architecture [J].
Constantinides, Kypros ;
Plaza, Stephen ;
Blome, Jason ;
Zhang, Bin ;
Bertacco, Valeria ;
Mahlke, Scott ;
Austin, Todd ;
Orshansky, Michael .
TWELFTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2006, :3-+
[9]   A Reliable Routing Architecture and Algorithm for NoCs [J].
DeOrio, Andrew ;
Fick, David ;
Bertacco, Valeria ;
Sylvester, Dennis ;
Blaauw, David ;
Hu, Jin ;
Chen, Gregory .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2012, 31 (05) :726-739
[10]   Virtualizing Virtual Channels for Increased Network-on-Chip Robustness and Upgradeability [J].
Evripidou, Marios ;
Nicopoulos, Chrysostomos ;
Soteriou, Vassos ;
Kim, Jongman .
2012 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI), 2012, :21-26