Hardware-Software Collaborated Method for Soft-Error Tolerant MPSoC

Cited by: 7
Authors
Liu, Weichen [1]
Xu, Jiang [1]
Wang, Xuan [1]
Wang, Yu [2]
Zhang, Wei [3]
Ye, Yaoyao [1]
Wu, Xiaowen [1]
Nikdast, Mahdi [1]
Wang, Zhehui [1]
Affiliations
[1] Hong Kong University of Science and Technology, Hong Kong, People's Republic of China
[2] Tsinghua University, Beijing, People's Republic of China
[3] Nanyang Technological University, Singapore
Keywords
FAULT; SYSTEMS
DOI
10.1109/ISVLSI.2011.48
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Multiprocessor systems-on-chip (MPSoCs) are attractive platforms for embedded applications of growing complexity, because integrating a system or a complex subsystem on a single chip provides better performance, better energy efficiency, and lower cost per function. As feature sizes and power supply voltages continue to decrease, MPSoCs are becoming more susceptible to soft errors. However, traditional soft-error tolerant methods introduce large area, power, and performance overheads to MPSoCs. This paper presents a low-overhead hardware-software collaborated method, called SENoC, to dynamically mitigate soft errors on MPSoCs using an on-chip sensor network. We developed a low-cost on-chip sensor network to collaboratively monitor and detect soft errors, and implemented software-based mechanisms to guarantee correct task execution. To maximize the performance of soft-error tolerant MPSoCs, a hybrid scheduling scheme is proposed to effectively manage applications and resources under uncertainty. We studied the new method on MPSoCs of different scales and tested it using typical embedded applications under different cosmic ray flux conditions. Experimental results show that, compared with traditional methods, SENoC requires substantially lower protection overheads to achieve the same level of soft-error tolerance. For instance, soft-error tolerant MPSoCs using SENoC achieve on average 114.1% better performance than a recent traditional method, and SENoC introduces only 0.42% area overhead to a 256-core MPSoC.
Pages: 260-265
Page count: 6