Special Session: Reliability Analysis for AI/ML Hardware

被引:8
|
作者
Kundu, Shamik [1 ]
Basu, Kanad [1 ]
Sadi, Mehdi [2 ]
Titirsha, Twisha [3 ]
Song, Shihao [3 ]
Das, Anup [3 ]
Guin, Ujjwal [2 ]
机构
[1] Univ Texas Dallas, Elect & Comp Engn, Dallas, TX 75080 USA
[2] Auburn Univ, Elect & Comp Engn, Auburn, AL USA
[3] Drexel Univ, Elect & Comp Engn, Philadelphia, PA USA
关键词
Machine learning; deep learning accelerator; neuromorphic computing; reliability; NEURAL-NETWORKS; POWER;
D O I
10.1109/VTS50974.2021.9441050
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Artificial intelligence (AI) and Machine Learning (ML) are becoming pervasive in today's applications, such as autonomous vehicles, healthcare, aerospace, cybersecurity, and many critical applications. Ensuring the reliability and robustness of the underlying AI/ML hardware becomes our paramount importance. In this paper, we explore and evaluate the reliability of different AI/ML hardware. The first section outlines the reliability issues in a commercial systolic array-based ML accelerator in the presence of faults engendering from device-level non-idealities in the DRAM. Next, we quantified the impact of circuit-level faults in the MSB and LSB logic cones of the Multiply and Accumulate (MAC) block of the AI accelerator on the AI/ML accuracy. Finally, we present two key reliability issues - circuit aging and endurance in emerging neuromorphic hardware platforms and present our system-level approach to mitigate them.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Extended Abstract: Pre-Silicon Vulnerability Assessment for AI/ML Hardware
    Aydin, Furkan
    Karabulut, Emre
    Aysu, Aydin
    PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 495 - 495
  • [22] Special Session: Hardware Approximate Computing: How, Why, When and Where?
    Saadat, Hassaan
    Parameswaran, Sri
    2017 INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURES AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES), 2017,
  • [23] Special Session: CAD for Hardware Security - Automation is Key to Adoption of Solutions
    Aftabjahani, Sohrab
    Kastner, Ryan
    Tehranipoor, Mark
    Farahmandi, Farimah
    Oberg, Jason
    Nordstrom, Anders
    Fern, Nicole
    Althoff, Alric
    2021 IEEE 39TH VLSI TEST SYMPOSIUM (VTS), 2021,
  • [24] AI and ML
    Devlin B.
    SMPTE Motion Imaging Journal, 2021, 130 (03): : 28 - 34
  • [25] The Game-Changing Memristive Technology for Next-Gen AI/ML Hardware
    Bai, Kang Jun
    Lombardi, Jack P.
    Thiem, Clare D.
    DISRUPTIVE TECHNOLOGIES IN INFORMATION SCIENCES VIII, 2024, 13058
  • [26] Can ML-based Reliability Models Span Quantum Hardware Boundaries?
    Ioannou, Georgios
    Kizhuvettil, Gopika
    Charrwi, Mohammad Walid
    Saeed, Samah Mohamed
    2024 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2024, : 607 - 612
  • [27] Special Session: CAD for Hardware Security - Promising Directions for Automation of Security Assurance
    Aftabjahani, Sohrab
    Tehranipoor, Mark
    Farahmandi, Farimah
    Ahmed, Bulbul
    Kastner, Ryan
    Restuccia, Francesco
    Meza, Andres
    Ryan, Kaki
    Fern, Nicole
    van Woudenberg, Jasper
    Velegalati, Rajesh
    Breunesse, Cees-Bart
    Sturton, Cynthia
    Deutschbein, Calvin
    2023 IEEE 41ST VLSI TEST SYMPOSIUM, VTS, 2023,
  • [28] Special Session: The Recent Advance in Hardware Implementation of Post-Quantum Cryptography
    Xie, Jiafeng
    Basu, Kanad
    Gaj, Kris
    Guin, Ujjwal
    2020 IEEE 38TH VLSI TEST SYMPOSIUM (VTS 2020), 2020,
  • [29] Special Session: Towards an Agile Design Methodology for Efficient, Reliable, and Secure ML Systems
    Dave, Shail
    Marchisio, Alberto
    Hanif, Muhammad Abdullah
    Guesmi, Amira
    Shrivastava, Aviral
    Alouani, Ihsen
    Shafique, Muhammad
    2022 IEEE 40TH VLSI TEST SYMPOSIUM (VTS), 2022,
  • [30] Special issue SOCO 2017: AI and ML applied to Health Sciences (MLHS)
    de Cos, Francisco Javier
    Wozniak, Michal
    Mendez, Juan Albino
    Villar Flecha, Jose R.
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (05): : 1217 - 1218