SSD Drive Failure Prediction on Alibaba Data Center Using Machine Learning

被引:1
|
作者
Chen, Lei [1 ]
Zhu, Zongpeng [2 ]
Li, Anyu [2 ]
Mashhadi, Najmeh [1 ]
Frickey, Robert [1 ]
Ye, Jinhe [1 ]
Guo, Xin [1 ]
机构
[1] Solidigm, Data Ctr Div, San Jose, CA 95134 USA
[2] Alibaba Grp, Alibaba Cloud, Hangzhou, Peoples R China
来源
2022 14TH IEEE INTERNATIONAL MEMORY WORKSHOP (IMW 2022) | 2022年
关键词
SSD drive failure detection; SSD SMART Data; Ensemble Learning; Light GBM and Random Forest; RELIABILITY; MODEL;
D O I
10.1109/IMW52921.2022.9779284
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Flash-based Solid-State Drives (SSDs) have become a critical storage tier in data centers and enterprise storage systems. Cloud companies are very interested in predicting drive failures. Drive failure prediction enables managing drive replacement and backup data beforehand and helps planning drive purchase strategies. Solidigm and Alibaba collaborate to collect and analyze Self-Monitoring, Analysis, and Reporting Technology (SMART) data and predict SSD failures 30 days ahead of time using machine learning techniques. In this paper, we use group k-fold cross-validation to select the best parameters for machine learning models and avoid overfitting. After obtaining the prediction score of each sample from the model, a post-processing with neural network is applied on those prediction scores to get the drive-level prediction. A modified ensemble learning method is designed and implemented by majority voting on different models of Light GBM and Random Forest to further improve prediction results. This paper is the first work in both academia and the storage industry to design a drive failure prediction system for deploying in data centers by optimizing models with the highest Precision instead of the highest F1-score to minimize false positive rate. We advance to get drive failure prediction with 100% Precision and 21% Recall, enabling us to avoid the high cost of false positives.
引用
收藏
页码:29 / 33
页数:5
相关论文
共 50 条
  • [21] Sea Level Prediction Using Machine Learning
    Tur, Rifat
    Tas, Erkin
    Haghighi, Ali Torabi
    Mehr, Ali Danandeh
    WATER, 2021, 13 (24)
  • [22] Chronic Diseases Prediction Using Machine Learning With Data Preprocessing Handling: A Critical Review
    Ghaniaviyanto Ramadhan, Nur
    Adiwijaya
    Maharani, Warih
    Akbar Gozali, Alfian
    IEEE ACCESS, 2024, 12 : 80698 - 80730
  • [23] Prediction of Global Horizontal Irradiance Using an Explainable Data Driven Machine Learning Algorithms
    Gupta, Rahul
    Yadav, Anil Kumar
    Jha, Shyama Kant
    ELECTRIC POWER COMPONENTS AND SYSTEMS, 2024,
  • [24] Icing detection and prediction for wind turbines using multivariate sensor data and machine learning
    Ye, Feng
    Ezzat, Ahmed Aziz
    RENEWABLE ENERGY, 2024, 231
  • [25] Prediction of skin sensitization using machine learning
    Im, Jueng Eun
    Lee, Jung Dae
    Kim, Hyang Yeon
    Kim, Hak Rim
    Seo, Dong-Wan
    Kim, Kyu-Bong
    TOXICOLOGY IN VITRO, 2023, 93
  • [26] Prediction of concrete porosity using machine learning
    Cao, Chong
    RESULTS IN ENGINEERING, 2023, 17
  • [27] Software quality prediction using machine learning
    Alaswad, Feisal
    Poovammal, E.
    MATERIALS TODAY-PROCEEDINGS, 2022, 62 : 4714 - 4720
  • [28] Prediction of GPCR activity using machine learning
    Yadav, Prakarsh
    Mollaei, Parisa
    Cao, Zhonglin
    Wang, Yuyang
    Farimani, Amir Barati
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 2564 - 2573
  • [29] Precursor Prediction and Early Warning of Power MOSFET Failure Using Machine Learning With Model Uncertainty Considered
    Hou, Yuluo
    Lu, Chang
    Abbas, Waseem
    Seid Ibrahim, Mesfin
    Waseem, Muhammad
    Hung Lee, Hiu
    Loo, Ka-Hong
    IEEE JOURNAL OF EMERGING AND SELECTED TOPICS IN POWER ELECTRONICS, 2024, 12 (06) : 5762 - 5776
  • [30] Mining Project Failure Indicators From Big Data Using Machine Learning Mixed Methods
    Strang, Kenneth David
    Vajjhala, Narasimha Rao
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY PROJECT MANAGEMENT, 2023, 14 (01) : 1 - 24