SSD Drive Failure Prediction on Alibaba Data Center Using Machine Learning

被引:1
|
作者
Chen, Lei [1 ]
Zhu, Zongpeng [2 ]
Li, Anyu [2 ]
Mashhadi, Najmeh [1 ]
Frickey, Robert [1 ]
Ye, Jinhe [1 ]
Guo, Xin [1 ]
机构
[1] Solidigm, Data Ctr Div, San Jose, CA 95134 USA
[2] Alibaba Grp, Alibaba Cloud, Hangzhou, Peoples R China
来源
2022 14TH IEEE INTERNATIONAL MEMORY WORKSHOP (IMW 2022) | 2022年
关键词
SSD drive failure detection; SSD SMART Data; Ensemble Learning; Light GBM and Random Forest; RELIABILITY; MODEL;
D O I
10.1109/IMW52921.2022.9779284
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Flash-based Solid-State Drives (SSDs) have become a critical storage tier in data centers and enterprise storage systems. Cloud companies are very interested in predicting drive failures. Drive failure prediction enables managing drive replacement and backup data beforehand and helps planning drive purchase strategies. Solidigm and Alibaba collaborate to collect and analyze Self-Monitoring, Analysis, and Reporting Technology (SMART) data and predict SSD failures 30 days ahead of time using machine learning techniques. In this paper, we use group k-fold cross-validation to select the best parameters for machine learning models and avoid overfitting. After obtaining the prediction score of each sample from the model, a post-processing with neural network is applied on those prediction scores to get the drive-level prediction. A modified ensemble learning method is designed and implemented by majority voting on different models of Light GBM and Random Forest to further improve prediction results. This paper is the first work in both academia and the storage industry to design a drive failure prediction system for deploying in data centers by optimizing models with the highest Precision instead of the highest F1-score to minimize false positive rate. We advance to get drive failure prediction with 100% Precision and 21% Recall, enabling us to avoid the high cost of false positives.
引用
收藏
页码:29 / 33
页数:5
相关论文
共 50 条
  • [31] Grape leaf moisture prediction from UAVs using multimodal data fusion and machine learning
    Peng, Xuelian
    Ma, Yuxin
    Sun, Jun
    Chen, Dianyu
    Zhen, Jingbo
    Zhang, Zhitao
    Hu, Xiaotao
    Wang, Yakun
    PRECISION AGRICULTURE, 2024, 25 (03) : 1609 - 1635
  • [32] Machine learning based survival prediction in Glioma using large-scale registry data
    Zhao, Rachel
    Zhuge, Ying
    Camphausen, Kevin
    Krauze, Andra, V
    HEALTH INFORMATICS JOURNAL, 2022, 28 (04)
  • [33] Integrating multiple data sources for improved flight delay prediction using explainable machine learning
    Pineda-Jaramillo, Juan
    Munoz, Claudia
    Mesa-Arango, Rodrigo
    Gonzalez-Calderon, Carlos
    Lange, Anne
    RESEARCH IN TRANSPORTATION BUSINESS AND MANAGEMENT, 2024, 56
  • [34] A Three-Step Weather Data Approach in Solar Energy Prediction Using Machine Learning
    Falope, Tolulope Olumuyiwa
    Lao, Liyun
    Hanak, Dawid
    RENEWABLE ENERGY FOCUS, 2024, 50
  • [35] Prediction of Thermogravimetric Data for Asphaltenes Extracted from Deasphalted Oil Using Machine Learning Techniques
    Sivaramakrishnan, Kaushik
    Tannous, Joy H.
    Chandrasekaran, Vignesh
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2023, 62 (43) : 17787 - 17804
  • [36] Enhancing crop yield prediction in Senegal using advanced machine learning techniques and synthetic data
    Razavi, Mohammad Amin
    Nejadhashemi, A. Pouyan
    Majidi, Babak
    Razavi, Hoda S.
    Kpodo, Josue
    Eeswaran, Rasu
    Ciampitti, Ignacio
    Prasad, P. V. Vara
    ARTIFICIAL INTELLIGENCE IN AGRICULTURE, 2024, 14 : 99 - 114
  • [37] Employee attrition prediction in a pharmaceutical company using both machine learning approach and qualitative data
    Mozaffari, Fatemeh
    Rahimi, Marzieh
    Yazdani, Hamidreza
    Sohrabi, Babak
    BENCHMARKING-AN INTERNATIONAL JOURNAL, 2023, 30 (10) : 4140 - 4173
  • [38] Large scale prediction of groundwater nitrate concentrations from spatial data using machine learning
    Knoll, Lukas
    Breuer, Lutz
    Bach, Martin
    SCIENCE OF THE TOTAL ENVIRONMENT, 2019, 668 : 1317 - 1327
  • [39] Protection against failure of machine-learning-based QoT prediction
    Guo, Ningning
    Li, Longfei
    Mukherjee, Biswanath
    Shen, Gangxiang
    JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2022, 14 (07) : 572 - 585
  • [40] A Comparative Study of Machine Learning Algorithms for Financial Data Prediction
    Omar, Bencharef
    Zineb, Bousbaa
    Jofre Aida, Cortes
    Cortes Daniel, Gonzalez
    2018 INTERNATIONAL SYMPOSIUM ON ADVANCED ELECTRICAL AND COMMUNICATION TECHNOLOGIES (ISAECT), 2018,