SSD Drive Failure Prediction on Alibaba Data Center Using Machine Learning

被引:1
|
作者
Chen, Lei [1 ]
Zhu, Zongpeng [2 ]
Li, Anyu [2 ]
Mashhadi, Najmeh [1 ]
Frickey, Robert [1 ]
Ye, Jinhe [1 ]
Guo, Xin [1 ]
机构
[1] Solidigm, Data Ctr Div, San Jose, CA 95134 USA
[2] Alibaba Grp, Alibaba Cloud, Hangzhou, Peoples R China
来源
2022 14TH IEEE INTERNATIONAL MEMORY WORKSHOP (IMW 2022) | 2022年
关键词
SSD drive failure detection; SSD SMART Data; Ensemble Learning; Light GBM and Random Forest; RELIABILITY; MODEL;
D O I
10.1109/IMW52921.2022.9779284
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Flash-based Solid-State Drives (SSDs) have become a critical storage tier in data centers and enterprise storage systems. Cloud companies are very interested in predicting drive failures. Drive failure prediction enables managing drive replacement and backup data beforehand and helps planning drive purchase strategies. Solidigm and Alibaba collaborate to collect and analyze Self-Monitoring, Analysis, and Reporting Technology (SMART) data and predict SSD failures 30 days ahead of time using machine learning techniques. In this paper, we use group k-fold cross-validation to select the best parameters for machine learning models and avoid overfitting. After obtaining the prediction score of each sample from the model, a post-processing with neural network is applied on those prediction scores to get the drive-level prediction. A modified ensemble learning method is designed and implemented by majority voting on different models of Light GBM and Random Forest to further improve prediction results. This paper is the first work in both academia and the storage industry to design a drive failure prediction system for deploying in data centers by optimizing models with the highest Precision instead of the highest F1-score to minimize false positive rate. We advance to get drive failure prediction with 100% Precision and 21% Recall, enabling us to avoid the high cost of false positives.
引用
收藏
页码:29 / 33
页数:5
相关论文
共 50 条
  • [41] Task failure prediction for wafer-handling robotic arms by using various machine learning algorithms
    Huang, Ping Wun
    Chung, Kuan-Jung
    MEASUREMENT & CONTROL, 2021, 54 (5-6) : 701 - 710
  • [42] Short-Term Prediction of Global Solar Radiation Energy Using Weather Data and Machine Learning Ensembles: A Comparative Study
    Al-Hajj, Rami
    Assi, Ali
    Fouad, Mohamad
    JOURNAL OF SOLAR ENERGY ENGINEERING-TRANSACTIONS OF THE ASME, 2021, 143 (05):
  • [43] Accurate Prediction of Microstructure of Composites using Machine Learning
    Sang, Sheng
    Xu, Chen
    Fan, Jiadi
    Miao, Daniel
    Side, Conner
    Wang, Ziping
    ADVANCED THEORY AND SIMULATIONS, 2023, 6 (02)
  • [44] Noise Prediction Using Machine Learning with Measurements Analysis
    Wen, Po-Jiun
    Huang, Chihpin
    APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [45] Groundwater Prediction Using Machine-Learning Tools
    Hussein, Eslam A.
    Thron, Christopher
    Ghaziasgar, Mehrdad
    Bagula, Antoine
    Vaccari, Mattia
    ALGORITHMS, 2020, 13 (11)
  • [46] Prediction of epileptic seizures using fNIRS and machine learning
    Guevara, Edgar
    Flores-Castro, Jorge-Arturo
    Peng, Ke
    Dang Khoa Nguyen
    Lesage, Frederic
    Pouliot, Philippe
    Rosas-Romero, Roberto
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (02) : 2055 - 2068
  • [47] Prediction of Cancer Treatment Using Advancements in Machine Learning
    Singh, Arun Kumar
    Ling, Jingjing
    Malviya, Rishabha
    RECENT PATENTS ON ANTI-CANCER DRUG DISCOVERY, 2023, 18 (03) : 364 - 378
  • [48] Crop Yield Prediction Using Machine Learning Algorithms
    Nigam, Aruvansh
    Garg, Saksham
    Agrawal, Archit
    Agrawal, Parul
    2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 125 - 130
  • [49] Flood Hydrograph Prediction Using Machine Learning Methods
    Tayfur, Gokmen
    Singh, Vijay P.
    Moramarco, Tommaso
    Barbetta, Silvia
    WATER, 2018, 10 (08)
  • [50] Electrical Energy Consumption Prediction Using Machine Learning
    Stankoski, Simon
    Kiprijanovska, Ivana
    Ilievski, Igor
    Slobodan, Jovanovski
    Gjoreski, Hristijan
    ICT INNOVATIONS 2019: BIG DATA PROCESSING AND MINING, 2019, 1110 : 72 - 82