SiaDFP: A Disk Failure Prediction Framework Based on Siamese Neural Network in Large-Scale Data Center

被引:0
|
作者
Fang, Xiaoyu [1 ]
Guan, Wenbai [1 ]
Li, Jiawen [1 ]
Cao, Chenhan [1 ]
Xia, Bin [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Nanjing 210049, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing 210049, Peoples R China
关键词
Neural networks; Market research; Task analysis; Predictive models; Faces; Data centers; Web and internet services; Attention mechanism; change point detection; disk failure prediction; siamese neural network;
D O I
10.1109/TSC.2024.3394692
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of cloud services, service providers increasingly rely on a dependable storage system equipped with large-capacity disks to ensure data availability. The primary source of unreliability in such storage systems attributes to disk failures. In recent years, some proactive methods base on machine learning models have emerged, aiming to predict impending disk failures by leveraging the SMART attributes of disks. These methods enable service providers to timely back up storage data. While the methods prove more effective and efficient in disk failure prediction, they still face challenges, such as inadequate mining of abnormal information and imbalanced classification. In this paper, we mainly analyzed the change of data distribution in hard disks. From the data analysis, we observed that the distribution change in the failed disk is obvious during the period before the disk damage, while that in the healthy disk is insignificant during running time. Motivated by the observation, we propose a novel framework named SiaDFP, based on Siamese neural network, designed to predict impending disk failures by capturing the distribution changes in failed disks. Additionally, we observed that the failed disks exhibit some change points as an abnormal feature by analyzing the disk data trend. To fully mining abnormal information inhere in failed disks, we propose CP-MAP mechanism and 2D-Attention mechanism. Furthermore, we present a subsampling approach named Region Balanced Sampling to address the challenge of imbalanced classification. Experiments on the real-world dataset Backblaze and Baidu demonstrate that the performance of SiaDFP is outstanding in the task of disk failure prediction.
引用
收藏
页码:2890 / 2903
页数:14
相关论文
共 50 条
  • [41] The Network Balance Model of Trauma and Resolution-Level I: Large-Scale Neural Networks
    Chamberlin, D. Eric
    JOURNAL OF EMDR PRACTICE AND RESEARCH, 2019, 13 (02) : 124 - 142
  • [42] Learning Large-scale Fuzzy Cognitive Maps using a Hybrid of Memetic Algorithm and Neural Network
    Chi, Yaxiong
    Liu, Jing
    2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014, : 1036 - 1040
  • [43] NeuTM: A Neural Network-based Framework for Traffic Matrix Prediction in SDN
    Azzouni, Abdelhadi
    Pujolle, Guy
    NOMS 2018 - 2018 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2018,
  • [44] Scalable framework of intelligent RFI flagging for large-scale HI survey data from FAST
    Xiao, Jian
    Zhang, Yajie
    Zhang, Bo
    Yang, Zhicheng
    Yu, Ce
    Cui, Chenzhou
    NEW ASTRONOMY, 2022, 96
  • [45] FLAIR: A Fast and Low-Redundancy Failure Recovery Framework for Inter Data Center Network
    Zhang, Yuchao
    Huang, Haoqiang
    Abdelmoniem, Ahmed M.
    Zeng, Gaoxiong
    Zheng, Chenyue
    Que, Xirong
    Wang, Wendong
    Xu, Ke
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2024, 12 (02) : 737 - 749
  • [46] Auxiliary input-enhanced siamese neural network: A robust tool wear prediction framework with improved feature extraction and generalization ability
    Wang, Chenghan
    Shen, Bin
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2024, 211
  • [47] Large-Scale PFN Fault Diagnosis Method Based on Multidimensional Time Series Anomaly Detection Using Convolutional Neural Network
    Lu, Junyong
    Delin, Zeng
    Yufeng, Zheng
    IEEE TRANSACTIONS ON PLASMA SCIENCE, 2020, 48 (11) : 3997 - 4005
  • [48] A prediction-based neural network scheme for lossless data compression
    Logeswaran, R
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2002, 32 (04): : 358 - 365
  • [49] Reliability Characterization and Failure Prediction of 3D TLC SSDs in Large-Scale Storage Systems
    Olmez, Serkay
    IEEE TRANSACTIONS ON DEVICE AND MATERIALS RELIABILITY, 2021, 21 (02) : 267 - 272
  • [50] Neural network based decentralized excitation control of large scale power systems
    Liu, Wenxin
    Sarangapani, Jagannathan
    Venayagamoorthy, Ganesh K.
    Wunsch, Donald C., II
    Cartes, David A.
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 1975 - +