SiaDFP: A Disk Failure Prediction Framework Based on Siamese Neural Network in Large-Scale Data Center

被引:0
|
作者
Fang, Xiaoyu [1 ]
Guan, Wenbai [1 ]
Li, Jiawen [1 ]
Cao, Chenhan [1 ]
Xia, Bin [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Nanjing 210049, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing 210049, Peoples R China
关键词
Neural networks; Market research; Task analysis; Predictive models; Faces; Data centers; Web and internet services; Attention mechanism; change point detection; disk failure prediction; siamese neural network;
D O I
10.1109/TSC.2024.3394692
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of cloud services, service providers increasingly rely on a dependable storage system equipped with large-capacity disks to ensure data availability. The primary source of unreliability in such storage systems attributes to disk failures. In recent years, some proactive methods base on machine learning models have emerged, aiming to predict impending disk failures by leveraging the SMART attributes of disks. These methods enable service providers to timely back up storage data. While the methods prove more effective and efficient in disk failure prediction, they still face challenges, such as inadequate mining of abnormal information and imbalanced classification. In this paper, we mainly analyzed the change of data distribution in hard disks. From the data analysis, we observed that the distribution change in the failed disk is obvious during the period before the disk damage, while that in the healthy disk is insignificant during running time. Motivated by the observation, we propose a novel framework named SiaDFP, based on Siamese neural network, designed to predict impending disk failures by capturing the distribution changes in failed disks. Additionally, we observed that the failed disks exhibit some change points as an abnormal feature by analyzing the disk data trend. To fully mining abnormal information inhere in failed disks, we propose CP-MAP mechanism and 2D-Attention mechanism. Furthermore, we present a subsampling approach named Region Balanced Sampling to address the challenge of imbalanced classification. Experiments on the real-world dataset Backblaze and Baidu demonstrate that the performance of SiaDFP is outstanding in the task of disk failure prediction.
引用
收藏
页码:2890 / 2903
页数:14
相关论文
共 50 条
  • [1] PRACTISE - Demonstrating a neural network based framework for robust prediction of data center workload
    Scherer, Thomas
    Xue, Ji
    Yan, Feng
    Birke, Robert
    Chen, Lydia Y.
    Smirni, Evgenia
    2015 IEEE/ACM 8TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2015, : 402 - 403
  • [2] Minority Disk Failure Prediction Based on Transfer Learning in Large Data Centers of Heterogeneous Disk Systems
    Zhan, Ji
    Zhou, Ke
    Huang, Ping
    He, Xubin
    Xie, Ming
    Cheng, Bin
    Ji, Yongguang
    Wang, Yinhu
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (09) : 2155 - 2169
  • [3] An Improved LSTM-Based Prediction Approach for Resources and Workload in Large-Scale Data Centers
    Yuan, Haitao
    Bi, Jing
    Li, Shuang
    Zhang, Jia
    Zhou, MengChu
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (12): : 22816 - 22829
  • [4] Automated assessment of balance: A neural network approach based on large-scale balance function data
    Wu, Jingsong
    Li, Yang
    Yin, Lianhua
    He, Youze
    Wu, Tiecheng
    Ruan, Chendong
    Li, Xidian
    Wu, Jianhuang
    Tao, Jing
    FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [5] Spatial-Temporal Deep Tensor Neural Networks for Large-Scale Urban Network Speed Prediction
    Zhou, Lingxiao
    Zhang, Shuaichao
    Yu, Jingru
    Chen, Xiqun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (09) : 3718 - 3729
  • [6] Neural network-based modeling for a large-scale power plant
    Lee, Kwang Y.
    Heo, Jin S.
    Hoffman, Jason A.
    Kim, Sung-Ho
    Jung, Won-Hee
    2007 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS 1-10, 2007, : 1028 - 1035
  • [7] GNNVis: Visualize Large-Scale Data by Learning a Graph Neural Network Representation
    Huang, Yajun
    Zhang, Jingbin
    Yang, Yiyang
    Gong, Zhiguo
    Hao, Zhifeng
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 545 - 554
  • [8] BlockAIM: A Neural Network-Based Intelligent Middleware For Large-Scale IoT Data Placement Decisions
    Danish, Syed Muhammad
    Zhang, Kaiwen
    Jacobsen, Hans-Arno
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (01) : 84 - 99
  • [9] A Large Scale Study of Data Center Network Reliability
    Meza, Justin
    Xu, Tianyin
    Veeraraghavan, Kaushik
    Mutlu, Onur
    IMC'18: PROCEEDINGS OF THE INTERNET MEASUREMENT CONFERENCE, 2018, : 393 - 407
  • [10] MONet: heterogeneous Memory over Optical Network for large-scale data center resource disaggregation
    Mishra, Vaibhawa
    Benjamin, Joshua L.
    Zervas, Georgios
    JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2021, 13 (05) : 126 - 139