SiaDFP: A Disk Failure Prediction Framework Based on Siamese Neural Network in Large-Scale Data Center

Cited by: 0
Authors
Fang, Xiaoyu [1]
Guan, Wenbai [1]
Li, Jiawen [1]
Cao, Chenhan [1]
Xia, Bin [2]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Nanjing 210049, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing 210049, Peoples R China
Keywords
Neural networks; Market research; Task analysis; Predictive models; Faces; Data centers; Web and internet services; Attention mechanism; change point detection; disk failure prediction; siamese neural network
DOI
10.1109/TSC.2024.3394692
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the rapid development of cloud services, service providers increasingly rely on dependable storage systems equipped with large-capacity disks to ensure data availability. The primary source of unreliability in such storage systems is disk failure. In recent years, proactive methods based on machine learning models have emerged that aim to predict impending disk failures by leveraging the SMART attributes of disks. These methods enable service providers to back up stored data in a timely manner. While these methods have proven more effective and efficient at disk failure prediction, they still face challenges such as inadequate mining of abnormal information and imbalanced classification. In this paper, we analyze how the data distribution in hard disks changes over time. Our analysis shows that the distribution change in a failed disk is pronounced in the period before the failure, whereas the change in a healthy disk remains insignificant throughout its running time. Motivated by this observation, we propose a novel framework named SiaDFP, based on a Siamese neural network and designed to predict impending disk failures by capturing the distribution changes in failed disks. Additionally, by analyzing trends in the disk data, we observe that failed disks exhibit change points that serve as an abnormal feature. To fully mine the abnormal information inherent in failed disks, we propose the CP-MAP mechanism and the 2D-Attention mechanism. Furthermore, we present a subsampling approach named Region Balanced Sampling to address the challenge of imbalanced classification. Experiments on the real-world Backblaze and Baidu datasets demonstrate that SiaDFP achieves outstanding performance in the task of disk failure prediction.
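As a rough illustration of the general Siamese idea the abstract refers to, the Python sketch below passes two windows of a single disk attribute through one shared encoder and measures the distance between the resulting embeddings. It is not the SiaDFP architecture, which the abstract does not specify: the function names, the window length, the untrained random weights, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np


def embed(window, weights):
    """Shared encoder: the same weights are applied to every input window."""
    return np.tanh(window @ weights)


def siamese_distance(early, late, weights):
    """Euclidean distance between the embeddings of two windows."""
    return float(np.linalg.norm(embed(early, weights) - embed(late, weights)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)

    # Hypothetical sizes: a 30-sample window mapped to an 8-dimensional embedding.
    weights = rng.normal(scale=0.1, size=(30, 8))

    # Synthetic stand-ins for an early and a late window of one attribute.
    stable_early = rng.normal(0.0, 1.0, size=30)
    stable_late = rng.normal(0.0, 1.0, size=30)
    # The drifting window adds a ramp to mimic a distribution change.
    drifting_late = rng.normal(0.0, 1.0, size=30) + np.linspace(0.0, 5.0, 30)

    print("stable pair distance:  ", siamese_distance(stable_early, stable_late, weights))
    print("drifting pair distance:", siamese_distance(stable_early, drifting_late, weights))
```

Because the weights here are random rather than trained, the printed distances carry no predictive meaning. The sketch only shows the structural point that both inputs go through the same encoder, so identical windows always map to a distance of zero, and a trained encoder could learn to make the distance large when the distribution has shifted.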
Pages: 2890-2903
Number of pages: 14