An approach to failure prediction in a cloud based environment

被引:11
|
作者
Adamu, Hussaini [1 ]
Mohammed, Bashir [1 ]
Maina, Ali Bukar [1 ]
Cullen, Andrea [1 ]
Ugail, Hassan [1 ]
Awan, Irfan [1 ]
机构
[1] Univ Bradford, Fac Engn & Informat, Bradford BD7 1DP, W Yorkshire, England
来源
2017 IEEE 5TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD 2017) | 2017年
关键词
Failure; Cloud Computing; Machine Learning; Availability;
D O I
10.1109/FiCloud.2017.56
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Failure in cloud system is defined as an even that occurs when the delivered service deviates from the correct intended service. As the cloud computing systems continue to grow in scale and complexity, there is an urgent need for cloud service providers (CSP) to guarantee a reliable on-demand resource to their customers in the presence of faults thereby fulfilling their service level agreement (SLA). Component failures in cloud systems are very familiar phenomena. However, large cloud service providers' data centers should be designed to provide a certain level of availability to the business system. Infrastructure as-a-service (Iaas) cloud delivery model presents computational resources (CPU and memory), storage resources and networking capacity that ensures high availability in the presence of such failures. The data in-production-faults recorded within a 2 years period has been studied and analyzed from the National Energy Research Scientific computing center (NERSC). Using the real-time data collected from the Computer Failure Data Repository (CFDR), this paper presents the performance of two machine learning (ML) algorithms, Linear Regression (LR) Model and Support Vector Machine (SVM) with a Linear Gaussian kernel for predicting hardware failures in a real-time cloud environment to improve system availability. The performance of the two algorithms have been rigorously evaluated using K-folds cross validation technique. Furthermore, steps and procedure for future studies has been presented. This research will aid computer hardware companies and cloud service providers (CSP) in designing a reliable fault-tolerant system by providing a better device selection, thereby improving system availability and minimizing unscheduled system downtime.
引用
收藏
页码:191 / 197
页数:7
相关论文
共 50 条
  • [1] Improved Metaheuristic Based Failure Prediction with Migration Optimization in Cloud Environment
    Karthikeyan K.
    Liyakathunisa
    Aljohani E.
    Vaiyapuri T.
    Computer Systems Science and Engineering, 2023, 45 (02): : 1641 - 1654
  • [2] Prediction based task scheduling approach for floodplain application in cloud environment
    Kaur, Gurleen
    Bala, Anju
    COMPUTING, 2021, 103 (05) : 895 - 916
  • [3] Prediction based task scheduling approach for floodplain application in cloud environment
    Gurleen Kaur
    Anju Bala
    Computing, 2021, 103 : 895 - 916
  • [4] A new temporal locality-based workload prediction approach for SaaS services in a cloud environment
    Matoussi, Wiem
    Hamrouni, Tarek
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (07) : 3973 - 3987
  • [5] OPSA: an optimized prediction based scheduling approach for scientific applications in cloud environment
    Gurleen Kaur
    Anju Bala
    Cluster Computing, 2021, 24 : 1955 - 1974
  • [6] OPSA: an optimized prediction based scheduling approach for scientific applications in cloud environment
    Kaur, Gurleen
    Bala, Anju
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03): : 1955 - 1974
  • [7] Virtual resource prediction in cloud environment: A Bayesian approach
    Shyam, Gopal Kirshna
    Manvi, Sunilkumar S.
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 65 : 144 - 154
  • [8] Prediction-Based Elastic Load Balancing Mechanism in Cloud Environment
    Yang, Xin
    Qiao, Xiuquan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE & APPLICATION TECHNOLOGY (ICCIA 2016), 2016, 56 : 289 - 294
  • [9] Cloud failure prediction based on traditional machine learning and deep learning
    Tengku Nazmi Tengku Asmawi
    Azlan Ismail
    Jun Shen
    Journal of Cloud Computing, 11
  • [10] Cloud failure prediction based on traditional machine learning and deep learning
    Asmawi, Tengku Nazmi Tengku
    Ismail, Azlan
    Shen, Jun
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):