An approach to failure prediction in a cloud based environment

Cited by: 11
Authors
Adamu, Hussaini [1]
Mohammed, Bashir [1]
Maina, Ali Bukar [1]
Cullen, Andrea [1]
Ugail, Hassan [1]
Awan, Irfan [1]
Affiliations
[1] Univ Bradford, Fac Engn & Informat, Bradford BD7 1DP, W Yorkshire, England
Source
2017 IEEE 5TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD 2017) | 2017
Keywords
Failure; Cloud Computing; Machine Learning; Availability;
DOI
10.1109/FiCloud.2017.56
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
A failure in a cloud system is defined as an event that occurs when the delivered service deviates from the correct intended service. As cloud computing systems continue to grow in scale and complexity, there is an urgent need for cloud service providers (CSPs) to guarantee reliable on-demand resources to their customers in the presence of faults, thereby fulfilling their service level agreements (SLAs). Component failures in cloud systems are a very familiar phenomenon. However, large cloud service providers' data centers should be designed to provide a certain level of availability to the business system. The Infrastructure-as-a-Service (IaaS) cloud delivery model presents computational resources (CPU and memory), storage resources, and networking capacity that ensure high availability in the presence of such failures. In-production fault data recorded over a two-year period at the National Energy Research Scientific Computing Center (NERSC) have been studied and analyzed. Using the real-time data collected from the Computer Failure Data Repository (CFDR), this paper presents the performance of two machine learning (ML) algorithms, a Linear Regression (LR) model and a Support Vector Machine (SVM) with a Linear Gaussian kernel, for predicting hardware failures in a real-time cloud environment to improve system availability. The performance of the two algorithms has been rigorously evaluated using the K-fold cross-validation technique. Furthermore, steps and procedures for future studies have been presented. This research will aid computer hardware companies and cloud service providers in designing reliable fault-tolerant systems by enabling better device selection, thereby improving system availability and minimizing unscheduled system downtime.
Pages: 191 - 197
Number of pages: 7
Related papers
50 in total
  • [31] An Exception Handling Approach for Privacy-Preserving Service Recommendation Failure in a Cloud Environment
    Qi, Lianyong
    Meng, Shunmei
    Zhang, Xuyun
    Wang, Ruili
    Xu, Xiaolong
    Zhou, Zhili
    Dou, Wanchun
    SENSORS, 2018, 18 (07)
  • [32] A Probabilistic Prediction Approach for Memory Resource of Complex System Simulation in Cloud Computing Environment
    Wang, Shuai
    Yao, Yiping
    Zhu, Feng
    Tang, Wenjie
    Xiao, Yuhao
    SYMMETRY-BASEL, 2020, 12 (11): 1 - 15
  • [33] A Practical Approach to Hard Disk Failure Prediction in Cloud Platforms: Big Data Model for Failure Management in Datacenters
    Ganguly, Sandipan
    Consul, Ashish
    Khan, Ali
    Bussone, Brian
    Richards, Jacqueline
    Miguel, Alejandro
    PROCEEDINGS 2016 IEEE SECOND INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2016), 2016, : 105 - 116
  • [34] A Prediction-Based ACO Algorithm to Dynamic Tasks Scheduling in Cloud Environment
    Hu, Haitao
    Wang, Hongyan
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 2727 - 2732
  • [35] A Pattern-Based Prediction Model for Dynamic Resource Provisioning in Cloud Environment
    Kim, Hyukho
    Kim, Woongsup
    Kim, Yangwoo
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2011, 5 (10): 1712 - 1732
  • [36] Emerging Green ICT: Heart Disease Prediction Model in Cloud Environment
    Bala, Anju
    Malhotra, Shikhar
    Gupta, Nishant
    Ahuja, Naman
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ICT FOR SUSTAINABLE DEVELOPMENT ICT4SD 2015, VOL 2, 2016, 409 : 579 - 587
  • [37] A Behavior Based Trustworthy Service Composition Discovery Approach in Cloud Environment
    Pang, Shanchen
    Gao, Qian
    Liu, Ting
    He, Hua
    Xu, Guangquan
    Liang, Kaitai
    IEEE ACCESS, 2019, 7 : 56492 - 56503
  • [38] An anomaly-based approach for DDoS attack detection in cloud environment
    Rawashdeh, Adnan
    Alkasassbeh, Mouhammd
    Al-Hawawreh, Muna
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2018, 57 (04) : 312 - 324
  • [39] An integration approach of hybrid databases based on SQL in cloud computing environment
    Li, Changqing
    Gu, Jianhua
    SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (03): 401 - 422
  • [40] An Agent-Based Approach for Resource Allocation in the Cloud Computing Environment
    Fareh, Mohamed El-kabir
    Kazar, Okba
    Femmam, Manel
    Bourekkache, Samir
    2015 9TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATION SYSTEMS SERVICES AND APPLICATIONS (TSSA), 2015,