Dynamic K-Means Clustering of Workload and Cloud Resource Configuration for Cloud Elastic Model

被引:2
|
作者
Daradkeh, Tariq [1 ]
Agarwal, Anjali [1 ]
Zaman, Marzia [2 ]
Goel, Nishith [2 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Cistech Ltd, Ottawa, ON K2E 7K3, Canada
来源
IEEE ACCESS | 2020年 / 8卷
基金
加拿大自然科学与工程研究理事会;
关键词
Cloud computing; Data centers; Task analysis; Clustering methods; Servers; Internet; Hardware; Elastic model; kernel density estimator; dynamic k-means clustering; workload; data center configuration; logs analysis;
D O I
10.1109/ACCESS.2020.3042716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud elasticity involves timely provisioning and de-provisioning of computing resources and adjusting resources size to meet the dynamic workload demand. This requires fast, and accurate resource scaling methods at minimum cost (e.g. pay as you go) that match with workload demands. Two dynamic changing parameters must be defined in an elastic model, the workload resource demand classes, and the data center resource reconfiguration classes. These parameters are not labeled for cloud management system while data center logs are being captured. Building an advance elastic model is a critical task, which defines multiple classes under these two categories i.e. for workload and for provisioning. A dynamic method is therefore required to define (during configuration time window) the workload classes and resource provisioning classes. Unsupervised learning model such as K-Means has many challenges such as time complexity, selection of optimum number of clusters (representing the classes), and determining centroid values of the clusters. All clustering methods depend on minimizing mean square error between center of population in same class member. These methods are often enhanced using guidelines to find out the centroids, but they suffer from K-Means limitations. For the application of clustering cloud log traces, most of the reported work use K-Means clustering to label workload types. However, there is no work reported that label data center scaling classes. In this work, a novel method is proposed to analyze the characteristics of both workloads and datacenter configurations using clustering method, and is based on random variable model transformation (kernel density estimator) guide. This method enhances K-Means clustering by automatically determining optimum number of classes and finding the mean centroids for the clusters. In addition, it improves the accuracy and the time complexity of standard K-Means clustering model, by best correlating between clustering attributes using statistical correlation methods.
引用
收藏
页码:219430 / 219446
页数:17
相关论文
共 50 条
  • [1] Energy-saving Analysis of Cloud Workload Based on K-means Clustering
    Xia, Qingxin
    Lan, Yuqing
    Zhao, Liang
    Xiao, Limin
    2014 IEEE COMPUTING, COMMUNICATIONS AND IT APPLICATIONS CONFERENCE (COMCOMAP), 2014, : 305 - 309
  • [2] Low-Energy Resource Classification Algorithm for Cross-Regional Cloud Data Centers Based on K-Means Clustering Algorithm
    Liang, Bin
    Bai, Junqing
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (08) : 10084 - 10091
  • [3] K-means clustering algorithm for data distribution in cloud computing environment
    Pan, Hailan
    Lei, Yongmei
    Yin, Shi
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2021, 12 (03) : 322 - 331
  • [4] Implementation of K-Means Clustering for Evaluating SaaS on the Cloud computing Environment
    Jagli, Dhanamma
    Purohit, Seema
    Nalla, Subash Chandra
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ICT IN BUSINESS INDUSTRY & GOVERNMENT (ICTBIG), 2016,
  • [5] Cloud based framework for diagnosis of diabetes mellitus using K-means clustering
    Shakeel P.M.
    Baskar S.
    Dhulipala V.R.S.
    Jaber M.M.
    Health Information Science and Systems, 6 (1)
  • [6] Cloud Computing K-Means Text Clustering Filtering Algorithm based on Hadoop
    Huang Suyu
    Proceedings of the 2016 4th International Conference on Machinery, Materials and Information Technology Applications, 2016, 71 : 1516 - 1521
  • [7] K-Means algorithm based on Cloud Computing
    Xu, Yunfeng
    Zhang, Yan
    Ma, Rui
    2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 2, 2012, : 363 - 365
  • [8] Hybrid Resource Scaling for Dynamic Workload in Cloud Computing
    Daraje, Megersa
    Shaikh, Javed
    2021 IEEE INTERNATIONAL CONFERENCE ON MOBILE NETWORKS AND WIRELESS COMMUNICATIONS (ICMNWC), 2021,
  • [9] A Parallel Privacy-Preserving k-Means Clustering Algorithm for Encrypted Databases in Cloud Computing
    Song, Youngho
    Kim, Hyeong-Jin
    Lee, Hyun-Jo
    Chang, Jae-Woo
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [10] Privacy-Preserving Outsourcing of K-Means Clustering for Cloud-Device Collaborative Computing in Space-Air-Ground Integrated IoT
    Zhao, Wei
    Yang, Wu
    Wang, Huanran
    Zhang, Tairong
    Man, Dapeng
    Liu, Tao
    Lv, Jiguang
    Guizani, Mohsen
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 20396 - 20407