SeQual: an unsupervised feature selection method for cloud workload traces

被引:5
作者
Ali, Shallaw Mohammed [1 ,2 ]
Kecskemeti, Gabor [1 ]
机构
[1] Univ Miskolc, Inst Informat Technol, Miskolc, Hungary
[2] Al Qalam Univ Coll, Dept Comp Engn, Kirkuk, Iraq
关键词
Feature selection; Cloud workload trace; K-means clustering;
D O I
10.1007/s11227-023-05163-w
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
One challenge of studying cloud workload traces is the lack of available users' identities. Therefore, clustering methods were used to address this challenge through extracting these identities from workload traces. For better extraction, it is beneficial to select attributes (columns in the traces) for clustering by using feature selection methods. However, the use of general selection methods requires details that are not available for workload traces (e.g. predefined number of clusters). Therefore, in this paper, we present an unsupervised feature selection method for cloud workload traces to identify good candidate attributes for clustering. This method uses Silhouette coefficients to rank attributes that are best for users' extraction through clustering. The performance of our SeQual method is evaluated in comparison with commonly used (supervised and unsupervised) feature selection methods with the help of clustering quality metrics (i.e. adjusted rand index, entropy and precision). The results show that the SeQual method can compete with the supervised methods and perform better than unsupervised ones, with an average accuracy between 90% and 99%.
引用
收藏
页码:15079 / 15097
页数:19
相关论文
共 17 条
[1]  
alexpucher, WORKLOAD TRACES UNLA
[2]   Clustering Datasets in Cloud Computing Environment for User Identification [J].
Ali, Shallaw Mohammed ;
Kecskemeti, Gabor .
30TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2022), 2022, :165-171
[3]  
[Anonymous], 2005, P 11 ANN C ADV SCH C
[4]   Workload Classification in Multi-VM Cloud Environment Using Deep Neural Network Model [J].
Bhagtya, Paras ;
Raghavan, S. ;
Chandraseakran, K. ;
Usha, D. .
36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, :79-82
[5]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[6]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28
[7]  
El Aboudi N, 2016, 2016 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS)
[8]  
Feitelson D, 2021, PARALLEL WORKLOADS A
[9]   A survey on feature selection approaches for clustering [J].
Hancer, Emrah ;
Xue, Bing ;
Zhang, Mengjie .
ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (06) :4519-4545
[10]  
Jassas Mohammad S., 2020, Advances in Artificial Intelligence. 33rd Canadian Conference on Artificial Intelligence, Canadian AI 2020. Proceedings. Lecture Notes in Artificial Intelligence. Subseries of Lecture Notes in Computer Science (LNAI 12109), P321, DOI 10.1007/978-3-030-47358-7_32