An Improved Parallel Clustering Method Based on K-Means for Electricity Consumption Patterns

被引:0
作者
Yang, Yuehua [1 ]
Wu, Yun [1 ]
机构
[1] Northeast Elect Power Univ, Sch Comp Sci, 169 Changchun Rd, Jilin 132012, Jilin, Peoples R China
关键词
electricity consumption patterns; clustering analysis; data sample density; parallel mining; MapReduce; HADOOP;
D O I
10.20965/jaciii.2024.p0953
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Electricity consumption pattern recognition is the foundation of intelligent electricity distribution data analysis. However, as the scale of electricity consumption data increases, traditional clustering analysis methods encounter bottlenecks such as low computation speed and processing efficiency. To meet the efficient mining needs of massive electricity consumption data, in this paper a parallel processing method of the density-based k-means clustering is presented. First, an initial cluster center selection method based on data sample density is proposed to avoid inaccurate initial cluster center point selection, leading to clustering falling into local optima. The dispersion degree of the data samples within the cluster is also used as an important reference for determining the number of clusters. Subsequently, parallelization of density calculation and clustering for data samples were achieved based on the MapReduce model. Through experiments conducted on Hadoop clusters, it has been shown that the proposed parallel processing method is efficient and feasible, and can provide favorable support for intelligent power allocation decisions.
引用
收藏
页码:953 / 961
页数:9
相关论文
共 38 条
  • [1] Hierarchical Clustering for Smart Meter Electricity Loads Based on Quantile Autocovariances
    Alonso, Andres M.
    Nogales, Francisco J.
    Ruiz, Carlos
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (05) : 4522 - 4530
  • [2] Benlaehmi Y, 2021, INT J ADV COMPUT SC, V12, P778
  • [3] A Scalable Ensemble Approach to Forecast the Electricity Consumption of Households
    Botman, Lola
    Soenen, Jonas
    Theodorakos, Konstantinos
    Yurtman, Aras
    Bekker, Jessa
    Vanthournout, Koen
    Blockeel, Hendrik
    De Moor, Bart
    Lago, Jesus
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (01) : 757 - 768
  • [4] Chen M, 2014, LECT NOTES COMPUT SC, V8182, P213, DOI 10.1007/978-3-642-54370-8_18
  • [5] Short-term fast forecasting based on family behavior pattern recognition for small-scale users load
    Cheng, Xiaoming
    Wang, Lei
    Zhang, Pengchao
    Wang, Xinkuan
    Yan, Qunmin
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (03): : 2107 - 2123
  • [6] Electrical Consumption Pattern base on Meter Data Management System using Big Data Techniques
    Correa, Estuardo
    Inga, Esteban
    Inga, Juan
    Hincapie, Roberto
    [J]. 2017 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTER SCIENCE (INCISCOS), 2017, : 334 - 339
  • [7] Efficient Big Data Processing in Hadoop MapReduce
    Dittrich, Jens
    Quiane-Ruiz, Jorge-Arnulfo
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (12): : 2014 - 2015
  • [8] [谷紫文 Gu Ziwen], 2021, [电力系统保护与控制, Power System Protection and Control], V49, P118
  • [9] hadoop.apache, Apache Hadoop
  • [10] Research on parallel association rule mining of big data based on an improved K-means clustering algorithm
    Hao, Li
    Wang, Tuanbu
    Guo, Chaoping
    [J]. INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2023, 16 (03) : 233 - 247