Concept drift detection on stream data for revising DBSCAN

被引:0
作者
Miyata Y. [1 ]
Ishikawa H. [2 ]
机构
[1] Hitachi, Ltd., Research and Development Group, 1-280, Higashi-koigakubo, Kokubunji, Tokyo
[2] Tokyo Metropolitan University, 6-6, Asahigaoka, Hino, Tokyo
关键词
Clustering; Concept drift; Data stream mining; DBSCAN; Power grid;
D O I
10.1541/ieejeiss.140.949
中图分类号
学科分类号
摘要
Data stream mining of IoT data can support operator to immediately isolate causes of equipment alarms. The challenge, however, is to keep their classifiers high purity (the data ratio with same proper class in a cluster) with concept drifting ascribed to differences between alarm models and entities. We propose to continuously update data class according to their distribution changes. Through evaluation, no purity deterioration was verified for oscillation condition data with a drifting rate of 1%. The result suggested that the method improves operator decision making. © 2020 The Institute of Electrical Engineers of Japan.
引用
收藏
页码:949 / 955
页数:6
相关论文
共 18 条
[1]  
United Nations Development Programme: Sustainable Development Goals
[2]  
Colglazier W., Sustainable development agenda: 2030, Science, 349, 6252, pp. 1048-1050, (2015)
[3]  
Rhodes C.J., The 2015 Paris Climate Change Conference: COP21, Science Reviews 2000 Ltd, 99, 1, pp. 97-104, (2016)
[4]  
Bayindir R., Colak I., Fulli G., Demirtas K., Smart grid technologies and applications, Renewable and Sustainable Energy Reviews, 66, pp. 499-516, (2016)
[5]  
Ester M., Kriegel H.P., Sander J., Xu X., A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise, Proc. ACM Conf. on Knowledge Discovery and Data Mining, pp. 226-231, (1996)
[6]  
Leskovec J., Rajaraman A., Ullman J.D., Mining of Massive Datasets 2nd Edition, (2014)
[7]  
Deepa M., Ravanthy P., Student P., Validation of Document Clustering based on Purity and Entropy measures, International Journal of Advanced Research in Computer and Communication Engineering, 1, 3, pp. 147-152, (2012)
[8]  
Ester M., Kriegel H.P., Sander J., Wimmer M., Xu X., Incremental Clustering for Mining in a Data Warehousing Environment, Proc. Conf. on Very Large Data Bases, pp. 323-333, (1998)
[9]  
Cao F., Estert M., Qian W., Zhou A., Density-Based Clustering over an Evolving Data Stream with Noise, Proc. SIAM Conf. on Data Mining, pp. 325-377, (2006)
[10]  
Aggarwal C.C., Han J., Wang J., Yu P.S., A Framework for Clustering Evolving Data Streams, Proc. Conf. on Very Large Data Bases, (VLDB' 03), pp. 81-92, (2003)