Dynamic rough clustering and its applications

被引:46
作者
Peters, Georg [2 ]
Weber, Richard [1 ]
Nowatzke, Rene [3 ]
机构
[1] Univ Chile, Dept Ind Engn, Santiago, Chile
[2] Munich Univ Appl Sci, Dept Comp Sci & Math, D-80335 Munich, Germany
[3] Munita eV, D-80335 Munich, Germany
关键词
Dynamic data mining; Changing data structures; Rough k-means clustering; FUZZY-SETS; K-MEANS;
D O I
10.1016/j.asoc.2012.05.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic data mining has gained increasing attention in the last decade. It addresses changing data structures which can be observed in many real-life applications, e.g. buying behavior of customers. As opposed to classical, i.e. static data mining where the challenge is to discover pattern inherent in given data sets, in dynamic data mining the challenge is to understand - and in some cases even predict how such pattern will change over time. Since changes in general lead to uncertainty, the appropriate approaches for uncertainty modeling are needed in order to capture, model, and predict the respective phenomena considered in dynamic environments. As a consequence, the combination of dynamic data mining and soft computing is a very promising research area. The proposed algorithm consists of a dynamic clustering cycle when the data set will be refreshed from time to time. Within this cycle criteria check if the newly arrived data have structurally changed in comparison to the data already analyzed. If yes, appropriate actions are triggered, in particular an update of the initial settings of the cluster algorithm. As we will show, rough clustering offers strong tools to detect such changing data structures. To evaluate the proposed dynamic rough clustering algorithm it has been applied to synthetic as well as to real-world data sets where it provides new insights regarding the underlying dynamic phenomena. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:3193 / 3207
页数:15
相关论文
共 52 条
[1]   Dynamic cluster generation for a fuzzy classifier with ellipsoidal regions [J].
Abe, S .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (06) :869-876
[2]  
Benderskaya E.N., 2011, KNOWLEDGE ORIENTED A, P397
[3]  
Beringer J., 2007, ADV FUZZY CLUSTERING, P333
[4]  
Bezdek J. C., 1981, Pattern recognition with fuzzy objective function algorithms
[5]   Some new indexes of cluster validity [J].
Bezdek, JC ;
Pal, NR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (03) :301-315
[6]   Info-fuzzy algorithms for mining dynamic data streams [J].
Cohen, Lior ;
Avrahami, Gil ;
Last, Mark ;
Kandel, Abraham .
APPLIED SOFT COMPUTING, 2008, 8 (04) :1283-1294
[7]   A methodology for dynamic data mining based on fuzzy clustering [J].
Crespo, F ;
Weber, R .
FUZZY SETS AND SYSTEMS, 2005, 150 (02) :267-284
[8]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[9]  
Diday E., 1973, J COMPUTATIONAL INFO, V2, P61
[10]   ROUGH FUZZY-SETS AND FUZZY ROUGH SETS [J].
DUBOIS, D ;
PRADE, H .
INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1990, 17 (2-3) :191-209