P-FCM: a proximity - based fuzzy clustering

被引:56
作者
Pedrycz, W [1 ]
Loia, V
Senatore, S
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2G7, Canada
[2] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[3] Univ Salerno, Dept Math & Informat, I-84081 Baronissi, SA, Italy
基金
加拿大自然科学与工程研究理事会;
关键词
fuzzy clustering; proximity measure; web mining; fuzzy C-means (FCM); supervision hints; preference modeling; proximity hints (constraints);
D O I
10.1016/j.fss.2004.03.004
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this study, we introduce and study a proximity-based fuzzy clustering. As the name stipulates, in this mode of clustering, a structure "discovery" in the data is realized in an unsupervised manner and becomes augmented by a certain auxiliary supervision mechanism. The supervision mechanism introduced in this algorithm is realized via a number of proximity "hints" (constraints) that specify an extent to which some pairs of patterns are regarded similar or different. They are provided externally to the clustering algorithm and help in the navigation of the search through the set of patterns and this gives rise to a two-phase optimization process. Its first phase is the standard FCM while the second step is concerned with the gradient-driven minimization of the differences between the provided proximity values and those computed on a basis of the partition matrix computed at the first phase of the algorithm. The proximity type of auxiliary information is discussed in the context of Web mining where clusters of Web pages are built in presence of some proximity information provided by a user who assesses (assigns) these degrees on a basis of some personal preferences. Numeric studies involve experiments with several synthetic data and Web data (pages). (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:21 / 41
页数:21
相关论文
共 26 条
[1]  
[Anonymous], [No title captured]
[2]  
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[3]  
BAEZAYATES RA, 1999, MODERN INFORMATION R
[4]   Partitioning-based clustering for Web document categorization [J].
Boley, D ;
Gini, M ;
Gross, R ;
Han, EH ;
Hastings, K ;
Karypis, G ;
Kumar, V ;
Mobasher, B ;
Moore, J .
DECISION SUPPORT SYSTEMS, 1999, 27 (03) :329-341
[5]   Syntactic clustering of the Web [J].
Broder, AZ ;
Glassman, SC ;
Manasse, MS ;
Zweig, G .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1997, 29 (8-13) :1157-1166
[6]  
FURUKRANZ J, 1999, P 3 INT S ADV INT DA, P487
[7]   Clustering of XML documents [J].
Guillaume, D ;
Murtagh, F .
COMPUTER PHYSICS COMMUNICATIONS, 2000, 127 (2-3) :215-227
[8]   Generalized fuzzy c-means clustering strategies using Lp norm distances [J].
Hathaway, RJ ;
Bezdek, JC ;
Hu, YK .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2000, 8 (05) :576-582
[9]   NERF C-MEANS - NON-EUCLIDEAN RELATIONAL FUZZY CLUSTERING [J].
HATHAWAY, RJ ;
BEZDEK, JC .
PATTERN RECOGNITION, 1994, 27 (03) :429-437
[10]   On relational data versions of c-means algorithms [J].
Hathaway, RJ ;
Bezdek, JC ;
Davenport, JW .
PATTERN RECOGNITION LETTERS, 1996, 17 (06) :607-612