Variance based three-way clustering approaches for handling overlapping clustering

被引:38
作者
Afridi, Mohammad Khan [1 ]
Azam, Nouman [1 ]
Yao, JingTao [2 ]
机构
[1] Natl Univ Comp & Emerging Sci, Islamabad, Pakistan
[2] Univ Regina, Dept Comp Sci, Regina, SK S4S 0A2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Three-way clustering; Clustering; Three-way decisions; Overlapping clusters; ROUGH; FUZZY;
D O I
10.1016/j.ijar.2019.11.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The conventional clustering approaches are not very effective in dealing with clusters having overlapping regions. The three-way clustering (3WC) is an effective and promising approach in this regards. A key issue in 3WC is the determination of thresholds which plays a crucial and important role in accurate estimation of the overlapping region. In this article, we propose different variance based criteria for determining the thresholds. In particular, we examine the variance or spread in evaluation function values of objects contained in the three regions obtained with 3WC of objects. An algorithm called 3WC-OR is introduced that considers the optimization of the proposed criteria for determining effective thresholds by incorporating approaches such as genetic algorithms and game-theoretic rough sets. Experimental results on five UCI datasets indicate that the proposed algorithm significantly improves results on datasets with overlapping clusters and provide comparable results on datasets with non-overlapping clusters. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:47 / 63
页数:17
相关论文
共 47 条
[1]   A three-way clustering approach for handling missing data using GTRS [J].
Afridi, Mohammad Khan ;
Azam, Nouman ;
Yao, JingTao ;
Alanazi, Eisa .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 98 :11-24
[2]  
[Anonymous], 1981, PATTERN RECOGN, DOI 10.1007/978-1-4757-0450-1_3
[3]   OVERLAPPING CLUSTERING - A NEW METHOD FOR PRODUCT POSITIONING [J].
ARABIE, P ;
CARROLL, JD ;
DESARBO, W ;
WIND, J .
JOURNAL OF MARKETING RESEARCH, 1981, 18 (03) :310-317
[4]  
Aslam J.A., 2004, J GRAPH ALGORITHMS A, V8, P95, DOI [10.7155/jgaa.00084, DOI 10.7155/JGAA.00084]
[5]   Variance Based Determination of Three-Way Decisions Using Probabilistic Rough Sets [J].
Azam, Nouman ;
Yao, Jing Tao .
ROUGH SETS, (IJCRS 2016), 2016, 9920 :209-218
[6]   FCM - THE FUZZY C-MEANS CLUSTERING-ALGORITHM [J].
BEZDEK, JC ;
EHRLICH, R ;
FULL, W .
COMPUTERS & GEOSCIENCES, 1984, 10 (2-3) :191-203
[7]   Orthopartitions and soft clustering: Soft mutual information measures for clustering validation [J].
Campagner, Andrea ;
Ciucci, Davide .
KNOWLEDGE-BASED SYSTEMS, 2019, 180 :51-61
[8]   A feature-based approach to market segmentation via overlapping K-centroids clustering [J].
Chaturvedi, A ;
Carroll, JD ;
Green, PE ;
Rotondo, JA .
JOURNAL OF MARKETING RESEARCH, 1997, 34 (03) :370-377
[9]  
Cleuziou G, 2008, INT C PATT RECOG, P563
[10]   A Multifaceted Analysis of Probabilistic Three-way Decisions [J].
Deng, Xiaofei ;
Yao, Yiyu .
FUNDAMENTA INFORMATICAE, 2014, 132 (03) :291-313