An Enhanced Spectral Clustering Algorithm with S-Distance

被引:20
作者
Sharma, Krishna Kumar [1 ,2 ]
Seal, Ayan [1 ,3 ]
Herrera-Viedma, Enrique [4 ,5 ]
Krejcar, Ondrej [3 ,6 ]
机构
[1] PDPM Indian Inst Informat Technol Design & Mfg, Dept Comp Sci & Engn, Jabalpur 482005, India
[2] Univ Kota, Dept Comp Sci & Informat, Kota 324022, Rajasthan, India
[3] Univ Hradec Kralove, Fac Informat & Management, Ctr Basic & Appl Sci, Hradec Kralove 50003, Czech Republic
[4] Univ Granada, Andalusian Res Inst Data Sci & Computat Intellige, Granada 18071, Spain
[5] King Abdulaziz Univ, Fac Engn, Jeddah 21589, Saudi Arabia
[6] Univ Teknol Malaysia, Malaysia Japan Int Inst Technol MJIIT, Kuala Lumpur 54100, Malaysia
来源
SYMMETRY-BASEL | 2021年 / 13卷 / 04期
关键词
S-divergence; S-distance; spectral clustering;
D O I
10.3390/sym13040596
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Calculating and monitoring customer churn metrics is important for companies to retain customers and earn more profit in business. In this study, a churn prediction framework is developed by modified spectral clustering (SC). However, the similarity measure plays an imperative role in clustering for predicting churn with better accuracy by analyzing industrial data. The linear Euclidean distance in the traditional SC is replaced by the non-linear S-distance (Sd). The Sd is deduced from the concept of S-divergence (SD). Several characteristics of Sd are discussed in this work. Assays are conducted to endorse the proposed clustering algorithm on four synthetics, eight UCI, two industrial databases and one telecommunications database related to customer churn. Three existing clustering algorithms-k-means, density-based spatial clustering of applications with noise and conventional SC-are also implemented on the above-mentioned 15 databases. The empirical outcomes show that the proposed clustering algorithm beats three existing clustering algorithms in terms of its Jaccard index, f-score, recall, precision and accuracy. Finally, we also test the significance of the clustering results by the Wilcoxon's signed-rank test, Wilcoxon's rank-sum test, and sign tests. The relative study shows that the outcomes of the proposed algorithm are interesting, especially in the case of clusters of arbitrary shape.
引用
收藏
页数:17
相关论文
共 40 条
[1]   A survey on clustering algorithms for wireless sensor networks [J].
Abbasi, Ameer Ahmed ;
Younis, Mohamed .
COMPUTER COMMUNICATIONS, 2007, 30 (14-15) :2826-2841
[2]  
[Anonymous], 2008, RESPONSE MODELING DI
[3]   A novel evolutionary data mining algorithm with applications to churn prediction [J].
Au, WH ;
Chan, KCC ;
Yao, X .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2003, 7 (06) :532-545
[4]  
Banerjee A, 2005, J MACH LEARN RES, V6, P1705
[5]  
Bottou L., 1995, Advances in Neural Information Processing Systems 7, P585
[6]   Churn Prediction in Online Games Using Players' Login Records: A Frequency Analysis Approach [J].
Castro, Emiliano G. ;
Tsuzuki, Marcos S. G. .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2015, 7 (03) :255-265
[7]   Using K-means method and spectral clustering technique in an outfitter's value analysis [J].
Chang, En-Chi ;
Huang, Shian-Chang ;
Wu, Hsin-Hung .
QUALITY & QUANTITY, 2010, 44 (04) :807-815
[8]   Spectral clustering: A semi-supervised approach [J].
Chen, Weifu ;
Feng, Guocan .
NEUROCOMPUTING, 2012, 77 (01) :229-242
[9]   Reliable writer identification in medieval manuscripts through page layout features: The "Avila" Bible case [J].
De Stefano, C. ;
Maniaci, M. ;
Fontanella, F. ;
di Freca, A. Scotto .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 72 :99-110
[10]  
Dua D., 2017, UCI Machine Learning Repository