Evidence Accumulation Clustering with Possibilitic Fuzzy C-Means base clustering approach to disease diagnosis

被引:5
作者
Iliyasu, Abdullah M. [1 ]
Fatichah, Chastine [2 ]
Abuhasel, Khaled A. [3 ]
机构
[1] Prince Sattam Bin Abdulaziz Univ, Coll Engn, Res Grp, CIIS, Al Kharj 11942, Saudi Arabia
[2] Inst Teknol Sepuluh Nopember, Dept Informat, Kampus ITS Sukolilo, Surabaya 60111, Indonesia
[3] Bisha Univ, Dept Mech Engn, Bisha 61361, Saudi Arabia
关键词
Evidence accumulation clustering; K-means; fuzzy C-means; possibilitic fuzzy C-means; hybrid intelligent systems; health informatics; medical data classification; disease diagnosis; CLASSIFICATION;
D O I
10.7305/automatika.2016.10.1427
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditionally, supervised machine learning methods are the first choice for tasks involving classification of data. This study provides a non-conventional hybrid alternative technique (pEAC) that blends the Possibilistic Fuzzy CMeans (PFCM) as base cluster generating algorithm into the 'standard' Evidence Accumulation Clustering (EAC) clustering method. The PFCM coalesces the separate properties of the Possibilistic C-Means (PCM) and Fuzzy C-Means (FCM) algorithms into a sophisticated clustering algorithm. Notwithstanding the tremendous capabilities offered by this hybrid technique, in terms of structure, it resembles the hEAC and fEAC ensemble clustering techniques that are realised by integrating the K-Means and FCM clustering algorithms into the EAC technique. To validate the new technique's effectiveness, its performance on both synthetic and real medical datasets was evaluated alongside individual runs of well-known clustering methods, other unsupervised ensemble clustering techniques and some supervised machine learning methods. Our results show that the proposed pEAC technique outperformed the individual runs of the clustering methods and other unsupervised ensemble techniques in terms accuracy for the diagnosis of hepatitis, cardiovascular, breast cancer, and diabetes ailments that were used in the experiments. Remarkably, compared alongside selected supervised machine learning classification models, our proposed pEAC ensemble technique exhibits better diagnosing accuracy for the two breast cancer datasets that were used, which suggests that even at the cost of none labelling of data, the proposed technique offers efficient medical data classification.
引用
收藏
页码:822 / 835
页数:14
相关论文
共 32 条
[1]   A combined AdaBoost and NEWFM technique for medical data classification [J].
Abuhasel, Khaled A. ;
Iliyasu, Abdullah M. ;
Fatichah, Chastine .
Lecture Notes in Electrical Engineering, 2015, 339 :801-809
[2]  
Abuhasel KA, 2015, J ADV COMPUT INTELL, V19, P447
[3]  
[Anonymous], J MED IMAGING HLTH I
[4]  
[Anonymous], 2013, HYBRID METAHEURISTIC
[5]  
[Anonymous], 2009, World Acad Sci Eng Technol
[6]  
[Anonymous], 1997, CS97557
[7]   On the efficiency of evolutionary fuzzy clustering [J].
Campello, Ricardo J. G. B. ;
Hruschka, Eduardo R. ;
Alves, Vinicius S. .
JOURNAL OF HEURISTICS, 2009, 15 (01) :43-75
[8]   OWA-weighted based clustering method for classification problem [J].
Cheng, Ching-Hsue ;
Wang, Jia-Wen ;
Wu, Ming-Chang .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :4988-4995
[9]  
Evans R, 2011, 2011 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN ASIA (CITA 11)
[10]   Fuzzy Feature Representation for White Blood Cell Differential Counting in Acute Leukemia Diagnosis [J].
Fatichah, Chastine ;
Tangel, Martin L. ;
Yan, Fei ;
Betancourt, Janet P. ;
Widyanto, M. Rahmat ;
Dong, Fangyan ;
Hirota, Kaoru .
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2015, 13 (03) :742-752