Accuracy Assessment of Machine Learning Algorithm(s) in Thyroid Dysfunction Diagnosis

被引:2
作者
Danjuma, Kwetishe Joro [1 ]
Wajiga, Gregory Maksha [1 ]
Garba, Etemi Joshua [1 ]
Ahmadu, Asabe Sandra [1 ]
Longe, Olumide Babatope [2 ]
机构
[1] Modibbo Adama Univ, Dept Comp Sci, Yola, Nigeria
[2] Acad City Univ Coll, Fac Computat Sci & Informat, Accra, Ghana
来源
2022 IEEE NIGERIA 4TH INTERNATIONAL CONFERENCE ON DISRUPTIVE TECHNOLOGIES FOR SUSTAINABLE DEVELOPMENT (IEEE NIGERCON) | 2022年
关键词
Machine; learning; algorithm; accuracy; thyroid; diagnosis;
D O I
10.1109/NIGERCON54645.2022.9803113
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This study presents accuracy assessment of decision tree, random forest, support vector machine, neural network, and Naive Bayes classifiers used in thyroid classification problem. Utilising thyroid data from the University of California, Irvine repository, the study applied synthetic minority oversampling technique to resolve imbalanced dataset and avoid the likelihood of overfitting, reservoir sampling technique to split the augmented data into sample sizes, and 10-fold cross-validation to measure the unbiased accuracy of the models across the sample sizes in Weka. The random forest classifier yielded 99.075% accuracy, decision tree and support vector machine achieved 98.500% accuracy, neural network produced 98.375% accuracy, and the Naive Bayes classifier generated the least classification accuracy of 98.200%. The accuracy assessments across sample sizes are statically identical with each classifier beating the other classifiers on one of the datasets, which revealed the existence of a trade-off between classification accuracy and time complexities.
引用
收藏
页码:252 / 256
页数:5
相关论文
共 41 条
[1]   Predicting Breast Cancer Recurrence Using Machine Learning Techniques: A Systematic Review [J].
Abreu, Pedro Henriques ;
Santos, Miriam Seoane ;
Abreu, Miguel Henriques ;
Andrade, Bruno ;
Silva, Daniel Castro .
ACM COMPUTING SURVEYS, 2016, 49 (03)
[2]  
Agrawal M., 2012, INT J EMERGING TECHN, V2, P139
[3]   A novel hybrid decision support system for thyroid disease forecasting [J].
Ahmad, Waheed ;
Ahmad, Ayaz ;
Lu, Chuncheng ;
Khoso, Barkat Ali ;
Huang, Lican .
SOFT COMPUTING, 2018, 22 (16) :5377-5383
[4]   Identifying channel sand-body from multiple seismic attributes with an improved random forest algorithm [J].
Ao, Yile ;
Li, Hongqi ;
Zhu, Liping ;
Ali, Sikandar ;
Yang, Zhongguo .
JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2019, 173 :781-792
[5]  
Awad Mariette, 2015, Support Vector Machines for Classification, V4, P39, DOI [DOI 10.1007/978-1-4302-5990-93, DOI 10.1007/978-1-4302-5990-9_3]
[6]   Performance Evaluation of the Machine Learning Algorithms Used in Inference Mechanism of a Medical Decision Support System [J].
Bal, Mert ;
Amasyali, M. Fatih ;
Sever, Hayri ;
Kose, Guven ;
Demirhan, Ayse .
SCIENTIFIC WORLD JOURNAL, 2014,
[7]   Predictive data mining in clinical medicine: Current issues and guidelines [J].
Bellazzi, Riccardo ;
Zupan, Blaz .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2008, 77 (02) :81-97
[8]   A case-based reasoning system for supervised classification problems in the medical field [J].
Bentaiba-Lagrid, Miled Basma ;
Bouzar-Benlabiod, Lydia ;
Rubin, Stuart H. ;
Bouabana-Tebibel, Thouraya ;
Hanini, Maria R. .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 150
[9]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[10]   A Three-Stage Expert System Based on Support Vector Machines for Thyroid Disease Diagnosis [J].
Chen, Hui-Ling ;
Yang, Bo ;
Wang, Gang ;
Liu, Jie ;
Chen, Yi-Dong ;
Liu, Da-You .
JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (03) :1953-1963