An Improved Framework for Detecting Thyroid Disease Using Filter-Based Feature Selection and Stacking Ensemble

被引:13
作者
Obaido, George [1 ,2 ]
Achilonu, Okechinyere [3 ]
Ogbuokiri, Blessing [4 ]
Amadi, Chimeremma Sandra [5 ]
Habeebullahi, Lawal [6 ]
Ohalloran, Tony [7 ]
Chukwu, Chidozie Williams [8 ]
Mienye, Ebikella Domor [9 ]
Aliyu, Mikail [10 ]
Fasawe, Olufunke [10 ]
Modupe, Ibukunola Abosede [11 ]
Omietimi, Erepamo Job [12 ]
Aruleba, Kehinde [13 ]
机构
[1] Univ Calif Berkeley, Berkeley Inst Data Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Ctr Human Compatible Artificial Intelligence, Berkeley, CA 94720 USA
[3] Univ Witwatersrand Johannesburg, Sch Publ Hlth, ZA-2017 Johannesburg, South Africa
[4] Brock Univ, Dept Comp Sci, St Catharines, ON L2S 3A1, Canada
[5] Fed Univ Technol Owerri FUTO, Dept Informat Technol, Owerri 460113, Nigeria
[6] Summit Univ Offa, Dept Comp Sci, Offa 250101, Nigeria
[7] Natl Univ Ireland, Sch Comp Sci, Galway H91 TK33, Ireland
[8] Wake Forest Univ, Dept Math, Winston Salem, NC 27106 USA
[9] Univ Johannesburg, Coll Business & Econ, ZA-2006 Johannesburg, South Africa
[10] Univ Calif Berkeley, Sch Publ Hlth, Berkeley, CA 94704 USA
[11] Vaal Univ Technol, Dept Comp Sci, Vanderbijlpark, ZA-1900, South Africa
[12] Univ Pretoria, Dept Geol, ZA-0028 Pretoria, South Africa
[13] Univ Leicester, Sch Comp & Math Sci, Leicester LE1 7RH, England
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Thyroid; Diseases; Predictive models; Feature extraction; Accuracy; Thyroid cancer; Artificial intelligence; Medical services; Ensemble learning; Machine learning; healthcare; machine learning; filter-based stacking ensemble learning; thyroid disease; SUPPORT VECTOR MACHINE; PREDICTION; CHALLENGES; MANAGEMENT; ALGORITHM; CARCINOMA; PAPILLARY; PATTERNS; TUMORS;
D O I
10.1109/ACCESS.2024.3418974
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, machine learning (ML) has become a pivotal tool for predicting and diagnosing thyroid disease. While many studies have explored the use of individual ML models for thyroid disease detection, the accuracy and robustness of these single-model approaches are often constrained by data imbalance and inherent model biases. This study introduces a filter-based feature selection and stacking-based ensemble ML framework, tailored specifically for thyroid disease detection. This framework capitalizes on the collective strengths of multiple base models by aggregating their predictions, aiming to surpass the predictive performance of individual models. Such an approach can also reduce screening time and costs considering few clinical attributes are used for diagnosis. Through extensive experiments conducted on a clinical thyroid disease dataset, the filter-based feature selection approach and the ensemble learning method demonstrated superior discriminative ability, reflected by improved receiver operating characteristic-area under the curve (ROC-AUC) scores of 99.9%. The proposed framework sheds light on the complementary strengths of different base models, fostering a deeper understanding of their joint predictive performance. Our findings underscore the potential of ensemble strategies to significantly improve the efficacy of ML-based detection of thyroid diseases, marking a shift from reliance on single models to more robust, collective approaches.
引用
收藏
页码:89098 / 89112
页数:15
相关论文
共 112 条
[1]   Performance Analysis of Machine Learning Algorithms for Thyroid Disease [J].
Abbad Ur Rehman, Hafiz ;
Lin, Chyi-Yeu ;
Mushtaq, Zohaib ;
Su, Shun-Feng .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (10) :9437-9449
[2]   Stacking-Based Ensemble Learning Method for Multi-Spectral Image Classification [J].
Aboneh, Tagel ;
Rorissa, Abebe ;
Srinivasagan, Ramasamy .
TECHNOLOGIES, 2022, 10 (01)
[3]  
Agilandeeswari L., 2021, P INT C INT SYST DES, P1395
[4]  
Ahuja R, 2020, STUD COMPUT INTELL, V855, P225, DOI 10.1007/978-3-030-28553-1_11
[5]   Ensemble-based Effective Diagnosis of Thyroid Disorder with Various Feature Selection Techniques [J].
Akhtar, Tehseen ;
Arif, Saad ;
Mushtaq, Zohaib ;
Gilani, Syed Orner ;
Jamil, Mohsin ;
Ayaz, Yasar ;
Butt, Shahid Ikramullah .
2022 2ND INTERNATIONAL CONFERENCE OF SMART SYSTEMS AND EMERGING TECHNOLOGIES (SMARTTECH 2022), 2022, :14-19
[6]   Changing patterns in the incidence and survival of thyroid cancer with follicular phenotype - Papillary, follicular, and anaplastic: A morphological and epidemiological study [J].
Albores-Saavedra, Jorge ;
Henson, Donald Earl ;
Glazer, Evan ;
Schwartz, Arnold M. .
ENDOCRINE PATHOLOGY, 2007, 18 (01) :1-7
[7]  
Almahshi Hebatullah Mohammad, 2022, 2022 5th International Conference on Engineering Technology and its Applications (IICETA), P159, DOI 10.1109/IICETA54559.2022.9888736
[8]   Early Thyroid Risk Prediction by Data Mining and Ensemble Classifiers [J].
Alshayeji, Mohammad H. .
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (03) :1195-1213
[9]   RETRACTED: Empirical Method for Thyroid Disease Classification Using a Machine Learning Approach (Retracted Article) [J].
Alyas, Tahir ;
Hamid, Muhammad ;
Alissa, Khalid ;
Faiz, Tauqeer ;
Tabassum, Nadia ;
Ahmad, Aqeel .
BIOMED RESEARCH INTERNATIONAL, 2022, 2022
[10]  
Amgad N., 2024, P 6 INT C COMP INF I, P195