Enhanced thyroid disease prediction using ensemble machine learning: a high-accuracy approach with feature selection and class balancing

Authors
Md. Rezaul Islam [1]
Aniruddha Islam Chowdhury [2]
Sharmin Shama [3]
Md. Masudul Hasan Lamyea [3]
Affiliations
[1] Shahjalal University of Science and Technology, Department of Computer Science and Engineering
[2] Bangabandhu Sheikh Mujibur Rahman Digital University, Department of Educational Technology and Engineering
[3] Dhaka International University, Department of Computer Science and Engineering
Source
Discover Artificial Intelligence, Volume 5, Issue 1
Keywords
Thyroid Disease Prediction; Machine Learning Algorithms; Data Visualization; Class Balancing Techniques; XGBoost Algorithm; Confusion Matrices
DOI
10.1007/s44163-025-00225-9
Abstract
Thyroid disorders are increasingly prevalent, making early detection crucial for reducing mortality and complications. Accurate prediction of disease progression and understanding the interplay of clinical features are essential for effective diagnosis and treatment. Our study addresses these challenges by employing a standard machine learning model, enhanced with comprehensive clinical feature analysis and an ensemble learning technique. By leveraging machine learning, we can identify key risk factors and improve diagnostic accuracy. To achieve optimal prediction outcomes, we evaluated seventeen machine learning models and implemented an Ensemble ML classifier using a hard voting strategy. Class balancing techniques, particularly random oversampling, significantly improved classification performance. Our experimental results demonstrate that the proposed model outperforms existing methods, achieving 100% sensitivity and 99.72% accuracy using the XGBoost algorithm and SelectKBest feature selection. By addressing feature reduction and high class-imbalance, the ensemble ML classifier with hard voting proves more effective in handling classification challenges.
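The two techniques the abstract names, random oversampling and hard voting, can be sketched in plain Python. This is a minimal illustration only, not the authors' implementation: the helper names `random_oversample` and `hard_vote`, the seed, and the toy data are assumptions introduced here, and the study's own pipeline (XGBoost, SelectKBest, seventeen base models) is not reproduced.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Hypothetical helper: balance classes by duplicating minority rows.

    Rows of each smaller class are sampled with replacement until every
    class has as many rows as the largest class.
    """
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


def hard_vote(predictions):
    """Hypothetical helper: majority vote over per-model label lists.

    `predictions` holds one list of predicted labels per model, all of
    the same length. Ties go to the label seen first among the votes.
    """
    voted = []
    for sample_votes in zip(*predictions):
        voted.append(Counter(sample_votes).most_common(1)[0][0])
    return voted


# Toy data (assumed for illustration): four majority rows, one minority row.
rows = [[1.0], [2.0], [3.0], [4.0], [9.0]]
labels = [0, 0, 0, 0, 1]
balanced_rows, balanced_labels = random_oversample(rows, labels)

# Three hypothetical models voting on three samples.
votes = hard_vote([[0, 1, 1], [0, 0, 1], [1, 1, 1]])
```

Oversampling is normally applied to the training split only, so that duplicated rows do not leak into the evaluation data; whether the study did so is not stated in the abstract.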