Application of Ensemble Machine Learning for Classification Problems on Very Small Datasets

Cited by: 0
Authors
Pavic, Ognjen [1]
Dasic, Lazar [1]
Geroski, Tijana [2, 3]
Pirkovic, Marijana Stanojevic [4]
Milovanovic, Aleksandar [1]
Filipovic, Nenad [2, 3]
Affiliations
[1] Univ Kragujevac, Inst Informat Technol, Kragujevac 34000, Serbia
[2] Univ Kragujevac, Fac Engn, Kragujevac 34000, Serbia
[3] Bioengn Res & Dev Ctr BioIRC, Kragujevac 34000, Serbia
[4] Univ Kragujevac, Fac Med Sci, Kragujevac 34000, Serbia
Source
APPLIED ARTIFICIAL INTELLIGENCE 2: MEDICINE, BIOLOGY, CHEMISTRY, FINANCIAL, GAMES, ENGINEERING, SICAAI 2023 | 2024 | Vol. 999
Keywords
Machine learning; Classification; Risk assessment; Random forest; Ensemble First Section;
DOI
10.1007/978-3-031-60840-7_15
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Machine learning has been one of the most widely used branches of artificial intelligence in recent years. It is most commonly used to solve classification or regression problems through supervised learning. Machine learning models require high-quality data in sufficient quantity to produce good results. This paper investigates an approach that uses ensemble learning, aggregating multiple machine learning models, to improve predictive performance when only a very limited amount of training data is available. The ensemble model was trained on a patient fractional flow reserve biomarker dataset with the goal of classifying patients into risk classes based on their risk of suffering an acute myocardial infarction. The ensemble consisted of multiple random forest classification models trained on different combinations of training and test data to improve prediction accuracy over a single random forest model. The final ensemble achieved a prediction accuracy of 71.3%, a substantial improvement over the 36% prediction accuracy of a single random forest classification model.
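The idea of aggregating several random forests trained on different train/test splits can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give the aggregation rule, the number of models, or the hyperparameters, so majority voting, 15 models, and 50 trees per forest are all assumed here, and a small synthetic dataset from scikit-learn stands in for the fractional flow reserve data. The helper names `train_split_ensemble` and `majority_vote` are hypothetical.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split


def train_split_ensemble(X, y, n_models=15, test_size=0.3, seed=0):
    """Fit one random forest per random train/test split of (X, y).

    Returns a list of (model, test_accuracy) pairs. All settings are
    assumptions; the abstract does not specify them.
    """
    models = []
    for i in range(n_models):
        # Each model sees a different partition of the same small dataset.
        X_tr, X_te, y_tr, y_te = train_test_split(
            X, y, test_size=test_size, stratify=y, random_state=seed + i
        )
        rf = RandomForestClassifier(n_estimators=50, random_state=seed + i)
        rf.fit(X_tr, y_tr)
        models.append((rf, rf.score(X_te, y_te)))
    return models


def majority_vote(models, X):
    """Aggregate predictions by majority vote (assumed aggregation rule).

    Class labels must be non-negative integers.
    """
    votes = np.stack([rf.predict(X) for rf, _ in models])  # (n_models, n_samples)
    return np.array([np.bincount(col).argmax() for col in votes.T])


# Synthetic stand-in for a very small, three-class dataset.
X, y = make_classification(
    n_samples=60, n_features=8, n_informative=4, n_redundant=0,
    n_classes=3, n_clusters_per_class=1, random_state=0,
)

# Keep a hold-out set that no ensemble member ever trains on.
X_dev, X_hold, y_dev, y_hold = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

models = train_split_ensemble(X_dev, y_dev)
preds = majority_vote(models, X_hold)

print("mean single-model split accuracy:", np.mean([acc for _, acc in models]))
print("ensemble hold-out accuracy:", np.mean(preds == y_hold))
```

Evaluating on a hold-out set that no member has seen keeps the ensemble score comparable to a single model's score. Accuracies on synthetic data say nothing about the 71.3% and 36% figures reported in the abstract.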
Pages: 108-115
Number of pages: 8