Application of Ensemble Machine Learning for Classification Problems on Very Small Datasets

被引:0
|
作者
Pavic, Ognjen [1 ]
Dasic, Lazar [1 ]
Geroski, Tijana [2 ,3 ]
Pirkovic, Marijana Stanojevic [4 ]
Milovanovic, Aleksandar [1 ]
Filipovic, Nenad [2 ,3 ]
机构
[1] Univ Kragujevac, Inst Informat Technol, Kragujevac 34000, Serbia
[2] Univ Kragujevac, Fac Engn, Kragujevac 34000, Serbia
[3] Bioengn Res & Dev Ctr BioIRC, Kragujevac 34000, Serbia
[4] Univ Kragujevac, Fac Med Sci, Kragujevac 34000, Serbia
来源
APPLIED ARTIFICIAL INTELLIGENCE 2: MEDICINE, BIOLOGY, CHEMISTRY, FINANCIAL, GAMES, ENGINEERING, SICAAI 2023 | 2024年 / 999卷
关键词
Machine learning; Classification; Risk assessment; Random forest; Ensemble First Section;
D O I
10.1007/978-3-031-60840-7_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning is one of the most widely used branches of artificial intelligence in recent years. It is most commonly used for solving classification or regression problems through the utilization of supervised learning approaches. Machine learning models require high quality and a sufficient quantity of data to produce good results. This paper investigates an approach which incorporates ensemble learning through the aggregation of multiple machine learning models for the purposes of increasing prediction capabilities in cases in which a very limited amount of data is available for training. The ensemble model was trained on a patient fractional flow reserve biomarker dataset and with the goal of classifying patients into risk classes based on their risk of suffering an acute myocardial infarction. The ensemble model was comprised of multiple random forest classification models which were trained with different combinations of training and test data to improve the prediction accuracy over the use of a single random forest model. Final ensemble achieved a prediction accuracy of 71.3% which was an immense improvement over the 36% prediction accuracy of a single random forest classification model.
引用
收藏
页码:108 / 115
页数:8
相关论文
共 50 条
  • [1] Application of Machine Learning Models for Malware Classification With Real and Synthetic Datasets
    Joshi, Santosh
    Pons, Alexander Perez
    Kulkarni, Shrirang Ambaji
    Upadhyay, Himanshu
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY AND PRIVACY, 2024, 18 (01)
  • [2] Machine Learning Methods with Noisy, Incomplete or Small Datasets
    Caiafa, Cesar F.
    Sun, Zhe
    Tanaka, Toshihisa
    Marti-Puig, Pere
    Sole-Casals, Jordi
    APPLIED SCIENCES-BASEL, 2021, 11 (09):
  • [3] Guidelines to Select Machine Learning Scheme for Classification of Biomedical Datasets
    Tanwani, Ajay Kumar
    Afridi, Jamal
    Shafiq, M. Zubair
    Farooq, Muddassar
    EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2009, 5483 : 128 - 139
  • [4] A novel ensemble machine learning for robust microarray data classification
    Peng, Yonghong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2006, 36 (06) : 553 - 573
  • [5] Minimalist Machine Learning: Binary Classification of Medical Datasets with Matrix Transformations
    Solorio-Ramirez, Jose Luis
    Camacho-Nieto, Oscar
    Yanez-Marquez, Cornelio
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2025, 29 (02) : 277 - 286
  • [6] Novel Machine Learning Experiments with Artificially Generated Big Data from Small Immunotherapy Datasets
    Mahmoud, Ahsanullah Yunas
    Neagu, Daniel
    Scrimieri, Daniele
    Abdullatif, Amr Rashad Ahmed
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 986 - 991
  • [7] Machine Learning Classification Based Techniques for Fraud Discovery in Credit Card Datasets
    Ogundokun, Roseline Oluwaseun
    Misra, Sanjay
    Ogundokun, Opeyemi Eyitayo
    Oluranti, Jonathan
    Maskeliunas, Rytis
    APPLIED INFORMATICS (ICAI 2021), 2021, 1455 : 26 - 38
  • [8] Application of Ensemble Machine Learning for Construction Safety Risk Assessment
    George M.R.
    Nalluri M.R.
    Anand K.B.
    Journal of The Institution of Engineers (India): Series A, 2022, 103 (04): : 989 - 1003
  • [9] A hybrid ensemble for classification in multiclass datasets: An application to oilseed disease dataset
    Chaudhary, Archana
    Kolhe, Savita
    Kamal, Raj
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2016, 124 : 65 - 72
  • [10] Discussion on classification problems in machine learning
    Yao, Han
    PROCEEDINGS OF THE 2015 2ND INTERNATIONAL WORKSHOP ON MATERIALS ENGINEERING AND COMPUTER SCIENCES (IWMECS 2015), 2015, 33 : 761 - 763