Application of Ensemble Machine Learning for Classification Problems on Very Small Datasets

被引:0
|
作者
Pavic, Ognjen [1 ]
Dasic, Lazar [1 ]
Geroski, Tijana [2 ,3 ]
Pirkovic, Marijana Stanojevic [4 ]
Milovanovic, Aleksandar [1 ]
Filipovic, Nenad [2 ,3 ]
机构
[1] Univ Kragujevac, Inst Informat Technol, Kragujevac 34000, Serbia
[2] Univ Kragujevac, Fac Engn, Kragujevac 34000, Serbia
[3] Bioengn Res & Dev Ctr BioIRC, Kragujevac 34000, Serbia
[4] Univ Kragujevac, Fac Med Sci, Kragujevac 34000, Serbia
来源
APPLIED ARTIFICIAL INTELLIGENCE 2: MEDICINE, BIOLOGY, CHEMISTRY, FINANCIAL, GAMES, ENGINEERING, SICAAI 2023 | 2024年 / 999卷
关键词
Machine learning; Classification; Risk assessment; Random forest; Ensemble First Section;
D O I
10.1007/978-3-031-60840-7_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning is one of the most widely used branches of artificial intelligence in recent years. It is most commonly used for solving classification or regression problems through the utilization of supervised learning approaches. Machine learning models require high quality and a sufficient quantity of data to produce good results. This paper investigates an approach which incorporates ensemble learning through the aggregation of multiple machine learning models for the purposes of increasing prediction capabilities in cases in which a very limited amount of data is available for training. The ensemble model was trained on a patient fractional flow reserve biomarker dataset and with the goal of classifying patients into risk classes based on their risk of suffering an acute myocardial infarction. The ensemble model was comprised of multiple random forest classification models which were trained with different combinations of training and test data to improve the prediction accuracy over the use of a single random forest model. Final ensemble achieved a prediction accuracy of 71.3% which was an immense improvement over the 36% prediction accuracy of a single random forest classification model.
引用
收藏
页码:108 / 115
页数:8
相关论文
共 50 条
  • [41] Comparison of Machine Learning Algorithms on Different Datasets
    Uysal, Elif
    Ozturk, Ali
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [42] Machine learning based novel ensemble learning framework for electricity operational forecasting
    Weeraddana, Dilusha
    Khoa, Nguyen Lu Dang
    Mahdavi, Nariman
    ELECTRIC POWER SYSTEMS RESEARCH, 2021, 201
  • [43] Machine learning ensemble for neurological disorders
    Kaur, Harkawalpreet
    Malhi, Avleen Kaur
    Pannu, Husanbir Singh
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16) : 12697 - 12714
  • [44] Ensemble Learning for Multidimensional Poverty Classification
    Abu Bakar, Azuraliza
    Hamdan, Rusnita
    Sani, Nor Samsiah
    SAINS MALAYSIANA, 2020, 49 (02): : 447 - 459
  • [45] Application of Machine Learning to Sleep Stage Classification
    Smith, Andrew
    Anand, Hardik
    Milosavljevic, Snezana
    Rentschler, Katherine M.
    Pocivavsek, Ana
    Valafar, Homayoun
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 349 - 354
  • [46] Using machine learning to predict concrete's strength: learning from small datasets
    Ouyang, Boya
    Song, Yu
    Li, Yuhai
    Wu, Feishu
    Yu, Huizi
    Wang, Yongzhe
    Yin, Zhanyuan
    Luo, Xiaoshu
    Sant, Gaurav
    Bauchy, Mathieu
    ENGINEERING RESEARCH EXPRESS, 2021, 3 (01):
  • [47] Scaling associative classification for very large datasets
    Venturini L.
    Baralis E.
    Garza P.
    Venturini, Luca (luca.venturini@polito.it), 1600, SpringerOpen (04)
  • [48] A Scalable Classification Algorithm for Very Large Datasets
    Delen, Dursun
    Kletke, Marilyn
    Kim, Jin-Hwa
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2005, 4 (02) : 83 - 94
  • [49] A Novel Machine Learning Algorithm to Reduce Prediction Error and Accelerate Learning Curve for Very Large Datasets
    Hou, Wenjun
    Perkowski, Marek
    2019 IEEE 49TH INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC (ISMVL), 2019, : 97 - 101
  • [50] QUIC Network Traffic Classification Using Ensemble Machine Learning Techniques
    Almuhammadi, Sultan
    Alnajim, Abdullatif
    Ayub, Mohammed
    APPLIED SCIENCES-BASEL, 2023, 13 (08):