Application of Ensemble Machine Learning for Classification Problems on Very Small Datasets

Cited by: 0

Authors:

Pavic, Ognjen [1]

Dasic, Lazar [1]

Geroski, Tijana [2,3]

Pirkovic, Marijana Stanojevic [4]

Milovanovic, Aleksandar [1]

Filipovic, Nenad [2,3]

Affiliations:

[1] Univ Kragujevac, Inst Informat Technol, Kragujevac 34000, Serbia

[2] Univ Kragujevac, Fac Engn, Kragujevac 34000, Serbia

[3] Bioengn Res & Dev Ctr BioIRC, Kragujevac 34000, Serbia

[4] Univ Kragujevac, Fac Med Sci, Kragujevac 34000, Serbia

Source:

APPLIED ARTIFICIAL INTELLIGENCE 2: MEDICINE, BIOLOGY, CHEMISTRY, FINANCIAL, GAMES, ENGINEERING, SICAAI 2023 | 2024 / Vol. 999

Keywords:

Machine learning; Classification; Risk assessment; Random forest; Ensemble

DOI:

10.1007/978-3-031-60840-7_15

CLC number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Machine learning has become one of the most widely used branches of artificial intelligence in recent years. It is most commonly applied to classification or regression problems through supervised learning. Machine learning models require data of high quality and sufficient quantity to produce good results. This paper investigates an approach that uses ensemble learning, aggregating multiple machine learning models, to improve predictive performance when only a very limited amount of training data is available. The ensemble model was trained on a patient fractional flow reserve biomarker dataset with the goal of classifying patients into risk classes based on their risk of suffering an acute myocardial infarction. The ensemble comprised multiple random forest classification models, each trained with a different combination of training and test data, to improve prediction accuracy over a single random forest model. The final ensemble achieved a prediction accuracy of 71.3%, a substantial improvement over the 36% prediction accuracy of a single random forest classification model.
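
The idea described in the abstract, several random forests each trained on a different train/test split and then aggregated, could be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes scikit-learn is available, uses synthetic data in place of the fractional flow reserve dataset, and aggregates by unweighted majority vote, which the abstract does not specify. The helper names `train_split_ensemble` and `predict_majority` and all parameter values are hypothetical.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split


def train_split_ensemble(X, y, n_models=15, test_size=0.25):
    """Train one random forest per random train/validation split (hypothetical helper)."""
    members = []
    for seed in range(n_models):
        # Each member sees a different stratified subset of the small dataset.
        X_tr, _, y_tr, _ = train_test_split(
            X, y, test_size=test_size, stratify=y, random_state=seed
        )
        forest = RandomForestClassifier(n_estimators=50, random_state=seed)
        forest.fit(X_tr, y_tr)
        members.append(forest)
    return members


def predict_majority(members, X):
    """Aggregate member predictions by unweighted majority vote (an assumption).

    Requires non-negative integer class labels because of np.bincount.
    """
    votes = np.stack([m.predict(X) for m in members])  # shape: (n_models, n_samples)
    return np.array([np.bincount(column).argmax() for column in votes.T])


# Synthetic stand-in for a very small, three-class dataset (assumed sizes).
X, y = make_classification(
    n_samples=80, n_features=8, n_informative=4, n_classes=3, random_state=0
)
# Keep a hold-out set that no ensemble member ever trains on.
X_dev, X_hold, y_dev, y_hold = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

members = train_split_ensemble(X_dev, y_dev)
preds = predict_majority(members, X_hold)
print("ensemble hold-out accuracy:", float((preds == y_hold).mean()))
```

The accuracy printed on synthetic data says nothing about the 71.3% and 36% figures reported above; the sketch only shows the mechanics of training on varied splits and combining the resulting votes.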


Pages: 108-115

Page count: 8
