Performance Comparison of Feature Selection Methods for Prediction in Medical Data

Cited by: 3

Authors:

Khalid, Nur Hidayah Mohd [1]

Ismail, Amelia Ritahani [1]

Aziz, Normaziah Abdul [1]

Hussin, Amir Aatieff Amir [1]

Affiliations:

[1] Int Islamic Univ Malaysia, Dept Comp Sci, Kulliyyah Informat & Commun Technol, POB 10, Kuala Lumpur 50728, Malaysia

Source:

SOFT COMPUTING IN DATA SCIENCE, SCDS 2023 | 2023 / Vol. 1771

Keywords:

CatBoost; Feature selection; RFE; Lasso;

DOI:

10.1007/978-981-99-0405-1_7

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Along with technological advancement, the application of machine learning algorithms in industry, notably in the medical field, has grown and progressed quickly. Medical databases commonly contain a lot of information about the medical histories and conditions of patients, and it is challenging to identify and extract the information that will be relevant and meaningful for machine learning modelling. Moreover, the efficacy of a predictive machine learning algorithm can be enhanced by using only useful and pertinent information. Hence, feature selection is proposed to determine the significant features, and it should be fully utilized when building machine learning models. This study analyzes filter, wrapper, and embedded feature selection methods for medical data with two predictive machine learning algorithms, Random Forest and CatBoost. The experiment evaluates the performance of the machine learning models with and without feature selection. According to the results, CatBoost with RFE shows the best performance in comparison to Random Forest with other feature selection methods.
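To make the filter family mentioned in the abstract concrete, the following is a minimal sketch in plain Python. It is not the authors' pipeline: the abstract does not say which filter statistic was used, so absolute Pearson correlation with the label is an assumption, and the feature names (`age`, `resting_bp`, `record_id`), the toy rows, and the helper names `pearson` and `filter_select` are all invented for illustration. Wrapper methods such as RFE and embedded methods such as Lasso, which need a fitted model, are deliberately left out.

```python
import math


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def filter_select(rows, labels, feature_names, k):
    """Score each feature independently of any model, then keep the top k.

    Assumption: the score is |Pearson correlation with the label|.
    """
    scores = {}
    for j, name in enumerate(feature_names):
        column = [row[j] for row in rows]
        scores[name] = abs(pearson(column, labels))
    ranked = sorted(feature_names, key=lambda name: scores[name], reverse=True)
    return ranked[:k], scores


# Toy data invented for illustration only; not taken from the paper.
feature_names = ["age", "resting_bp", "record_id"]
rows = [
    [34, 118, 1],
    [51, 135, 2],
    [46, 128, 6],
    [63, 150, 5],
    [29, 110, 3],
    [58, 142, 4],
]
labels = [0, 1, 0, 1, 0, 1]  # hypothetical binary outcome

selected, scores = filter_select(rows, labels, feature_names, k=2)
print("selected features:", selected)
```

In this toy setting the arbitrary `record_id` column scores far lower than the two clinical-looking columns, so it is the one dropped. A downstream classifier (Random Forest or CatBoost in the study) would then be trained on the selected columns only and compared against the same model trained on all columns.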


Pages: 92-106

Page count: 15
