Feature selection with ensemble learning for prostate cancer diagnosis from microarray gene expression

Cited by: 21
Authors
Gumaei, Abdu [1, 2]
Sammouda, Rachid [3]
Al-Rakhami, Mabrook [1]
AlSalman, Hussain [3]
El-Zaart, Ali [4]
Affiliations
[1] King Saud Univ, Res Chair Pervas & Mobile Comp, Riyadh, Saudi Arabia
[2] Taiz Univ, Taiz, Yemen
[3] King Saud Univ, Riyadh, Saudi Arabia
[4] Beirut Arab Univ, Beirut, Lebanon
Keywords
prostate cancer; microarray data; machine learning; random committee; ensemble learning; feature selection; 10-fold cross-validation; RNA-seq; classification; machine; prediction; dimension; pathways; tool
DOI
10.1177/1460458221989402
Chinese Library Classification number
R19 [Health care organizations and services (health services administration)]
Abstract
Cancer diagnosis using machine learning algorithms is one of the main research topics in computer-based medical science. Prostate cancer is considered one of the leading causes of death worldwide. Analyzing microarray gene expression data with machine learning and soft computing algorithms is a useful tool for detecting prostate cancer in medical diagnosis. Although traditional machine learning methods have been applied successfully to prostate cancer detection, microarray data combine a large number of attributes with a small sample size, which remains a challenge and limits the effectiveness of these methods for medical diagnosis. Selecting a subset of relevant features and choosing an appropriate machine learning method can exploit the information in microarray data to improve detection accuracy. In this paper, we propose using a correlation feature selection (CFS) method with random committee (RC) ensemble learning to detect prostate cancer from microarray gene expression data. A set of experiments is conducted on a public benchmark dataset using 10-fold cross-validation to evaluate the proposed approach. The experimental results show that the proposed approach attains 95.098% accuracy, which is higher than that of related methods on the same dataset.
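The feature selection step named in the abstract can be illustrated with a minimal sketch. It scores a feature subset with the standard CFS merit, k * r_cf / sqrt(k + k * (k - 1) * r_ff), where r_cf is the mean feature-class correlation and r_ff is the mean feature-feature correlation, and grows the subset greedily. Several things here are assumptions, not details from the paper: the use of absolute Pearson correlation, the greedy forward search, the function names, and the toy data. The random committee classifier and the 10-fold cross-validation are not covered.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def cfs_merit(columns, labels, subset):
    """CFS merit of a feature subset, using absolute Pearson correlations (an assumption)."""
    k = len(subset)
    # Mean correlation between each selected feature and the class label.
    r_cf = sum(abs(pearson(columns[j], labels)) for j in subset) / k
    if k == 1:
        return r_cf
    # Mean correlation between every pair of selected features (redundancy).
    pairs = [(a, b) for i, a in enumerate(subset) for b in subset[i + 1:]]
    r_ff = sum(abs(pearson(columns[a], columns[b])) for a, b in pairs) / len(pairs)
    return k * r_cf / math.sqrt(k + k * (k - 1) * r_ff)


def cfs_forward_select(columns, labels):
    """Greedy forward search: add the best feature until the merit stops improving."""
    selected, best = [], 0.0
    remaining = set(range(len(columns)))
    while remaining:
        candidates = [(cfs_merit(columns, labels, selected + [j]), j)
                      for j in sorted(remaining)]
        # max() returns the first maximal element, so ties go to the lowest index.
        merit, j = max(candidates, key=lambda c: c[0])
        if merit <= best:
            break
        selected.append(j)
        remaining.discard(j)
        best = merit
    return selected, best


# Toy data (hypothetical): 8 samples, binary class label.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
columns = [
    [0, 0, 0, 0, 1, 1, 1, 1],  # feature 0: perfectly informative
    [0, 0, 0, 0, 1, 1, 1, 1],  # feature 1: redundant copy of feature 0
    [0, 1, 0, 1, 0, 1, 0, 1],  # feature 2: uncorrelated with the label
]
selected, best = cfs_forward_select(columns, labels)
print(selected, best)
```

On this toy data the search keeps only feature 0: the redundant copy does not raise the merit, and the uncorrelated feature lowers it. That is the behaviour that makes this kind of selection attractive when there are many more attributes than samples.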
Pages: 13