Gene selection from large-scale gene expression data based on fuzzy interactive multi-objective binary optimization for medical diagnosis

被引:16
作者
Shahbeig, Saleh [1 ]
Rahideh, Akbar [1 ]
Helfroush, Mohammad Sadegh [1 ]
Kazemi, Kamran [1 ]
机构
[1] Shiraz Univ Technol, Dept Elect & Elect Engn, Shiraz, Iran
关键词
Fuzzy interactive method; Sub-optimal subset; Gene selection; Large-scale data; Multi-objective; Adaptive binary particle swarm optimization; MICROARRAY DATA CLASSIFICATION; SUPPORT VECTOR MACHINE; CANCER CLASSIFICATION; DISCRIMINANT-ANALYSIS; FEATURE-EXTRACTION; PREDICTION; ALGORITHM; PROFILE; SVM; ADENOCARCINOMA;
D O I
10.1016/j.bbe.2018.02.002
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
An efficient fuzzy interactive multi-objective optimization method is proposed to select the sub-optimal subset of genes from large-scale gene expression data It is based on the binary particle swarm optimization (BPSO) algorithm tuned by a chaotic method The proposed method is able to select the sub-optimal subset of genes with the least number of features that can accurately distinguish between the two classes, e. g. the normal and cancerous samples The proposed method is evaluated on several publicly available microarray and RNA-sequencing gene expression datasets such as leukemia, colon cancer, central nervous system, lung cancer, ovarian cancer, prostate cancer and RNA-seq lung disease The results indicate that the proposed method can identify the minimum number of genes to achieve the most accuracy, sensitivity and specificity in the classification process Achieving 100% accuracy in six out of the seven datasets investigated in this study, demonstrates the high capacity of the proposed algorithm to find the sub-optimal subset of genes This approach is useful in clinical applications to extract the most influential genes on a disease and to find the treatment procedure for the disease. (C) 2018 Nalecz Institute of Biocybernetics and Biomedical Engineering of the Polish Academy of Sciences Published by Elsevier B. V. All rights reserved.
引用
收藏
页码:313 / 328
页数:16
相关论文
共 45 条
[1]  
[Anonymous], 2003, J. Econ. Soc. Res
[2]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[3]   Distributed feature selection: An application to microarray data classification [J].
Bolon-Canedo, V. ;
Sanchez-Marono, N. ;
Alonso-Betanzos, A. .
APPLIED SOFT COMPUTING, 2015, 30 :136-150
[4]   An efficient statistical feature selection approach for classification of gene expression data [J].
Chandra, B. ;
Gupta, Manish .
JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (04) :529-535
[5]   A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue [J].
Chen, Zhenyu ;
Li, Jianping ;
Wei, Liwei .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 41 (02) :161-175
[6]   A combination of rough-based feature selection and RBF neural network for classification using gene expression data [J].
Chiang, Jung-Hsien ;
Ho, Shing-Hua .
IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2008, 7 (01) :91-99
[7]   Sparse maximum margin discriminant analysis for feature extraction and gene selection on gene expression data [J].
Cui, Yan ;
Zheng, Chun-Hou ;
Yang, Jian ;
Sha, Wen .
COMPUTERS IN BIOLOGY AND MEDICINE, 2013, 43 (07) :933-941
[8]   Multiple SVM-RFE for gene selection in cancer classification with expression data [J].
Duan, KB ;
Rajapakse, JC ;
Wang, HY ;
Azuaje, F .
IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2005, 4 (03) :228-234
[9]   Feature Selection for Microarray Gene Expression Data Using Simulated Annealing Guided by the Multivariate Joint Entropy [J].
Fernando Gonzalez-Navarro, Felix ;
Belanche-Munoz, Lluis A. .
COMPUTACION Y SISTEMAS, 2014, 18 (02) :275-293
[10]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537