Cascade of genetic algorithm and decision tree for cancer classification on gene expression data

被引:0
作者
Yeh, Jinn-Yi [1 ]
Wu, Tai-Hsi [2 ]
机构
[1] Natl Chiayi Univ, Dept Management Informat Syst, Chiayi 600, Taiwan
[2] Natl Taipei Univ, Dept Business Adm, Taipei 237, Taiwan
关键词
cancer classification; gene expression data; genetic algorithms; decision tree; SUPPORT VECTOR MACHINES; TUMOR CLASSIFICATION; MICROARRAY DATA; FEATURE-SELECTION; CLUSTER-ANALYSIS; VISUALIZATION; PREDICTION; DISCOVERY; SYSTEMS; CELL;
D O I
10.1111/j.1468-0394.2010.00522.x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer classification, through gene expression data analysis, has produced remarkable results, and has indicated that gene expression assays could significantly aid in the development of efficient cancer diagnosis and classification platforms. However, cancer classification, based on DNA array data, remains a difficult problem. The main challenge is the overwhelming number of genes relative to the number of training samples, which implies that there are a large number of irrelevant genes to be dealt with. Another challenge is from the presence of noise inherent in the data set. It makes accurate classification of data more difficult when the sample size is small. We apply genetic algorithms (GAs) with an initial solution provided by t statistics, called t-GA, for selecting a group of relevant genes from cancer microarray data. The decision-tree-based cancer classifier is built on the basis of these selected genes. The performance of this approach is evaluated by comparing it to other gene selection methods using publicly available gene expression data sets. Experimental results indicate that t-GA has the best performance among the different gene selection methods. The Z-score figure also shows that some genes are consistently preferentially chosen by t-GA in each data set.
引用
收藏
页码:201 / 218
页数:18
相关论文
共 50 条
  • [21] Applying Grey Wolf Optimizer-based decision tree classifer for cancer classification on gene expression data
    Vosooghifard, Mahtab
    Ebrahimpour, Hossein
    [J]. 2015 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2015, : 147 - 151
  • [22] A New Hybrid and Ensemble Gene Selection Approach with an Enhanced Genetic Algorithm for Classification of Microarray Gene Expression Values on Leukemia Cancer
    Bilen, Mehmet
    Isik, Ali H.
    Yigit, Tuncay
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 1554 - 1566
  • [23] Similarity-balanced discriminant neighbor embedding and its application to cancer classification based on gene expression data
    Zhang, Li
    Qian, Liqiang
    Ding, Chuntao
    Zhou, Weida
    Li, Fanzhang
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 64 : 236 - 245
  • [24] Cancer classification using gene expression data
    Lu, Y
    Han, JW
    [J]. INFORMATION SYSTEMS, 2003, 28 (04) : 243 - 268
  • [25] A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data
    Wang, Hong
    Jing, Xingjian
    Niu, Ben
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 126 : 8 - 19
  • [26] Feature Selection of Gene Expression Data Using a Modified Artificial Fish Swarm Algorithm With Population Variation
    Li, Zong-Zheng
    Wang, Fang-Ling
    Qin, Feng
    Yusoff, Yusliza Binti
    Zain, Azlan Mohd
    [J]. IEEE ACCESS, 2024, 12 : 72688 - 72706
  • [27] Classification of breast cancer using microarray gene expression data: A survey
    Abd-Elnaby, Muhammed
    Alfonse, Marco
    Roushdy, Mohamed
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 117
  • [28] Feature selection of gene expression data for Cancer classification using double RBF-kernels
    Liu, Shenghui
    Xu, Chunrui
    Zhang, Yusen
    Liu, Jiaguo
    Yu, Bin
    Liu, Xiaoping
    Dehmer, Matthias
    [J]. BMC BIOINFORMATICS, 2018, 19
  • [29] A hybrid decision tree/genetic algorithm method for data mining
    Carvalho, DR
    Freitas, AA
    [J]. INFORMATION SCIENCES, 2004, 163 (1-3) : 13 - 35
  • [30] A hybrid feature selection algorithm for gene expression data classification
    Lu, Huijuan
    Chen, Junying
    Yan, Ke
    Jin, Qun
    Xue, Yu
    Gao, Zhigang
    [J]. NEUROCOMPUTING, 2017, 256 : 56 - 62