An expert system to classify microarray gene expression data using gene selection by decision tree

被引:41
作者
Horng, Jorng-Tzong [1 ,2 ]
Wu, Li-Cheng [2 ]
Liu, Baw-Juine [3 ]
Kuo, Jun-Li [1 ]
Kuo, Wen-Horng [4 ]
Zhang, Jin-Jian [4 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Chungli 32054, Taiwan
[2] Natl Cent Univ, Inst Syst Biol & Bioinformat, Chungli 32054, Taiwan
[3] Yuan Ze Univ, Dept Comp Sci & Informat Engn, Jhongli, Taiwan
[4] Natl Cent Univ, Coll Med, Chungli 32054, Taiwan
关键词
Expert system; Machine learning; Bioinformatics; Microarray gene expression; Decision tree; METASTATIC BREAST-CANCER; MOLECULAR CLASSIFICATION; TRASTUZUMAB; INFORMATION; PREDICTION; MULTICLASS; DISCOVERY; DATABASES; MODEL;
D O I
10.1016/j.eswa.2008.12.037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gene selection call help the analysis of microarray gene expression data. However, it is very difficult to obtain a satisfactory classification result by machine learning techniques because of both the curse-of-dimensionality problem and the over-fitting problem. That is, the dimensions of the features are too large but the samples are too few. In this study, we designed an approach that attempts to avoid these two problems and then used it to select a small set of significant biomarker genes for diagnosis. Finally, we attempted to use these markers for the classification of cancer. This approach was tested the approach on a number of microarray datasets in order to demonstrate that it performs well and is both useful and reliable. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:9072 / 9081
页数:10
相关论文
共 37 条
[1]  
[Anonymous], PRACTICAL BUSINESS S
[2]   Exploiting scale-free information from expression data for cancer classification [J].
Antonov, AV ;
Tetko, IV ;
Kosykh, D ;
Surmeli, D ;
Mewes, HW .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2005, 29 (04) :288-293
[3]   Optimization models for cancer classification: extracting gene interaction information from microarray expression data [J].
Antonov, AV ;
Tetko, IV ;
Mader, MT ;
Budczies, J ;
Mewes, HW .
BIOINFORMATICS, 2004, 20 (05) :644-U145
[4]   Microarray analysis of trophoblast differentiation: gene expression reprogramming in key gene function categories [J].
Aronow, BJ ;
Richardson, BD ;
Handwerger, S .
PHYSIOLOGICAL GENOMICS, 2001, 6 (02) :105-116
[5]   Gene selection using a two-level hierarchical Bayesian model [J].
Bae, K ;
Mallick, BK .
BIOINFORMATICS, 2004, 20 (18) :3423-3430
[6]   Application of DNA microarray technology in determining breast cancer prognosis and therapeutic response [J].
Brennan, DJ ;
O'Brien, SL ;
Fagan, A ;
Culhane, AC ;
Higgins, DG ;
Duffy, MJ ;
Gallagher, WM .
EXPERT OPINION ON BIOLOGICAL THERAPY, 2005, 5 (08) :1069-1083
[7]  
Brown T.A., 2002, Genomes, Vsecond
[8]   PCP: a program for supervised classification of gene expression profiles [J].
Buturovic, LJ .
BIOINFORMATICS, 2006, 22 (02) :245-247
[9]   Differential coexpression analysis using microarray data and its application to human cancer [J].
Choi, JK ;
Yu, US ;
Yoo, OJ ;
Kim, S .
BIOINFORMATICS, 2005, 21 (24) :4348-4355
[10]   Biomarker discovery in microarray gene expression data with Gaussian processes [J].
Chu, W ;
Ghahramani, Z ;
Falciani, F ;
Wild, DL .
BIOINFORMATICS, 2005, 21 (16) :3385-3393