The classification of tumor using gene expression profile based on support vector machines and factor analysis

被引:0
作者
Wang, Shulin [1 ,2 ]
Wang, Ji [1 ]
Chen, Huowang [1 ]
Tang, Wensheng [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Hunan, Peoples R China
[2] Hunan Univ, Coll Comp & Commun, Changsha 410082, Hunan, Peoples R China
来源
ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2 | 2006年
基金
中国国家自然科学基金;
关键词
biological data mining; feature selection; classification; gene expression profiles; factor analysis; support vector machines;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gene expression data that is being used to gather information from tissue samples is expected to significantly improve the development of efficient tumor diagnosis and to provide understanding and insight into tumor related cellular processes. In this paper we propose a novel feature selection approach which integrates the feature score criterion with factor analysis to further improve the SVM-based classification performance of gene expression data. We examine two sets of published gene expression data to validate the novel feature selection method by means of SVM classifier with different parameters. Experiments show that the proposed hybrid method can select a small quantity of principal factors to represent a large number of genes and SVM has a superior classification performance with the common factors which are extracted from gene expression data. Moreover experiment results demonstrate successful cross-validation accuracy of 92% for the colon dataset and 100% for the leukemia dataset.
引用
收藏
页码:471 / +
页数:2
相关论文
共 17 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]   Knowledge-based analysis of microarray gene expression data by using support vector machines [J].
Brown, MPS ;
Grundy, WN ;
Lin, D ;
Cristianini, N ;
Sugnet, CW ;
Furey, TS ;
Ares, M ;
Haussler, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) :262-267
[3]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[4]  
CHO SB, 2003, P 1 AS PAC BIOINF C, P189
[5]  
Cristianini N., 2000, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods
[6]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[7]   Support vector machine classification and validation of cancer tissue samples using microarray expression data [J].
Furey, TS ;
Cristianini, N ;
Duffy, N ;
Bednarski, DW ;
Schummer, M ;
Haussler, D .
BIOINFORMATICS, 2000, 16 (10) :906-914
[8]   An accelerated procedure for recursive feature ranking on microarray data [J].
Furlanello, C ;
Serafini, M ;
Merler, S ;
Jurman, G .
NEURAL NETWORKS, 2003, 16 (5-6) :641-648
[9]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[10]   Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422