Optimal search-based gene subset selection for gene array cancer classification

被引:27
作者
Li, Jiexun [1 ]
Su, Hua
Chen, Hsinchun
Futscher, Bernard W.
机构
[1] Univ Arizona, Eller Coll Management, Dept Management Informat Syst, Artificial Intelligence Lab, Tucson, AZ 85721 USA
[2] Univ Arizona, Arizona Canc Ctr, Tucson, AZ 85721 USA
来源
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE | 2007年 / 11卷 / 04期
基金
美国国家卫生研究院;
关键词
genetics; medical diagnosis; optimization methods; pattern classification; search methods;
D O I
10.1109/TITB.2007.892693
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High dimensionality has been a major problem for gene array-hased cancer classification. It is critical to identify marker genes for cancer diagnoses. We developed a framework of gene selection methods based on previous studies. This paper focuses on optimal search-based subset selection methods because they evaluate the group performance of genes and help to pinpoint global optimal set of marker genes. Notably, this paper is the first to introduce tabu search (TS) to gene selection from high-dimensional gene array data. Our comparative study of gene selection methods demonstrated the effectiveness of optimal search-based gene subset selection to identify cancer marker genes. TS was shown to be a promising tool for gene subset selection.
引用
收藏
页码:398 / 405
页数:8
相关论文
共 42 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]  
[Anonymous], 1992, P 10 NAT C ART INT S
[3]  
[Anonymous], 1999, TABU SEARCH
[4]  
[Anonymous], 1993, ESSENTIALS ARTIFICIA
[5]  
Bishop CM., 1995, Neural networks for pattern recognition
[6]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[7]  
Bo TH, 2002, GENOME BIOL, V3
[8]  
Chen HC, 1998, J AM SOC INFORM SCI, V49, P693, DOI 10.1002/(SICI)1097-4571(199806)49:8<693::AID-ASI4>3.0.CO
[9]  
2-O
[10]   An improved branch and bound algorithm for feature selection [J].
Chen, XW .
PATTERN RECOGNITION LETTERS, 2003, 24 (12) :1925-1933