Fuzzy-rough attribute reduction via mutual information with an application to cancer classification

被引:70
作者
Xu, F. F. [1 ]
Miao, D. Q.
Wei, L.
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 200092, Peoples R China
基金
高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
Cancer classification; Attribute reduction; Fuzzy rough sets; Mutual information; KNOWLEDGE REDUCTION; MICROARRAY; SELECTION; PATTERNS; SETS;
D O I
10.1016/j.camwa.2008.10.027
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Establishing a classification model for cancer recognition based on DNA microarrays Is useful for cancer diagnosis. Feature selection is a key step to perform cancer classification with DNA microarrays, for there is a large number of genes from which to predict classes and a relatively small number of samples. Automatic methods must be developed for extracting relevant genes which are essential for classification. This paper proposes a novel approach for reducing data redundancy based on fuzzy rough set theory and information theory. A mutual information-based algorithm for attribute reduction is suggested. The method is applied to the problem of gene selection for cancer classification. Experimental results show that the algorithm is more effective than conventional rough sets based approaches. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1010 / 1017
页数:8
相关论文
共 34 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]  
Dash M., 1997, Intelligent Data Analysis, V1
[3]  
DeRisi J, 1996, NAT GENET, V14, P457
[4]  
DEVIJVER PA, 1982, PATTERN RECOGNITION
[5]  
Ding C.H. Q., 2002, RECOMB, P127
[6]   ROUGH FUZZY-SETS AND FUZZY ROUGH SETS [J].
DUBOIS, D ;
PRADE, H .
INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1990, 17 (2-3) :191-209
[7]   Support vector machine classification and validation of cancer tissue samples using microarray expression data [J].
Furey, TS ;
Cristianini, N ;
Duffy, N ;
Bednarski, DW ;
Schummer, M ;
Haussler, D .
BIOINFORMATICS, 2000, 16 (10) :906-914
[8]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[9]   Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation [J].
Hu, Qinghua ;
Xie, Zongxia ;
Yu, Daren .
PATTERN RECOGNITION, 2007, 40 (12) :3509-3521
[10]  
Jensen R, 2002, PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, P29, DOI 10.1109/FUZZ.2002.1004954