A Granular Self-Organizing Map for Clustering and Gene Selection in Microarray Data

被引:22
作者
Ray, Shubhra Sankar [1 ]
Ganivada, Avatharam [2 ]
Pal, Sankar K. [1 ]
机构
[1] Indian Stat Inst, Machine Intelligence Unit, Ctr Soft Comp Res, Kolkata 700108, India
[2] Indian Stat Inst, Ctr Soft Comp Res, Kolkata 700108, India
关键词
Bioinformatics; clustering; feature selection; granular neural network; rough-fuzzy computing; NEURAL-NETWORK; FUZZY; CLASSIFICATION; CANCER; SETS;
D O I
10.1109/TNNLS.2015.2460994
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new granular self-organizing map (GSOM) is developed by integrating the concept of a fuzzy rough set with the SOM. While training the GSOM, the weights of a winning neuron and the neighborhood neurons are updated through a modified learning procedure. The neighborhood is newly defined using the fuzzy rough sets. The clusters (granules) evolved by the GSOM are presented to a decision table as its decision classes. Based on the decision table, a method of gene selection is developed. The effectiveness of the GSOM is shown in both clustering samples and developing an unsupervised fuzzy rough feature selection (UFRFS) method for gene selection in microarray data. While the superior results of the GSOM, as compared with the related clustering methods, are provided in terms of beta-index, DB-index, Dunn-index, and fuzzy rough entropy, the genes selected by the UFRFS are not only better in terms of classification accuracy and a feature evaluation index, but also statistically more significant than the related unsupervised methods. The C-codes of the GSOM and UFRFS
引用
收藏
页码:1890 / 1906
页数:17
相关论文
共 36 条
[1]   FatiGO:: a web tool for finding significant associations of Gene Ontology terms with groups of genes [J].
Al-Shahrour, F ;
Díaz-Uriarte, R ;
Dopazo, J .
BIOINFORMATICS, 2004, 20 (04) :578-580
[2]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[3]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[4]   A combination of rough-based feature selection and RBF neural network for classification using gene expression data [J].
Chiang, Jung-Hsien ;
Ho, Shing-Hua .
IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2008, 7 (01) :91-99
[5]   Flaxseed inhibits metastasis and decreases extracellular vascular endothelial growth factor in human breast cancer xenografts [J].
Dabrosin, C ;
Chen, JM ;
Wang, L ;
Thompson, LU .
CANCER LETTERS, 2002, 185 (01) :31-37
[6]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[7]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[8]  
Dunn J. C., 1973, Journal of Cybernetics, V3, P32, DOI 10.1080/01969727308546046
[9]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[10]   Fuzzy rough sets, and a granular neural network for unsupervised feature selection [J].
Ganivada, Avatharam ;
Ray, Shubhra Sankar ;
Pal, Sankar K. .
NEURAL NETWORKS, 2013, 48 :91-108