Gene expression data analysis with a dynamically extended self-organized map that exploits class information

Cited by: 14
Authors
Mavroudi, S
Papadimitriou, S
Bezerianos, A [1]
Affiliations
[1] Univ Patras, Sch Med, Dept Phys Med, Patras 26500, Greece
[2] Inst Educ Technol, Dept Informat Management, Kavala, Greece
Keywords
DOI
10.1093/bioinformatics/18.11.1446
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Motivation: Currently the most popular approach to analyzing genome-wide expression data is clustering. One of the major drawbacks of most existing clustering methods is that the number of clusters has to be specified a priori. Furthermore, purely unsupervised algorithms ignore prior biological knowledge entirely. Moreover, most current tools lack an effective framework for tightly integrating unsupervised and supervised learning in the analysis of high-dimensional expression data, and only very few multi-class supervised approaches are designed to make effective use of multiple functional class labels.
Results: The paper adapts a novel Self-Organizing Map, called the supervised Network Self-Organized Map (sNet-SOM), to the peculiarities of multi-labeled gene expression data. The sNet-SOM determines the number of clusters adaptively through a dynamic extension process. This process is driven by an inhomogeneous measure that tries to balance unsupervised, supervised and model complexity criteria. Nodes within a rectangular grid are grown at the boundary nodes, weights are rippled from the internal nodes towards the outer nodes of the grid, and whole columns are inserted within the map. The appropriate level of expansion is determined automatically. Multiple sNet-SOM models are constructed dynamically, each for a different unsupervised/supervised balance, and model selection criteria are used to select the optimum one. The results indicate that sNet-SOM yields performance competitive with other recently proposed approaches for supervised classification at a significantly reduced computational cost, and that it offers extensive potential for exploratory analysis within the same framework. Furthermore, it explores simple design decisions that are easier to comprehend and computationally efficient.
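The abstract does not give the form of the measure that drives map expansion, so the following is only a minimal sketch of the general idea: a per-node score that blends an unsupervised term (quantization error) with a supervised term (entropy of the class labels mapped to the node). The function names, the linear blend, the `balance` parameter and the fixed `threshold` (a crude stand-in for the model complexity criterion) are all assumptions made for illustration, not the sNet-SOM definitions.

```python
import math
from collections import Counter


def quantization_error(weight, patterns):
    """Mean squared Euclidean distance between a node weight and its patterns."""
    if not patterns:
        return 0.0
    total = sum(sum((w - x) ** 2 for w, x in zip(weight, p)) for p in patterns)
    return total / len(patterns)


def label_entropy(label_sets):
    """Shannon entropy (bits) of the labels at a node; a gene may carry several."""
    counts = Counter(label for labels in label_sets for label in labels)
    n = sum(counts.values())
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def expansion_score(weight, patterns, label_sets, balance):
    """Hypothetical blend: balance=0 is purely unsupervised, balance=1 purely supervised."""
    unsupervised = quantization_error(weight, patterns)
    supervised = label_entropy(label_sets)
    return (1.0 - balance) * unsupervised + balance * supervised


def nodes_to_expand(nodes, balance, threshold):
    """Return ids of nodes whose blended score exceeds the (assumed) threshold."""
    return [
        node_id
        for node_id, (weight, patterns, label_sets) in nodes.items()
        if expansion_score(weight, patterns, label_sets, balance) > threshold
    ]


# Toy data: node "a" is compact and label-pure, node "b" is spread out and mixed.
nodes = {
    "a": ([0.0, 0.0], [[0.0, 0.0], [0.0, 0.0]], [{"ribosomal"}, {"ribosomal"}]),
    "b": ([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]], [{"ribosomal"}, {"tca"}]),
}
candidates = nodes_to_expand(nodes, balance=0.5, threshold=0.5)
```

Sweeping `balance` over several values and keeping the map chosen by a model selection criterion would loosely mirror the multi-model construction the abstract describes.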
Pages: 1446-1453
Page count: 8