Bi-clustering Gene Expression Data Using Co-similarity

被引：0

作者：

Hussain, Syed Fawad

机构：

来源：

ADVANCED DATA MINING AND APPLICATIONS, PT I | 2011年 / 7120卷

关键词：

Gene Expression Analysis; Bi-clustering; Co-similarity; CLASSIFICATION; PATTERNS; CANCER;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a new framework for hi-clustering gene expression data that is based on the notion of co-similarity between genes and samples. Our work is based on a co-similarity based framework that iteratively learns similarity between rows using similarity between columns and vice-versa in a matrix. The underlying concept. which is usually referred to as bi-clustering in the domain of bioinformatics, aims to find groupings of the feature set that exhibit similar behavior across sample subsets. The algorithm has previously been shown to work well for document clustering in a sparse matrix representation. We propose a variation of the method suited for analyzing data that is represented as a dense matrix and is non-homogenous as is the case in gene expression. Our experiments show that, with the proposed variations, the method is well suited for finding bi-clusters with high degree of homogeneity and we provide empirical results on real world cancer datasets.

引用

页码：190 / 200

页数：11

共 21 条

[1] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
Alon, U
Barkai, N
Notterman, DA
Gish, K
Ybarra, S
Mack, D
Levine, AJ
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
[2] Banerjee A., 2004, KDD, P509, DOI DOI 10.1145/1014052.1014111
[3] Barkow S., 2006, BICAT BICLUSTERING A, V22
[4] Ben-Dor A., J COMPUTATIONAL BIOL, V10, P373
[5] BISSON G, 2008, INT C MACH LEARN APP, P211, DOI DOI 10.1109/ICMLA.2008.103
[6] Cheng Yizong., 2000, BICLUSTERING EXPRESS, P93
[7] Coclustering of human cancer microarrays using minimum sum-squared residue coclustering
Cho, Hyuk
Dhillon, Inderjit S.
[J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2008, 5 (03) : 385 - 400
[8] Comparison of discrimination methods for the classification of tumors using gene expression data
Dudoit, S
Fridlyand, J
Speed, TP
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
[9] Cluster analysis and display of genome-wide expression patterns
Eisen, MB
Spellman, PT
Brown, PO
Botstein, D
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868
[10] Giannakidou Eirini, 2008, 2008 9th International Conference on Web-Age Information Management (WAIM), P317, DOI 10.1109/WAIM.2008.61

← 1 2 3 →