A novel clustering approach based on the manifold structure of gene expression data

被引:0
作者
Shi, Jinlong [1 ]
Luo, Zhigang [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
来源
2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010) | 2010年
关键词
gene expression; geodesic distance; clustering; geometric representation;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Clustering is an effective approach for computing analysis of gene expression data. Various of clustering algorithms have been developed to give reasonable interpretations of biological data and discover biological meaningful patterns of cellular functions. Based on the manifold structure of gene expression data analyzed under the framework of geometric representation, a novel clustering approach is presented to reveal the nonlinear expression patterns. The novel clustering approach can be divided into the following computing steps. The first step is to construct a neighborhood graph for gene expression points through which the approximate geodesic distances between each two points can be obtained. Then, instead of Euclidean distance, approximate geodesic distance is exploited to reveal the similarity between gene profiles. Finally, via defining the geodesic distance between a cluster and a gene expression point, new clusters can be generated after essential iterative processes. Application of the approach to the yeast cell-cycle dataset validates its rationality and efficiency.
引用
收藏
页数:4
相关论文
共 50 条
[41]   Clustering gene expression data with a penalized graph-based metric [J].
Ariel E Bayá ;
Pablo M Granitto .
BMC Bioinformatics, 12
[42]   Classification of gene-expression data: The manifold-based metric learning way [J].
Lee, Jianguo ;
Zhang, Changshui .
PATTERN RECOGNITION, 2006, 39 (12) :2450-2463
[43]   The Clustering Algorithm Study of Gene Expression Data [J].
He Rui ;
Lin Chunmei .
ENVIRONMENTAL BIOTECHNOLOGY AND MATERIALS ENGINEERING, PTS 1-3, 2011, 183-185 :93-+
[44]   Clustering of high throughput gene expression data [J].
Pirim, Harun ;
Eksioglu, Burak ;
Perkins, Andy D. ;
Yuceer, Cetin .
COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (12) :3046-3061
[45]   A Similarity Measure for Clustering Gene Expression Data [J].
Baishya, Ram Charan ;
Sarmah, Rosy ;
Bhattacharyya, Dhruba Kumar ;
Dutta, Malay Ananda .
APPLIED ALGORITHMS, 2014, 8321 :245-256
[46]   A Novel Scalable Signature Based Subspace Clustering Approach for Big Data [J].
Gayathri, T. ;
Bhaskari, D. Lalitha .
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2019, 14 (02) :41-51
[47]   A novel neural network approach to gene clustering [J].
Hao, W ;
Yu, SN .
IEEE: 2005 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2005, :221-225
[48]   Gene clustering using Gene expression data and Self-Organizing Map (SOM) [J].
Kekic, Leila ;
Hodic, Jasin ;
Alispahic, Belma .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING 2017 (CMBEBIH 2017), 2017, 62 :445-451
[49]   Large-scale gene expression data clustering through incremental ensemble approach [J].
Khan, Imran ;
Shaikh, Abdul Khalique ;
Adhikari, Naresh .
MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (04)
[50]   Impact of missing data imputation methods on gene expression clustering and classification [J].
de Souto, Marcilio C. P. ;
Jaskowiak, Pablo A. ;
Costa, Ivan G. .
BMC BIOINFORMATICS, 2015, 16