Automatic Generation of Merge Factor for Clustering Microarray Data

被引:0
作者
Pavan, K. Karteeka [1 ]
Rao, Allam Appa [2 ]
Rao, A. V. Dattatreya [3 ]
Sridhar, G. R. [4 ]
机构
[1] RVR & JC Coll Engn, Guntur, Andhra Pradesh, India
[2] Jawaharlal Nehru Technol Univ, Kakinada, Andhra Pradesh, India
[3] Acharya Nagarjuna Univ, Guntur, Andhra Pradesh, India
[4] Endocrine & Diabet Ctr, Visakhapatnam, Andhra Pradesh, India
来源
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY | 2008年 / 8卷 / 09期
关键词
Bioinformatics; Microarray gene expression data; coexpressed genes; clustering; K-means; ISODATA; AGMFI;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Microarrays are made it possible to simultaneously monitor the expression profiles of thousands of genes under various experimental conditions. Identification of coexpressed genes and coherent patterns is the central goal in microarray or gene expression data analysis and is an important task in bioinformatics research. Cluster analysis of gene expression data has proved to be a useful tool for identifying coexpressed genes, biologically relevant groupings of genes and samples. In this paper we propose an algorithm -Automatic Generation of Merge Factor for Isodata - AGMFI, to cluster microarray data on the basis of ISODATA. The main idea of AGMFI is to generate initial values for merge factor, maximum merge times instead of selecting heuristic values as in ISODATA. One significant feature of AGMFI over K-means is that the initial number of clusters may be merged or split, and so the final number of clusters may be different from the number of clusters specified as part of the input. We evaluate it's performance by applying on a well-known publicly available microarray data sets and on simulated data set [3]. We compared the results with those of K-means clustering. The experiments indicate that the proposed algorithm AGMFI increased the enrichment of genes of similar function within the cluster.
引用
收藏
页码:127 / 131
页数:5
相关论文
共 50 条
  • [31] A novel pattern based clustering methodology for time-series microarray data
    Phan, Sieu
    Famili, Fazel
    Tang, Zoujian
    Pan, Youlian
    Liu, Ziying
    Ouyang, Junjun
    Lenferink, Anne
    O'Connor, Maureen Mc-Court
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2007, 84 (05) : 585 - 597
  • [32] Automatic similarity detection and clustering of data
    Einstein, Craig
    Chin, Peter
    CYBER SENSING 2017, 2017, 10185
  • [33] A Novel Clustering and Verification Based Microarray Data Bi-clustering Method
    Zhang, Yanjie
    Wang, Hong
    Hu, Zhanyi
    ADVANCES IN SWARM INTELLIGENCE, PT 2, PROCEEDINGS, 2010, 6146 : 611 - +
  • [34] Microarray data clustering using particle swarm optimization K-means algorithm
    Deng, YP
    Kayarat, D
    Elasri, MO
    Brown, SJ
    PROCEEDINGS OF THE 8TH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1-3, 2005, : 1730 - 1734
  • [35] Multi-objective Optimization for Clustering Microarray Gene Expression Data - A Comparative Study
    Fuad, Muhammad Marwan Muhammad
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, 2015, 38 : 123 - 133
  • [36] Elastic Differential Evolution for Automatic Data Clustering
    Chen, Jun-Xian
    Gong, Yue-Jiao
    Chen, Wei-Neng
    Li, Mengting
    Zhang, Jun
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (08) : 4134 - 4147
  • [37] Automatic Subspace Clustering of High Dimensional Data
    Rakesh Agrawal
    Johannes Gehrke
    Dimitrios Gunopulos
    Prabhakar Raghavan
    Data Mining and Knowledge Discovery, 2005, 11 : 5 - 33
  • [38] A Bacterial Evolutionary Algorithm for Automatic Data Clustering
    Das, Swagatam
    Chowdhury, Archana
    Abraham, Ajith
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 2403 - +
  • [39] Clustering of DNA Microarray Temporal Data based on the Autoregressive Model
    Choong, Miew Keen
    Yan, Hong
    Levy, David
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 71 - 75
  • [40] Descriptive and Systematic Comparison of Clustering Methods in Microarray Data Analysis
    Kim, Seo Young
    KOREAN JOURNAL OF APPLIED STATISTICS, 2009, 22 (01) : 89 - 106