Improved biclustering of microarray data demonstrated through systematic performance tests

被引:111
|
作者
Turner, H [1 ]
Bailey, T [1 ]
Krzanowski, W [1 ]
机构
[1] Univ Exeter, Dept Math Sci, Exeter EX4 4QE, Devon, England
基金
英国惠康基金;
关键词
biclustering; two-way clustering; overlapping clustering; artificial microarray data; performance evaluation; bicluster quality measures;
D O I
10.1016/j.csda.2004.02.003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A new algorithm is presented for fitting the plaid model, a biclustering method developed for clustering gene expression data. The approach is based on speedy individual differences clustering and uses binary least squares to update the cluster membership parameters, making use of the binary constraints on these parameters and simplifying the other parameter updates. The performance of both algorithms is tested on simulated data sets designed to imitate (normalised) gene expression data, covering a range of biclustering configurations. Empirical distributions for the components of these data sets, including non-systematic error, are derived from a real set of microarray data. A set of two-way quality measures is proposed, based on one-way measures commonly used in information retrieval, to evaluate the quality of a retrieved bicluster with respect to a target bicluster in terms of both genes and samples. By defining a one-to-one correspondence between target biclusters and retrieved biclusters, the performance of each algorithm can be assessed. The results show that, using appropriately selected starting criteria, the proposed algorithm out-performs the original plaid model algorithm across a range of data sets. Furthermore, through the rigorous assessment of the plaid model a benchmark for future evaluation of biclustering methods is established. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:235 / 254
页数:20
相关论文
共 50 条
  • [41] Cuckoo Search with Mutation for Biclustering of Microarray Gene Expression Data
    Rengeswaran, Balamurugan
    Mathaiyan, Natarajan
    Kandasamy, Premalatha
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (03) : 300 - 306
  • [42] A novel biclustering algorithm of binary microarray data: BiBinCons and BiBinAlter
    Ben Saber, Haifa
    Elloumi, Mourad
    BIODATA MINING, 2015, 8
  • [43] Novel Probabilistic Encoding for EAs Applied to Biclustering of Microarray Data
    Marcozzi, Michael
    Divina, Federico
    Aguilar-Ruiz, Jesus S.
    Vanhoof, Wim
    GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 339 - 346
  • [44] Robust biclustering algorithm (ROBA) for DNA microarray data analysis
    Tchagang, Alain B.
    Tewfik, AhmedH.
    2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 918 - 923
  • [45] DNA microarray data analysis: A novel biclustering algorithm approach
    Tchagang, Alain B.
    Tewfik, Ahmed H.
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
  • [46] A Hybrid Possibilistic Algorithm for Biclustering: Application to Microarray Data Analysis
    Ben Saber, Haifa
    Elloumi, Mourad
    2015 26TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2015, : 48 - 52
  • [47] A novel biclustering algorithm of binary microarray data: BiBinCons and BiBinAlter
    Haifa Ben Saber
    Mourad Elloumi
    BioData Mining, 8
  • [48] An improved biclustering algorithm for gene expression data
    Jin, Sheng-Hua
    Hua, Li
    Open Cybernetics and Systemics Journal, 2014, 8 : 1141 - 1144
  • [49] An improved biclustering algorithm for gene expression data
    Jin, Sheng-Hua
    Hua, Li
    Open Cybernetics and Systemics Journal, 2014, 8 (01): : 1141 - 1144
  • [50] Multi-Objective Particle Swarm Optimization Biclustering of Microarray Data
    Liu, Junwan
    Li, Zhoujun
    Liu, Feifei
    Chen, Yiming
    2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2008, : 363 - +