Improved biclustering of microarray data demonstrated through systematic performance tests

Cited by: 111
Authors
Turner, H [1]
Bailey, T [1]
Krzanowski, W [1]
Affiliations
[1] Univ Exeter, Dept Math Sci, Exeter EX4 4QE, Devon, England
Funding
Wellcome Trust (UK);
Keywords
biclustering; two-way clustering; overlapping clustering; artificial microarray data; performance evaluation; bicluster quality measures;
DOI
10.1016/j.csda.2004.02.003
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
A new algorithm is presented for fitting the plaid model, a biclustering method developed for clustering gene expression data. The approach is based on speedy individual differences clustering and uses binary least squares to update the cluster membership parameters, making use of the binary constraints on these parameters and simplifying the other parameter updates. The performance of both algorithms is tested on simulated data sets designed to imitate (normalised) gene expression data, covering a range of biclustering configurations. Empirical distributions for the components of these data sets, including non-systematic error, are derived from a real set of microarray data. A set of two-way quality measures is proposed, based on one-way measures commonly used in information retrieval, to evaluate the quality of a retrieved bicluster with respect to a target bicluster in terms of both genes and samples. By defining a one-to-one correspondence between target biclusters and retrieved biclusters, the performance of each algorithm can be assessed. The results show that, using appropriately selected starting criteria, the proposed algorithm out-performs the original plaid model algorithm across a range of data sets. Furthermore, through the rigorous assessment of the plaid model a benchmark for future evaluation of biclustering methods is established. (C) 2004 Elsevier B.V. All rights reserved.
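The two-way quality measures and the one-to-one correspondence mentioned in the abstract can be illustrated with a minimal sketch. It assumes that a bicluster is a pair (genes, samples), that the information-retrieval measures are precision and recall computed over the (gene, sample) cells a bicluster covers, and that targets are paired with retrieved biclusters greedily by F1 score. These are illustrative assumptions; the abstract does not give the paper's exact definitions or matching procedure, and the function names below are hypothetical.

```python
from itertools import product


def cells(genes, samples):
    """Return the set of (gene, sample) cells covered by a bicluster."""
    return set(product(genes, samples))


def two_way_scores(target, retrieved):
    """Cell-wise precision and recall of a retrieved bicluster.

    Both arguments are (genes, samples) pairs. Scoring over cells is an
    assumption made for this sketch, not the paper's stated definition.
    """
    t = cells(*target)
    r = cells(*retrieved)
    overlap = len(t & r)
    precision = overlap / len(r) if r else 0.0
    recall = overlap / len(t) if t else 0.0
    return precision, recall


def f1(precision, recall):
    """Harmonic mean of precision and recall (0 if both are 0)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def match_one_to_one(targets, retrieved):
    """Greedily pair each target with at most one retrieved bicluster.

    Pairs are taken in order of decreasing F1; pairs with no overlap are
    never matched. Greedy matching is an assumption of this sketch.
    Returns a dict mapping target index to retrieved index.
    """
    scored = [
        (f1(*two_way_scores(t, r)), ti, ri)
        for ti, t in enumerate(targets)
        for ri, r in enumerate(retrieved)
    ]
    scored.sort(reverse=True)
    matches, used = {}, set()
    for score, ti, ri in scored:
        if score > 0 and ti not in matches and ri not in used:
            matches[ti] = ri
            used.add(ri)
    return matches


# Hypothetical toy data: two target biclusters, two retrieved biclusters.
targets = [({"g1", "g2"}, {"s1", "s2"}), ({"g5"}, {"s3"})]
found = [({"g5"}, {"s3"}), ({"g1", "g2", "g3"}, {"s1"})]
pairing = match_one_to_one(targets, found)
```

Scoring over cells rather than over genes alone means a retrieved bicluster is penalised for missing either the right genes or the right samples, which is the sense in which such a measure is two-way.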
Pages: 235-254
Number of pages: 20
Related papers
50 in total
  • [21] Biclustering of DNA microarray data with early pruning
    Tewfik, AH
    Tchagang, AB
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 773 - 776
  • [22] Biclustering of Expression Microarray Data Using Affinity Propagation
    Farinelli, Alessandro
    Denitto, Matteo
    Bicego, Manuele
    PATTERN RECOGNITION IN BIOINFORMATICS, 2011, 7036 : 13 - 24
  • [23] Quick hierarchical biclustering on microarray gene expression data
    Ji, Liping
    Mock, Kenneth Wei-Liang
    Tan, Kian-Lee
    BIBE 2006: SIXTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2006, : 110 - +
  • [24] Multiobjective Path Relinking for Biclustering: Application to Microarray Data
    Seridi, Khedidja
    Jourdan, Laetitia
    Talbi, El-Ghazali
    EVOLUTIONARY MULTI-CRITERION OPTIMIZATION, EMO 2013, 2013, 7811 : 200 - 214
  • [25] Biclustering of microarray data with MOSPO based on crowding distance
    Liu, Junwan
    Li, Zhoujun
    Hu, Xiaohua
    Chen, Yiming
    BMC BIOINFORMATICS, 2009, 10
  • [26] Biclustering of microarray data with MOSPO based on crowding distance
    Junwan Liu
    Zhoujun Li
    Xiaohua Hu
    Yiming Chen
    BMC Bioinformatics, 10
  • [27] Biclustering of microarray data based on singular value decomposition
    Yang, Wen-Hui
    Dai, Dao-Qing
    Yan, Hong
    EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 194 - +
  • [28] Evolutionary biclustering algorithms: an experimental study on microarray data
    Ons Maâtouk
    Wassim Ayadi
    Hend Bouziri
    Béatrice Duval
    Soft Computing, 2019, 23 : 7671 - 7697
  • [29] Comparative Analysis and Evaluation of Biclustering Algorithms for Microarray Data
    Maind, Ankush
    Raut, Shital
    NETWORKING COMMUNICATION AND DATA KNOWLEDGE ENGINEERING, VOL 2, 2018, 4 : 159 - 171
  • [30] Spectral biclustering of microarray data: Coclustering genes and conditions
    Kluger, Y
    Basri, R
    Chang, JT
    Gerstein, M
    GENOME RESEARCH, 2003, 13 (04) : 703 - 716