Clust: automatic extraction of optimal co-expressed gene clusters from gene expression data

被引:125
作者
Abu-Jamous, Basel [1 ]
Kelly, Steven [1 ]
机构
[1] Univ Oxford, Dept Plant Sci, South Pk Rd, Oxford OX1 3RB, England
基金
比尔及梅琳达.盖茨基金会; 欧盟地平线“2020”;
关键词
Clustering; Gene expression data; Clust; K-means; Cross-clustering; Click; Markov clustering; Hierarchical clustering; Self-organizing maps; WGCNA; BIOSYNTHESIS; DISCOVERY; ONTOLOGY;
D O I
10.1186/s13059-018-1536-8
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Identifying co-expressed gene clusters can provide evidence for genetic or physical interactions. Thus, co-expression clustering is a routine step in large-scale analyses of gene expression data. We show that commonly used clustering methods produce results that substantially disagree and that do not match the biological expectations of co-expressed gene clusters. We present clust, a method that solves these problems by extracting clusters matching the biological expectations of co-expressed genes and outperforms widely used methods. Additionally, clust can simultaneously cluster multiple datasets, enabling users to leverage the large quantity of public expression data for novel comparative analysis. Clust is available at https://github.com/BaselAbujamous/clust.
引用
收藏
页数:11
相关论文
共 39 条
[1]  
Abu-Jamous B, 2018, CLUST METHOD PYTHON
[2]  
Abu-Jamous B, 2018, CLUST 100 GE DATASET, DOI [10. 5281/zenodo. 1298541, DOI 10.5281/ZEN0D0.1298541]
[3]   UNCLES: method for the identification of genes differentially consistently co-expressed in a specific subset of datasets [J].
Abu-Jamous, Basel ;
Fa, Rui ;
Roberts, David J. ;
Nandi, Asoke K. .
BMC BIOINFORMATICS, 2015, 16
[4]   Paradigm of Tunable Clustering Using Binarization of Consensus Partition Matrices (Bi-CoPaM) for Gene Discovery [J].
Abu-Jamous, Basel ;
Fa, Rui ;
Roberts, David J. ;
Nandi, Asoke K. .
PLOS ONE, 2013, 8 (02)
[5]  
Agarwala R, 2018, NUCLEIC ACIDS RES, V46, pD8, DOI [10.1093/nar/gks1189, 10.1093/nar/gkx1095, 10.1093/nar/gkq1172]
[6]  
[Anonymous], 2000, GRAPH CLUSTERING FLO
[7]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[8]  
Ball, 1965, ISODATA NOVEL METHOD
[9]   Transcription - Signal transduction and the control of gene expression [J].
Brivanlou, AH ;
Darnell, JE .
SCIENCE, 2002, 295 (5556) :813-818
[10]   Assigning roles to DNA regulatory motifs using comparative genomics [J].
Buske, Fabian A. ;
Boden, Mikael ;
Bauer, Denis C. ;
Bailey, Timothy L. .
BIOINFORMATICS, 2010, 26 (07) :860-866