Finding Correlated Biclusters from Gene Expression Data

被引:33
|
作者
Yang, Wen-Hui [1 ,2 ]
Dai, Dao-Qing [1 ,2 ]
Yan, Hong [3 ,4 ]
机构
[1] Sun Yat Sen Zhongshan Univ, Ctr Comp Vis, Guangzhou 510275, Guangdong, Peoples R China
[2] Sun Yat Sen Zhongshan Univ, Dept Math, Fac Math & Comp, Guangzhou 510275, Guangdong, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Kowloon, Hong Kong, Peoples R China
[4] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
关键词
Biclustering; pattern classification; gene expression data; singular-value decomposition; data mining; biology computing; SINGULAR-VALUE DECOMPOSITION; MICROARRAY DATA; DISCRIMINANT-ANALYSIS; CLUSTER-ANALYSIS; PATTERNS; MODELS;
D O I
10.1109/TKDE.2010.150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting biologically relevant information from DNA microarrays is a very important task for drug development and test, function annotation, and cancer diagnosis. Various clustering methods have been proposed for the analysis of gene expression data, but when analyzing the large and heterogeneous collections of gene expression data, conventional clustering algorithms often cannot produce a satisfactory solution. Biclustering algorithm has been presented as an alternative approach to standard clustering techniques to identify local structures from gene expression data set. These patterns may provide clues about the main biological processes associated with different physiological states. In this paper, different from existing bicluster patterns, we first introduce a more general pattern: correlated bicluster, which has intuitive biological interpretation. Then, we propose a novel transform technique based on singular value decomposition so that identifying correlated-bicluster problem from gene expression matrix is transformed into two global clustering problems. The Mixed-Clustering algorithm and the Lift algorithm are devised to efficiently produce delta-corBiclusters. The biclusters obtained using our method from gene expression data sets of multiple human organs and the yeast Saccharomyces cerevisiae demonstrate clear biological meanings.
引用
收藏
页码:568 / 584
页数:17
相关论文
共 50 条
  • [41] Biclustering in gene expression data by tendency
    Liu, JZ
    Yang, J
    Wang, W
    2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004, : 182 - 193
  • [42] BiMine+: An efficient algorithm for discovering relevant biclusters of DNA microarray data
    Ayadi, Wassim
    Ellourni, Mourad
    Hao, Jin Kao
    KNOWLEDGE-BASED SYSTEMS, 2012, 35 : 224 - 234
  • [43] Efficient mining of discriminative co-clusters from gene expression data
    Odibat, Omar
    Reddy, Chandan K.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (03) : 667 - 696
  • [44] Leveraging additional knowledge to support coherent bicluster discovery in gene expression data
    Visconti, Alessia
    Cordero, Francesca
    Pensa, Ruggero G.
    INTELLIGENT DATA ANALYSIS, 2014, 18 (05) : 837 - 855
  • [45] An evolutionary approach for biclustering of gene expression data
    Sheta, Walaa
    Hany, Maha
    Mahdi, Shereef
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2010, 2 (06) : 413 - 421
  • [46] On Evolutionary Algorithms for Biclustering of Gene Expression Data
    Carballido Jessica, A.
    Gallo Cristian, A.
    Dussaut Julieta, S.
    Ignacio, Ponzoni
    CURRENT BIOINFORMATICS, 2015, 10 (03) : 259 - 267
  • [47] Missing value imputation for gene expression data: computational techniques to recover missing data from available information
    Liew, Alan Wee-Chung
    Law, Ngai-Fong
    Yan, Hong
    BRIEFINGS IN BIOINFORMATICS, 2011, 12 (05) : 498 - 513
  • [48] A genetic filter for cancer classification on gene expression data
    Kim, Yong-Hyuk
    Yoon, Yourim
    BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26 : S1993 - S2002
  • [49] A Review on Feature Selection Techniques for Gene Expression Data
    Vanjimalar, S.
    Ramyachitra, D.
    Manikandan, P.
    2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC 2018), 2018, : 26 - 29
  • [50] Ensemble Cuckoo Search Biclustering of the gene expression data
    Yin, Lu
    Liu, Yongguo
    2016 IEEE 15TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2016, : 419 - 422