Iterative bicluster-based least square framework for estimation of missing values in microarray gene expression data

Cited by: 40
Authors
Cheng, K. O. [1]
Law, N. F. [1]
Siu, W. C. [1, 2]
Affiliations
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Ctr Signal Proc, Hong Kong, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Elect & Informat Engn EIE, Hong Kong, Hong Kong, Peoples R China
Keywords
Missing value imputation; Biclustering; Iterative estimation; Gene expression analysis; SACCHAROMYCES-CEREVISIAE; IDENTIFICATION; CLASSIFICATION
DOI
10.1016/j.patcog.2011.10.012
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
DNA microarray experiments inevitably generate gene expression data with missing values, so imputing these values is an important and necessary pre-processing step. Existing imputation methods estimate the missing values by exploiting gene correlation across all experimental conditions. However, related genes are coexpressed only in subsets of the experimental conditions. In this paper, we propose to use biclusters, which contain similar genes under a subset of conditions, to characterize gene similarity and then estimate the missing values. To further improve the accuracy of missing value estimation, an iterative framework is developed with a stopping criterion based on minimizing uncertainty. Extensive experiments have been conducted on artificial datasets, real microarray datasets, and one non-microarray dataset. The proposed bicluster-based approach is able to reduce errors in missing value estimation. © 2011 Elsevier Ltd. All rights reserved.
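The general idea in the abstract (estimate a missing entry by least squares from similar genes, restricted to a subset of conditions, and repeat until the estimates settle) can be illustrated with a minimal sketch. This is not the authors' algorithm. The function name `iterative_ls_impute` and the parameters `k`, `cond_frac`, `max_iter`, and `tol` are hypothetical. The condition-subset step is a crude stand-in for biclustering, and the convergence tolerance is a stand-in for the paper's uncertainty-based stopping criterion, neither of which is specified in this record. The sketch also assumes every gene has at least one observed value.

```python
import numpy as np


def iterative_ls_impute(X, k=3, cond_frac=0.75, max_iter=20, tol=1e-6):
    """Illustrative iterative least-squares imputation (rows = genes, NaN = missing)."""
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)

    # Initial guess: fill each missing entry with its gene's observed mean.
    row_means = np.nanmean(X, axis=1)
    filled = np.where(missing, row_means[:, None], X)

    for _ in range(max_iter):
        previous = filled.copy()
        for g in np.flatnonzero(missing.any(axis=1)):
            obs = np.flatnonzero(~missing[g])  # conditions observed for gene g
            mis = np.flatnonzero(missing[g])   # conditions to estimate
            others = np.delete(np.arange(X.shape[0]), g)

            # Pick the k genes closest to g over g's observed conditions.
            dist = np.linalg.norm(previous[others][:, obs] - previous[g, obs], axis=1)
            nbrs = others[np.argsort(dist)[:k]]

            # Assumption: approximate a bicluster by keeping only the conditions
            # where the neighbours deviate least from the target gene.
            dev = np.abs(previous[nbrs][:, obs] - previous[g, obs]).mean(axis=0)
            n_keep = max(k + 1, int(np.ceil(cond_frac * obs.size)))
            keep = obs[np.argsort(dev)[:n_keep]]

            # Least-squares fit of the target on its neighbours over the kept
            # conditions, then predict the missing conditions.
            A = previous[nbrs][:, keep].T
            b = previous[g, keep]
            coef, *_ = np.linalg.lstsq(A, b, rcond=None)
            filled[g, mis] = previous[nbrs][:, mis].T @ coef

        # Assumption: stop when estimates no longer change (not the paper's criterion).
        if np.max(np.abs(filled - previous)) < tol:
            break
    return filled
```

For example, calling `iterative_ls_impute(X, k=3)` on a genes-by-conditions array with `np.nan` at the missing positions returns an array of the same shape with those positions filled in; observed entries are left unchanged.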
Pages: 1281-1289
Page count: 9