Iterative bicluster-based least square framework for estimation of missing values in microarray gene expression data

被引:40
作者
Cheng, K. O. [1 ]
Law, N. F. [1 ]
Siu, W. C. [1 ,2 ]
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Ctr Signal Proc, Hong Kong, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Elect & Informat Engn EIE, Hong Kong, Hong Kong, Peoples R China
关键词
Missing value imputation; Biclustering; Iterative estimation; Gene expression analysis; SACCHAROMYCES-CEREVISIAE; IDENTIFICATION; CLASSIFICATION;
D O I
10.1016/j.patcog.2011.10.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
DNA microarray experiment inevitably generates gene expression data with missing values. An important and necessary pre-processing step is thus to impute these missing values. Existing imputation methods exploit gene correlation among all experimental conditions for estimating the missing values. However, related genes coexpress in subsets of experimental conditions only. In this paper, we propose to use biclusters, which contain similar genes under subset of conditions for characterizing the gene similarity and then estimating the missing values. To further improve the accuracy in missing value estimation, an iterative framework is developed with a stopping criterion on minimizing uncertainty. Extensive experiments have been conducted on artificial datasets, real microarray datasets as well as one non-microarray dataset. Our proposed biclusters-based approach is able to reduce errors in missing value estimation. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1281 / 1289
页数:9
相关论文
共 46 条
  • [31] Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes
    Patrick Warnat
    Roland Eils
    Benedikt Brors
    BMC Bioinformatics, 6
  • [32] Effective gene expression data generation framework based on multi-model approach
    Sirin, Utku
    Erdogdu, Utku
    Polat, Faruk
    Tan, Mehmet
    Alhajj, Reda
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2016, 70 : 41 - 61
  • [33] Artificial neural network and partial least square regressions for rapid estimation of cellulose pulp dryness based on near infrared spectroscopic data
    Costa, Livia Ribeiro
    Denzin Tonoli, Gustavo Henrique
    Milagres, Flaviana Reis
    Gherardi Hein, Paulo Ricardo
    CARBOHYDRATE POLYMERS, 2019, 224
  • [34] A Machine Learning Method to Trace Cancer Primary Lesion Using Microarray-Based Gene Expression Data
    Lu, Qingfeng
    Chen, Fengxia
    Li, Qianyue
    Chen, Lihong
    Tong, Ling
    Tian, Geng
    Zhou, Xiaohong
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [35] Multi-Objective Evolutionary Algorithms based Interpretable Fuzzy Models for Microarray Gene Expression Data Analysis
    Wang, Zhenyu
    Palade, Vasile
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 308 - 313
  • [36] A Comparison of Evolutionary Methods Based on SVM Error-Bound Theories for Multicategory Microarray Gene Expression Data
    Debnath, Rameswar
    Kurita, Takio
    IMCIC 2010: INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL I (POST-CONFERENCE EDITION), 2010, : 295 - 299
  • [37] Hybrid Feature Selection of Breast Cancer Gene Expression Microarray Data Based on Metaheuristic Methods: A Comprehensive Review
    Ali, Nursabillilah Mohd
    Besar, Rosli
    Ab Aziz, Nor Azlina
    SYMMETRY-BASEL, 2022, 14 (10):
  • [38] Network-Based Inference Framework for Identifying Cancer Genes from Gene Expression Data
    Yang, Bo
    Zhang, Junying
    Yin, Yaling
    Zhang, Yuanyuan
    BIOMED RESEARCH INTERNATIONAL, 2013, 2013
  • [39] A model-based optimization framework for the inference of regulatory interactions using time-course DNA microarray expression data
    Thomas, Reuben
    Paredes, Carlos J.
    Mehrotra, Sanjay
    Hatzimanikatis, Vassily
    Papoutsakis, Eleftherios T.
    BMC BIOINFORMATICS, 2007, 8
  • [40] A comparative study of machine learning and deep learning algorithms to classify cancer types based on microarray gene expression data
    Tabares-Soto, Reinel
    Orozco-Arias, Simon
    Romero-Cano, Victor
    Segovia Bucheli, Vanesa
    Luis Rodriguez-Sotelo, Jose
    Felipe Jimenez-Varon, Cristian
    PEERJ COMPUTER SCIENCE, 2020, PeerJ Inc. (2020)