Analysis of Imputation Algorithms for Microarray Gene Expression Data

Cited by: 0
Authors
Shashirekha, H. L. [1]
Wani, Agaz Hussain [1]
Affiliation
[1] Mangalore Univ, Dept Comp Sci, Mangalore 574199, India
Source
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT) | 2015
Keywords
Imputation; Microarray Data Analysis; Gene Expression Data; Missing Value Estimation
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Microarray technology makes it possible to measure the expression levels of thousands of genes simultaneously in an efficient and inexpensive manner. However, because of various complexities in processing microarrays, measurements for some genes are unreliable and their expression information may be missing. Missing values in gene expression data can adversely affect downstream analyses such as clustering and dimensionality reduction. Different algorithms have been developed to estimate missing values in different data sets, and no single algorithm works well on all of them. In this work, we explore the application of the Mutual Nearest Neighbor (MNN) algorithm to impute missing values; it shows results comparable to those of other well-known imputation algorithms. We have also explored five other methods for missing value imputation, namely Row Average Imputation, Mean Imputation, Median Imputation, k-Nearest Neighbor (kNN) Imputation, and a combination of kNN-based feature selection (kNNFS) and kNN-based imputation. The experiments are carried out on very high-dimensional gene expression data, such as the Notterman Carcinoma and Notterman Adenocarcinoma data sets, and the results are illustrated.
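As a rough illustration of two of the methods named in the abstract, the following is a minimal sketch of row-average imputation and a basic kNN imputation. It is not the authors' implementation: the function names `row_average_impute` and `knn_impute`, the convention that rows are genes and columns are samples, the distance measure (root mean squared difference over columns observed in both rows), and the default `k` are all assumptions made for this example.

```python
import numpy as np


def row_average_impute(X):
    """Replace each NaN with the mean of the observed values in its row.

    Assumption: rows are genes and columns are samples.
    A row with no observed values stays NaN.
    """
    X = np.asarray(X, dtype=float).copy()
    row_means = np.nanmean(X, axis=1)
    rows, cols = np.where(np.isnan(X))
    X[rows, cols] = row_means[rows]
    return X


def knn_impute(X, k=2):
    """Fill each NaN with the mean of that column over the k nearest rows.

    Assumption: distance between two rows is the root mean squared
    difference over the columns observed in both rows.
    """
    X = np.asarray(X, dtype=float)
    out = X.copy()
    incomplete_rows = np.where(np.isnan(X).any(axis=1))[0]
    for i in incomplete_rows:
        # Distance from row i to every other row that shares observed columns.
        dists = []
        for j in range(X.shape[0]):
            if j == i:
                continue
            shared = ~np.isnan(X[i]) & ~np.isnan(X[j])
            if shared.any():
                d = np.sqrt(np.mean((X[i, shared] - X[j, shared]) ** 2))
                dists.append((float(d), j))
        dists.sort()
        # For each missing entry, average the k closest rows that observe it.
        for c in np.where(np.isnan(X[i]))[0]:
            donors = [X[j, c] for _, j in dists if not np.isnan(X[j, c])][:k]
            if donors:
                out[i, c] = np.mean(donors)
    return out


# Toy matrix (hypothetical values): 4 genes x 3 samples, one missing entry.
toy = np.array([
    [1.0, 2.0, np.nan],
    [1.0, 2.0, 3.0],
    [1.1, 2.1, 3.2],
    [9.0, 9.0, 9.0],
])
row_filled = row_average_impute(toy)
knn_filled = knn_impute(toy, k=2)
```

On the toy matrix, row averaging fills the gap from the gene's own observed values, while the kNN variant borrows from the two genes with the most similar profiles, so a dissimilar gene (the last row) does not influence the estimate.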
Pages: 589-593
Page count: 5