A Review on Missing Value Imputation Algorithms for Microarray Gene Expression Data
Cited by: 47
Authors:
Moorthy, Kohbalan [1]
Mohamad, Mohd Saberi [1]
Deris, Safaai [1]
Affiliations:
[1] Univ Technol Malaysia, Fac Comp, Artificial Intelligence & Bioinformat Res Grp, Skudai 81310, Johor, Malaysia
Keywords:
Gene expression analysis;
gene expression data;
information recovery;
microarray data;
missing value estimation;
missing value imputation;
DOI:
10.2174/1574893608999140109120957
Chinese Library Classification (CLC) number:
Q5 [Biochemistry];
Discipline classification codes:
071010 ;
081704 ;
Abstract:
Many bioinformatics analytical tools, especially those for cancer classification and prediction, require a complete data matrix. Missing values in gene expression studies significantly influence the interpretation of the final data. However, to most analysts' dismay, missing values are a common problem, so relevant missing value imputation algorithms have to be developed or refined to address it. This paper reviews preferred and available missing value imputation methods for gene expression data. The focus is on the ability of the algorithms to exploit local or global data correlation to estimate the missing values. The algorithms are categorized into global, local, hybrid, and knowledge-assisted approaches. The methods presented are accompanied by suitable performance evaluations. The aim of this review is to highlight possible improvements to existing research techniques rather than to recommend new algorithms with the same functional aim.