A robust pre-processing of BeadChip microarray images

被引:10
作者
Kalina, Jan [1 ]
机构
[1] Czech Acad Sci, Inst Comp Sci, Pod Vodarenskou Vezi 2, Prague 18207 8, Czech Republic
关键词
Microarray; Robust image analysis; Noise; Outlying measurements; Background effect; SELECTION;
D O I
10.1016/j.bbe.2018.04.005
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Microarray images commonly used in gene expression studies are heavily contaminated by noise and/or outlying values (outliers). Unfortunately, standard methodology for the analysis of Illumina BeadChip microarray images turns out to be too vulnerable to data contamination by outliers. In this paper, an alternative approach to low-level pre-processing of images obtained by the BeadChip microarray technology is proposed. The novel approach robustifies the standard methodology in a complex way and thus ensures a sufficient robustness (resistance) to outliers. A gene expression data set from a cardiovascular genetic study is analyzed and the performance of the novel robust approach is compared with the standard methodology. The robust approach is able to detect and delete a larger percentage of outliers. More importantly, gene expressions are estimated more precisely. As a consequence, also the performance of a subsequently performed classification task to two groups (patients vs. control persons) is improved over the cardiovascular gene expression data set. A further improvement was obtained when considering weighted gene expression values, where the weights correspond to a robust estimate of variability of the measurements for each individual gene transcript. (C) 2018 Nalecz Institute of Biocybernetics and Biomedical Engineering of the Polish Academy of Sciences. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:556 / 563
页数:8
相关论文
共 30 条
[1]   An overview of image-processing methods for Affymetrix GeneChips [J].
Arteaga-Salas, Jose M. ;
Zuzan, Harry ;
Langdon, William B. ;
Upton, Graham J. G. ;
Harrison, Andrew P. .
BRIEFINGS IN BIOINFORMATICS, 2008, 9 (01) :25-33
[2]   Current challenges in open-source bioimage informatics [J].
Cardona, Albert ;
Tomancak, Pavel .
NATURE METHODS, 2012, 9 (07) :661-665
[3]   Breakdown and groups - Rejoinder [J].
Davies, PL ;
Gather, U .
ANNALS OF STATISTICS, 2005, 33 (03) :1016-1035
[4]   Minimum redundancy feature selection from microarray gene expression data [J].
Ding, C ;
Peng, HC .
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, :523-528
[5]  
Dunning M, 2017, R PACKAGE VERSION 2
[6]   beadarray:: R classes and methods for Illumina bead-based data [J].
Dunning, Mark J. ;
Smith, Mike L. ;
Ritchie, Matthew E. ;
Tavare, Simon .
BIOINFORMATICS, 2007, 23 (16) :2183-2184
[7]   Statistical issues in the analysis of Illumina data [J].
Dunning, Mark J. ;
Barbosa-Morais, Nuno L. ;
Lynch, Andy G. ;
Tavare, Simon ;
Ritchie, Matthew E. .
BMC BIOINFORMATICS, 2008, 9 (1)
[8]  
Dziuda Darius., 2010, Data Mining for Genomics and Proteomics: Analysis of Gene and Protein Expression Data
[9]  
Fraser A, 2010, CH CRC COMP SCI DATA, P1
[10]  
Gohlmann H., 2009, Gene Expression Studies using Affymetrix Microarrays