THE EFFECTS OF PRE-PROCESSING AND PARAMETER CHOICES ON SEARCHES THROUGH LARGE GENE EXPRESSION DATA COLLECTIONS

Cited by: 0
Authors
Hibbs, Matthew A. [1]
Affiliations
[1] Jackson Lab, Bar Harbor, ME 04609 USA
Source
2009 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2009) | 2009
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
T [Industrial Technology];
Discipline classification code
08;
Abstract
Gene expression microarray data collections contain information that can shed light on a variety of systems-level biological problems, including the functional roles of proteins and the regulatory networks governing their transcription and translation. However, the analysis of these data is complicated by unusual noise characteristics and variation between experimental protocols and technologies. Many of the efforts to confront these difficulties utilize additional pre-processing strategies to adjust the input data and/or alter parameter choices of their algorithmic approach. Here, we examine the effect of some of these techniques in the context of the SPELL similarity search algorithm. Our results demonstrate that pre-processing and parameter choices can greatly affect the performance of this approach. As such, these choices should be carefully considered and evaluated when performing a broad range of analyses of gene expression data.
Pages: 164-167
Page count: 4
Related papers
10 in total
[1]   Microarray data analysis: from disarray to consolidation and consensus [J].
Allison, DB ;
Cui, XQ ;
Page, GP ;
Sabripour, M .
NATURE REVIEWS GENETICS, 2006, 7 (01) :55-65
[2]   Singular value decomposition for genome-wide expression data processing and modeling [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10101-10106
[3]   A search engine to identify pathway genes from expression data on multiple organisms [J].
Chen, Chunnuan ;
Weirauch, Matthew T. ;
Powell, Corey C. ;
Zambon, Alexander C. ;
Stuart, Joshua M. .
BMC SYSTEMS BIOLOGY, 2007, 1
[4]  
Fisher RA, 1914, BIOMETRIKA, V10, P507
[5]   Exploring the functional landscape of gene expression: directed search of large microarray compendia [J].
Hibbs, Matthew A. ;
Hess, David C. ;
Myers, Chad L. ;
Huttenhower, Curtis ;
Li, Kai ;
Troyanskaya, Olga G. .
BIOINFORMATICS, 2007, 23 (20) :2692-2699
[6]  
LEEK J, 2007, PLOS GENET
[7]   Finding function: evaluation methods for functional genomic data [J].
Myers, Chad L. ;
Barrett, Daniel R. ;
Hibbs, Matthew A. ;
Huttenhower, Curtis ;
Troyanskaya, Olga G. .
BMC GENOMICS, 2006, 7 (1)
[8]   A gene recommender algorithm to identify coexpressed genes in C. elegans [J].
Owen, AB ;
Stuart, J ;
Mach, K ;
Villeneuve, AM ;
Kim, S .
GENOME RESEARCH, 2003, 13 (08) :1828-1837
[9]   Missing value estimation methods for DNA microarrays [J].
Troyanskaya, O ;
Cantor, M ;
Sherlock, G ;
Brown, P ;
Hastie, T ;
Tibshirani, R ;
Botstein, D ;
Altman, RB .
BIOINFORMATICS, 2001, 17 (06) :520-525
[10]  
Wall E., 2003, PRACTICAL APPROACH M