Bayesian biomarker identification based on marker-expression proteomics data

被引:11
作者
Bhattacharjee, M. [1 ]
Botting, C. H. [2 ]
Sillanpaa, M. J. [3 ]
机构
[1] Univ St Andrews, Sch Math & Stat, St Andrews KY16 9ST, Fife, Scotland
[2] Univ St Andrews, Ctr Biomol Sci, St Andrews KY16 9ST, Fife, Scotland
[3] Univ Helsinki, Dept Math & Stat, Helsinki, Finland
基金
芬兰科学院;
关键词
Variable selection; Bayesian hierarchical models; CFS; Association analysis; High-throughput analysis;
D O I
10.1016/j.ygeno.2008.06.006
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
We are studying variable selection in multiple regression models in which molecular markers and/or gene-expression measurements as well as intensity measurements from protein spectra serve as predictors for the outcome variable (i.e., trait or disease state). Finding genetic biomarkers and searching genetic-epidemiological factors can be formulated as a statistical problem of variable selection, in which, from a large set of candidates, a small number of trait-associated predictors are identified. We illustrate our approach by analyzing the data available for chronic fatigue syndrome (CFS). CFS is a complex disease from several aspects, e.g., it is difficult to diagnose and difficult to quantify. To identify biomarkers we used microarray data and SELDI-TOF-based proteomics data. We also analyzed genetic marker information for a large number of SNPs for an overlapping set of individuals. The objectives of the analyses were to identify markers specific to fatigue that are also possibly exclusive to CFS. The use of such models can be motivated, for example, by the search for new biomarkers for the diagnosis and prognosis of cancer and measures of response to therapy. Generally, for this we use Bayesian hierarchical modeling and Markov Chain Monte Carlo computation. (C) 2008 Elsevier Inc. All rights reserved.
引用
收藏
页码:384 / 392
页数:9
相关论文
共 22 条
[1]   Polymorphisms in the angiotensin-converting enzyme gene are associated with unipolar depression, ACE activity and hypercortisolism [J].
Baghai, T. C. ;
Binder, E. B. ;
Schule, C. ;
Salyakina, D. ;
Eser, D. ;
Lucae, S. ;
Zwanzger, P. ;
Haberger, C. ;
Zill, P. ;
Ising, M. ;
Deiml, T. ;
Uhr, M. ;
Illig, T. ;
Wichmann, H-E ;
Modell, S. ;
Nothdurfter, C. ;
Holsboer, F. ;
Mueller-Myhsok, B. ;
Moeller, H-J ;
Rupprecht, R. ;
Bondy, B. .
MOLECULAR PSYCHIATRY, 2006, 11 (11) :1003-1015
[2]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[3]   Bayesian integrated functional analysis of microarray data [J].
Bhattacharjee, M ;
Pritchard, CC ;
Nelson, PS ;
Arjas, E .
BIOINFORMATICS, 2004, 20 (17) :2943-2953
[4]  
BHATTACHARJEE M, 2006, P CAMDA 2006 DUK U D
[5]  
Bhattacharjee Madhuchhanda, 2008, Pac Symp Biocomput, P178
[6]   Enriching the analysis of genomewide association studies with hierarchical modeling [J].
Chen, Gary K. ;
Witte, John S. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (02) :397-404
[7]   Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching [J].
Du, Pan ;
Kibbe, Warren A. ;
Lin, Simon M. .
BIOINFORMATICS, 2006, 22 (17) :2059-2065
[8]   ExPASy: the proteomics server for in-depth protein knowledge and analysis [J].
Gasteiger, E ;
Gattiker, A ;
Hoogland, C ;
Ivanyi, I ;
Appel, RD ;
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3784-3788
[9]   Bayesian mapping of genotype x expression interactions in quantitative and qualitative traits [J].
Hoti, F. ;
Sillanpaa, M. J. .
HEREDITY, 2006, 97 (01) :4-18
[10]  
Hung RJ, 2004, CANCER EPIDEM BIOMAR, V13, P1013