Issues of processing and multiple testing of SELDI-TOF MS proteomic data

被引:0
作者
Birkner, Merrill D. [1 ]
Hubbard, Alan E. [1 ]
van der Laan, Mark J. [1 ]
Skibola, Christine F. [2 ]
Hegedus, Christine M. [2 ]
Smith, Martyn T. [2 ]
机构
[1] Univ Calif Berkeley, Sch Publ Hlth, Div Biostat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Sch Publ Hlth, Div Environm Hlth Sci, Berkeley, CA 94720 USA
关键词
proteomics; mass-spectrometry; multiple testing; preprocessing; leukemia; tail probability;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A new data filtering method for SELDI-TOF MS proteomic spectra data is described. We examined technical repeats (2 per subject) of intensity versus m/z (mass/charge) of bone marrow cell lysate for two groups of childhood leukemia patients: acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL). As others have noted, the type of data processing as well as experimental variability can have a disproportionate impact on the list of "interesting" proteins (see Baggerly et al. (2004)). We propose a list of processing and multiple testing techniques to correct for 1) background drift; 2) filtering using smooth regression and cross-validated bandwidth selection; 3) peak finding; and 4) methods to correct for multiple testing (van der Laan et al. (2005)). The result is a list of proteins (indexed by m/z) where average expression is significantly different among disease (or treatment, etc.) groups. The procedures are intended to provide a sensible and statistically driven algorithm, which we argue provides a list of proteins that have a significant difference in expression. Given no sources of unmeasured bias (such as confounding of experimental conditions with disease status), proteins found to be statistically significant using this technique have a low probability of being false positives.
引用
收藏
页数:24
相关论文
共 27 条
[1]   Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments [J].
Baggerly, KA ;
Morris, JS ;
Coombes, KR .
BIOINFORMATICS, 2004, 20 (05) :777-U710
[2]   Improved peak detection and quantification of mass spectrometry data acquired from surface-enhanced laser desorption and ionization by denoising spectra with the undecimated discrete wavelet transform [J].
Coombes, KR ;
Tsavachidis, S ;
Morris, JS ;
Baggerly, KA ;
Hung, MC ;
Kuerer, HM .
PROTEOMICS, 2005, 5 (16) :4107-4117
[3]   Proteomic analysis of human acute leukemia cells: Insight into their classification [J].
Cui, JW ;
Wang, J ;
He, K ;
Jin, BF ;
Wang, HX ;
Li, W ;
Kang, LH ;
Hu, MR ;
Li, HY ;
Yu, M ;
Shen, BF ;
Wang, GJ ;
Zhang, XM .
CLINICAL CANCER RESEARCH, 2004, 10 (20) :6887-6896
[4]   Two-dimensional electrophoresis protein profiling as an analytical tool for human acute leukemia 2 classification [J].
Cui, JW ;
Wang, J ;
He, K ;
Jin, BF ;
Wang, HX ;
Li, W ;
Kang, LH ;
Hu, MR ;
Li, HY ;
Yu, M ;
Shen, BF ;
Wang, GJ ;
Zhang, XM .
ELECTROPHORESIS, 2005, 26 (01) :268-279
[5]  
DUDOIT S, 2004, STAT APPL GENETICS M, V3
[6]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[7]   Science, medicine, and the future - Childhood leukaemia [J].
Greaves, M .
BRITISH MEDICAL JOURNAL, 2002, 324 (7332) :283-287
[8]  
HARDLE W, 1990, EC SERIES MONOGRAPHS, V19
[9]   Proteomic analysis of childhood leukemia [J].
Hegedus, CM ;
Gunn, L ;
Skibola, C ;
Zhang, L ;
Shiao, R ;
Fu, S ;
Dalmasso, EA ;
Metayer, C ;
Dahl, GV ;
Buffler, PA ;
Smith, MT .
LEUKEMIA, 2005, 19 (10) :1713-1718
[10]   The SELDI-TOF MS approach to proteomics: Protein profiling and biomarker identification [J].
Issaq, HJ ;
Veenstra, TD ;
Conrads, TP ;
Felschow, D .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2002, 292 (03) :587-592