Reversible jump MCMC approach for peak identification for stroke SELDI mass spectrometry using mixture model

被引:19
作者
Wang, Yuan [1 ,2 ,3 ]
Zhou, Xiaobo [1 ,2 ]
Wang, Honghui [4 ]
Li, King [1 ,2 ]
Yao, Lixiu [3 ]
Wong, Stephen T. C. [1 ,2 ]
机构
[1] Methodist Hosp, Res Inst, CBI, Houston, TX 77030 USA
[2] Methodist Hosp, Weill Cornell Med Coll, Dept Radiol, Houston, TX 77030 USA
[3] Shanghai Jiao Tong Univ, Sch Elect Informt & Elect Engn, Shanghai 200030, Peoples R China
[4] NIH, Crit Care Med Dept, Ctr Clin, Bethesda, MD 20892 USA
关键词
D O I
10.1093/bioinformatics/btn143
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Mass spectrometry (MS) has shown great potential in detecting disease-related biomarkers for early diagnosis of stroke. To discover potential biomarkers from large volume of noisy MS data, peak detection must be performed first. This article proposes a novel automatic peak detection method for the stroke MS data. In this method, a mixture model is proposed to model the spectrum. Bayesian approach is used to estimate parameters of the mixture model, and Markov chain Monte Carlo method is employed to perform Bayesian inference. By introducing a reversible jump method, we can automatically estimate the number of peaks in the model. Instead of separating peak detection into substeps, the proposed peak detection method can do baseline correction, denoising and peak identification simultaneously. Therefore, it minimizes the risk of introducing irrecoverable bias and errors from each substep. In addition, this peak detection method does not require a manually selected denoising threshold. Experimental results on both simulated dataset and stroke MS dataset show that the proposed peak detection method not only has the ability to detect small signal-to-noise ratio peaks, but also greatly reduces false detection rate while maintaining the same sensitivity.
引用
收藏
页码:I407 / I413
页数:7
相关论文
共 24 条
[1]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]   Robust full Bayesian learning for radial basis networks [J].
Andrieu, C ;
de Freitas, N ;
Doucet, A .
NEURAL COMPUTATION, 2001, 13 (10) :2359-2407
[3]   A comprehensive approach to the analysis of matrix-assisted laser desorption/ionization-time of flight proteomics spectra from serum samples [J].
Baggerly, KA ;
Morris, JS ;
Wang, J ;
Gold, D ;
Xiao, LC ;
Coombes, KR .
PROTEOMICS, 2003, 3 (09) :1667-1672
[4]   Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments [J].
Baggerly, KA ;
Morris, JS ;
Coombes, KR .
BIOINFORMATICS, 2004, 20 (05) :777-U710
[5]  
Coombes KR, 2005, CANCER INFORM, V1, P41
[6]   Improved peak detection and quantification of mass spectrometry data acquired from surface-enhanced laser desorption and ionization by denoising spectra with the undecimated discrete wavelet transform [J].
Coombes, KR ;
Tsavachidis, S ;
Morris, JS ;
Baggerly, KA ;
Hung, MC ;
Kuerer, HM .
PROTEOMICS, 2005, 5 (16) :4107-4117
[7]   SELDI-TOF mass spectra: A view on sources of variation [J].
Dijkstra, Martijn ;
Vonk, Roel J. ;
Jansen, Ritsert C. .
JOURNAL OF CHROMATOGRAPHY B-ANALYTICAL TECHNOLOGIES IN THE BIOMEDICAL AND LIFE SCIENCES, 2007, 847 (01) :12-23
[8]   Peak quantification in surface-enhanced laser desorption/ionization by using mixture models [J].
Dijkstra, Martijn ;
Roelofsen, Han ;
Vonk, Roel J. ;
Jansen, Ritsert C. .
PROTEOMICS, 2006, 6 (19) :5106-5116
[9]   Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching [J].
Du, Pan ;
Kibbe, Warren A. ;
Lin, Simon M. .
BIOINFORMATICS, 2006, 22 (17) :2059-2065
[10]  
FUNG ET, 2002, BIOTECHNIQUES S34, V81, P40