Classification-based quantitative analysis of stable isotope labeling by amino acids in cell culture (SILAC) data

被引:4
作者
Kim, Seongho [1 ,2 ]
Carruthers, Nicholas [3 ,4 ]
Lee, Joohyoung [1 ,5 ]
Chinni, Sreenivasa [6 ]
Stemmer, Paul [3 ,4 ]
机构
[1] Wayne State Univ, Karmanos Canc Inst, Biostat Core, Detroit, MI 48201 USA
[2] Wayne State Univ, Dept Oncol, Detroit, MI 48201 USA
[3] Wayne State Univ, Karmanos Canc Inst, Prote Core, Detroit, MI 48201 USA
[4] Wayne State Univ, Inst Environm Hlth Sci, Detroit, MI 48201 USA
[5] Wayne State Univ, Dept Family Med & Publ Hlth Sci, Detroit, MI 48201 USA
[6] Wayne State Univ, Sch Med, Dept Urol, Detroit, MI 48201 USA
关键词
Classification; Mass spectrometry; Particle swarm optimization; Proteomics; SILAC; PROTEOMICS;
D O I
10.1016/j.cmpb.2016.09.017
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and objective: Stable isotope labeling by amino acids in cell culture (SILAC) is a practical and powerful approach for quantitative proteomic analysis. A key advantage of SILAC is the ability to simultaneously detect the isotopically labeled peptides in a single instrument run and so guarantee relative quantitation for a large number of peptides without introducing any variation caused by separate experiment. However, there are a few approaches available to assessing protein ratios and none of the existing algorithms pays considerable attention to the proteins having only one peptide hit. Methods: We introduce new quantitative approaches to dealing with SILAC protein-level summary using classification-based methodologies, such as Gaussian mixture models with EM algorithms and its Bayesian approach as well as K-means clustering. In addition, a new approach is developed using Gaussian mixture model and a stochastic, metaheuristic global optimization algorithm, particle swarm optimization (PSO), to avoid either a premature convergence or being stuck in a local optimum. Results: Our simulation studies show that the newly developed PSO-based method performs the best among others in terms of F1 score and the proposed methods further demonstrate the ability of detecting potential markers through real SILAC experimental data. Conclusions: No matter how many peptide hits the protein has, the developed approach can be applicable, rescuing many proteins doomed to removal. Furthermore, no additional correction for multiple comparisons is necessary for the developed methods, enabling direct interpretation of the analysis outcomes. (C) 2016 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:137 / 148
页数:12
相关论文
共 24 条
[1]  
Benaglia T, 2009, J STAT SOFTW, V32, P1
[2]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[3]  
Bishop C., 2006, Pattern recognition and machine learning, P423
[4]   Online Quantitative Proteomics p-Value Calculator for Permutation-Based Statistical Testing of Peptide Ratios [J].
Chen, David ;
Shah, Anup ;
Hien Nguyen ;
Loo, Dorothy ;
Inder, Kerry L. ;
Hill, Michelle M. .
JOURNAL OF PROTEOME RESEARCH, 2014, 13 (09) :4184-4191
[5]   1D and 2D annotation enrichment: a statistical method integrating quantitative proteomics with complementary high-throughput data [J].
Cox, Juergen ;
Mann, Matthias .
BMC BIOINFORMATICS, 2012, 13 :S12
[6]   MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification [J].
Cox, Juergen ;
Mann, Matthias .
NATURE BIOTECHNOLOGY, 2008, 26 (12) :1367-1372
[7]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]   A guided tour of the Trans-Proteomic Pipeline [J].
Deutsch, Eric W. ;
Mendoza, Luis ;
Shteynberg, David ;
Farrah, Terry ;
Lam, Henry ;
Tasman, Natalie ;
Sun, Zhi ;
Nilsson, Erik ;
Pratt, Brian ;
Prazen, Bryan ;
Eng, Jimmy K. ;
Martin, Daniel B. ;
Nesvizhskii, Alexey I. ;
Aebersold, Ruedi .
PROTEOMICS, 2010, 10 (06) :1150-1159
[9]  
Engelbrecht A.P., 2007, Computational Intelligence, P289, DOI [10.1002/9780470512517.ch16, DOI 10.1002/9780470512517.CH16]
[10]  
Fraley C., 2006, MCLUST Version 3 for R: Normal Mixture Modeling and Model-Based Clustering