Classification-based quantitative analysis of stable isotope labeling by amino acids in cell culture (SILAC) data

被引：4

作者：

Kim, Seongho ^{[1
,2
]}

Carruthers, Nicholas ^{[3
,4
]}

Lee, Joohyoung ^{[1
,5
]}

Chinni, Sreenivasa ^{[6
]}

Stemmer, Paul ^{[3
,4
]}

机构：

[1] Wayne State Univ, Karmanos Canc Inst, Biostat Core, Detroit, MI 48201 USA

[2] Wayne State Univ, Dept Oncol, Detroit, MI 48201 USA

[3] Wayne State Univ, Karmanos Canc Inst, Prote Core, Detroit, MI 48201 USA

[4] Wayne State Univ, Inst Environm Hlth Sci, Detroit, MI 48201 USA

[5] Wayne State Univ, Dept Family Med & Publ Hlth Sci, Detroit, MI 48201 USA

[6] Wayne State Univ, Sch Med, Dept Urol, Detroit, MI 48201 USA

来源：

COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE | 2016年 / 137卷

关键词：

Classification; Mass spectrometry; Particle swarm optimization; Proteomics; SILAC; PROTEOMICS;

D O I：

10.1016/j.cmpb.2016.09.017

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Background and objective: Stable isotope labeling by amino acids in cell culture (SILAC) is a practical and powerful approach for quantitative proteomic analysis. A key advantage of SILAC is the ability to simultaneously detect the isotopically labeled peptides in a single instrument run and so guarantee relative quantitation for a large number of peptides without introducing any variation caused by separate experiment. However, there are a few approaches available to assessing protein ratios and none of the existing algorithms pays considerable attention to the proteins having only one peptide hit. Methods: We introduce new quantitative approaches to dealing with SILAC protein-level summary using classification-based methodologies, such as Gaussian mixture models with EM algorithms and its Bayesian approach as well as K-means clustering. In addition, a new approach is developed using Gaussian mixture model and a stochastic, metaheuristic global optimization algorithm, particle swarm optimization (PSO), to avoid either a premature convergence or being stuck in a local optimum. Results: Our simulation studies show that the newly developed PSO-based method performs the best among others in terms of F1 score and the proposed methods further demonstrate the ability of detecting potential markers through real SILAC experimental data. Conclusions: No matter how many peptide hits the protein has, the developed approach can be applicable, rescuing many proteins doomed to removal. Furthermore, no additional correction for multiple comparisons is necessary for the developed methods, enabling direct interpretation of the analysis outcomes. (C) 2016 Elsevier Ireland Ltd. All rights reserved.

引用

页码：137 / 148

页数：12

共 24 条

[1]

Benaglia T, 2009, J STAT SOFTW, V32, P1

[2] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].