PatternLab for proteomics: a tool for differential shotgun proteomics

被引:110
作者
Carvalho, Paulo C. [1 ]
Fischer, Juliana Sg [2 ,3 ]
Chen, Emily I. [4 ]
Yates, John R., III [4 ]
Barbosa, Valmir C. [1 ]
机构
[1] Univ Fed Rio de Janeiro, COPPE, Syst Engn & Comp Sci Program, BR-21945 Rio De Janeiro, Brazil
[2] Univ Fed Rio de Janeiro, Inst Chem, Prot Chem Lab, BR-21945 Rio De Janeiro, Brazil
[3] Univ Fed Rio de Janeiro, Rio de Janeiro Prote Network, BR-21945 Rio De Janeiro, Brazil
[4] Scripps Res Inst, Biol Mass Spectrometry Lab, La Jolla, CA 92037 USA
关键词
D O I
10.1186/1471-2105-9-316
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: A goal of proteomics is to distinguish between states of a biological system by identifying protein expression differences. Liu et al. demonstrated a method to perform semi-relative protein quantitation in shotgun proteomics data by correlating the number of tandem mass spectra obtained for each protein, or "spectral count", with its abundance in a mixture; however, two issues have remained open: how to normalize spectral counting data and how to efficiently pinpoint differences between profiles. Moreover, Chen et al. recently showed how to increase the number of identified proteins in shotgun proteomics by analyzing samples with different MS-compatible detergents while performing proteolytic digestion. The latter introduced new challenges as seen from the data analysis perspective, since replicate readings are not acquired. Results: To address the open issues above, we present a program termed PatternLab for proteomics. This program implements existing strategies and adds two new methods to pinpoint differences in protein profiles. The first method, ACFold, addresses experiments with less than three replicates from each state or having assays acquired by different protocols as described by Chen et al. ACFold uses a combined criterion based on expression fold changes, the AC test, and the false-discovery rate, and can supply a "bird's-eye view" of differentially expressed proteins. The other method addresses experimental designs having multiple readings from each state and is referred to as nSVM ( natural support vector machine) because of its roots in evolutionary computing and in statistical learning theory. Our observations suggest that nSVM's niche comprises projects that select a minimum set of proteins for classification purposes; for example, the development of an early detection kit for a given pathology. We demonstrate the effectiveness of each method on experimental data and confront them with existing strategies. Conclusion: PatternLab offers an easy and unified access to a variety of feature selection and normalization strategies, each having its own niche. Additionally, graphing tools are available to aid in the analysis of high throughput experimental data.
引用
收藏
页数:14
相关论文
共 33 条
[1]   Determination of the differentially expressed genes in microarray experiments using local FDR [J].
Aubert, J ;
Bar-Hen, A ;
Daudin, JJ ;
Robin, S .
BMC BIOINFORMATICS, 2004, 5 (1)
[2]   The significance of digital gene expression profiles [J].
Audic, S ;
Claverie, JM .
GENOME RESEARCH, 1997, 7 (10) :986-995
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]  
Carvalho PC, 2007, J EXP THER ONCOL, V6, P137
[5]   Analysis of microarray data using Z score transformation [J].
Cheadle, C ;
Vawter, MP ;
Freed, WJ ;
Becker, KG .
JOURNAL OF MOLECULAR DIAGNOSTICS, 2003, 5 (02) :73-81
[6]   Optimization of mass spectrometry-compatible surfactants for shotgun proteomics [J].
Chen, Emily I. ;
Cociorva, Daniel ;
Norris, Jeremy L. ;
Yates, John R., III .
JOURNAL OF PROTEOME RESEARCH, 2007, 6 (07) :2529-2538
[7]  
Cleary J. G., 1995, Proceedings. DCC '95 Data Compression Conference (Cat. No.95TH8037), DOI 10.1109/DCC.1995.515590
[8]   POSSIBLE ORDERINGS IN MEASUREMENT SELECTION PROBLEM [J].
COVER, TM ;
VANCAMPENHOUT, JM .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1977, 7 (09) :657-661
[9]   A proteomic view of the Plasmodium falciparum life cycle [J].
Florens, L ;
Washburn, MP ;
Raine, JD ;
Anthony, RM ;
Grainger, M ;
Haynes, JD ;
Moch, JK ;
Muster, N ;
Sacci, JB ;
Tabb, DL ;
Witney, AA ;
Wolters, D ;
Wu, YM ;
Gardner, MJ ;
Holder, AA ;
Sinden, RE ;
Yates, JR ;
Carucci, DJ .
NATURE, 2002, 419 (6906) :520-526
[10]  
Frohlich H., 2004, International Journal on Artificial Intelligence Tools (Architectures, Languages, Algorithms), V13, P791, DOI 10.1142/S0218213004001818