HTAPP: High-throughput autonomous proteomic pipeline

被引:27
作者
Yu, Kebing [1 ]
Salomon, Arthur R. [1 ,2 ,3 ]
机构
[1] Brown Univ, Dept Chem, Providence, RI 02903 USA
[2] Brown Univ, Dept Mol Biol Cell Biol & Biochem, Providence, RI 02903 USA
[3] Brown Univ, Ctr Genom & Prote, Providence, RI 02903 USA
基金
美国国家卫生研究院;
关键词
Automation; Bioinformatics; Laboratory information management system; MS/MS database search; Peptide analysis; Relational database; MASS-SPECTROMETRY; PROTEIN IDENTIFICATION; PHOSPHOPROTEOMIC ANALYSIS; TANDEM; PLATFORM; DATABASE; PEPTIDE; PERFORMANCE; SEQUENCES;
D O I
10.1002/pmic.200900159
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recent advances in the speed and sensitivity of mass spectrometers and in analytical methods, the exponential acceleration of computer processing speeds, and the availability of genomic databases from an array of species and protein information databases have led to a deluge of proteomic data. The development of a lab-based automated proteomic software platform for the automated collection, processing, storage, and visualization of expansive proteomic data sets is critically important. The high-throughput autonomous proteomic pipeline described here is designed from the ground up to provide critically important flexibility for diverse proteomic workflows and to streamline the total analysis of a complex proteomic sample. This tool is composed of a software that controls the acquisition of mass spectral data along with automation of post-acquisition tasks such as peptide quantification, clustered MS/MS spectral database searching, statistical validation, and data exploration within a user-configurable lab-based relational database. The software design of high-throughput autonomous proteomic pipeline focuses on accommodating diverse workflows and providing missing software functionality to a wide range of proteomic researchers to accelerate the extraction of biological meaning from immense proteomic data sets. Although individual software modules in our integrated technology platform may have some similarities to existing tools, the true novelty of the approach described here is in the synergistic and flexible combination of these tools to provide an integrated and efficient analysis of proteomic samples.
引用
收藏
页码:2113 / 2122
页数:10
相关论文
共 43 条
[11]   TANDEM: matching proteins with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
BIOINFORMATICS, 2004, 20 (09) :1466-1467
[12]  
Desiere F, 2005, GENOME BIOL, V6
[13]   mzML: A single, unifying data format for mass spectrometer output [J].
Deutsch, Eric .
PROTEOMICS, 2008, 8 (14) :2776-2777
[14]   Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry [J].
Elias, Joshua E. ;
Gygi, Steven P. .
NATURE METHODS, 2007, 4 (03) :207-214
[15]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[16]   Automated immobilized metal affinity chromatography/nano-liquid chromatography/electrospray ionization mass spectrometry platform for profiling protein phosphorylation sites [J].
Ficarro, SB ;
Salomon, AR ;
Brill, LM ;
Mason, DE ;
Stettler-Gill, M ;
Brock, A ;
Peters, EC .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2005, 19 (01) :57-71
[17]   ExPASy: the proteomics server for in-depth protein knowledge and analysis [J].
Gasteiger, E ;
Gattiker, A ;
Hoogland, C ;
Ivanyi, I ;
Appel, RD ;
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3784-3788
[18]   CDART: Protein homology by domain architecture [J].
Geer, LY ;
Domrachev, M ;
Lipman, DJ ;
Bryant, SH .
GENOME RESEARCH, 2002, 12 (10) :1619-1623
[19]   Phosphosite: A bioinformatics resource dedicated to physiological protein phosphorylation [J].
Hornbeck, PV ;
Chabra, I ;
Kornhauser, JM ;
Skrzypek, E ;
Zhang, B .
PROTEOMICS, 2004, 4 (06) :1551-1561
[20]  
Keller Andrew, 2005, Mol Syst Biol, V1