On Comprehensive Mass Spectrometry Data Analysis for Proteome Profiling of Human Blood Samples

被引:0
作者
Manchanda S. [1 ]
Meyer M. [2 ]
Li Q. [3 ]
Liang K. [3 ]
Li Y. [3 ]
Kong N. [4 ]
机构
[1] Department of Computer Science, Purdue University, West Lafayette
[2] Department of Statistics and Mathematics, Purdue University, West Lafayette, IN
[3] Institute of Biophysics, Chinese Academy of Sciences, Beijing
[4] Weldon School of Biomedical Engineering, Purdue University, 206 S. Martin Jischke Dr, West Lafayette, 47906, IN
基金
美国国家科学基金会;
关键词
Binary classification; Blood sample; Feature ranking; Mass spectrometry; Proteome profiling;
D O I
10.1007/s41666-018-0022-0
中图分类号
学科分类号
摘要
To guarantee meaningful interpretation of data in basic and translational medicine, it is critical to ensure the quality of biological samples. Mass spectrometers have become promising instruments to acquire proteomic information that is known to be associated with the quality of samples. However, a universally applicable mass spectrometry data analysis platform for quality assessment remains of great need. We present a comprehensive pattern recognition study to facilitate the development of such a platform. This study involves feature extraction, binary classification, and feature ranking. In this study, we develop classifiers with classification accuracy higher than 90% in distinguishing human serum samples stored for different amounts of time. We also derive fingerprint patterns of serum peptides that can be conveniently used for temporal classification. © 2018, Springer International Publishing AG, part of Springer Nature.
引用
收藏
页码:305 / 318
页数:13
相关论文
共 30 条
[1]  
Ayache S., Panelli M., Marincola F.M., Stroncek D.F., Effects of Storage Time and Exogenous Protease Inhibitors on Plasma Protein Levels, American Journal of Clinical Pathology, 126, 2, pp. 174-184, (2006)
[2]  
Baggerly K.A., Morris J.S., Coombes K.R., Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments, Bioinformatics, 20, 5, pp. 777-785, (2004)
[3]  
Ball G., Et al., An integrated approach utilizing artificial neural networks and SELDI mass spectrometry for the classification of human tumours and rapid identification of potential biomarkers, Bioinformatics, 18, 3, pp. 395-404, (2002)
[4]  
Bishop C.M., Pattern recognition and machine learning (information science and statistics), (2006)
[5]  
Breiman L., Bagging predictors, Mach Learn, 24, 2, pp. 123-140, (1996)
[6]  
Carvalho P.C., Et al., Identifying differences in protein expression levels by spectral counting and feature selection, Genet Mol Res, 7, 2, (2008)
[7]  
Chaigneau C., Et al., Serum biobank certification and the establishment of quality controls for biological fluids: examples of serum biomarker stability after temperature variation, Clin Chem Lab Med, 45, 10, pp. 1390-1395, (2007)
[8]  
Datta S., DePadilla L.M., Feature selection and machine learning with mass spectrometry data for distinguishing cancer and noncancer samples, Stat Methodol, 3, 1, pp. 79-92, (2006)
[9]  
Jackson D.H., Banks R.E., Banking of clinical samples for proteomic biomarker studies: a consideration of logistical issues with a focus on pre-analytical variation, Proteomics Clin Appl, 4, 3, pp. 250-270, (2010)
[10]  
Jenkins M.A., Quality control and quality assurance aspects of the routine use of capillary electrophoresis for serum and urine proteins in clinical laboratories, Electrophoresis, 25, 10-11, pp. 1555-1560, (2004)