A method of finding optimal weight factors for compound identification in gas chromatography-mass spectrometry

被引:52
作者
Kim, Seongho [1 ]
Koo, Imhoi [1 ,2 ]
Wei, Xiaoli [2 ]
Zhang, Xiang [2 ]
机构
[1] Univ Louisville, Dept Bioinformat & Biostat, Louisville, KY 40292 USA
[2] Univ Louisville, Dept Chem, Louisville, KY 40292 USA
关键词
LARGE-SCALE PROTEOMICS; SPECTRUM LIBRARIES; SIMILARITY;
D O I
10.1093/bioinformatics/bts083
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The compound identification in gas chromatographymass spectrometry (GC-MS) is achieved by matching the experimental mass spectrum to the mass spectra in a spectral library. It is known that the intensities with higher m/z value in the GCMS mass spectrum are the most diagnostic. Therefore, to increase the relative significance of peak intensities of higher m/z value, the intensities andm/z values are usually transformed with a set of weight factors. A poor quality of weight factors can significantly decrease the accuracy of compound identification. With the significant enrichment of the mass spectral database and the broad application of GC-MS, it is important to re-visit the methods of discovering the optimal weight factors for high confident compound identification. Results: We developed a novel approach to finding the optimal weight factors only through a reference library for high accuracy compound identification. The developed approach first calculates the ratio of skewness to kurtosis of the mass spectral similarity scores among spectra (compounds) in a reference library and then considers a weight factor with the maximum ratio as the optimal weight factor. We examined our approach by comparing the accuracy of compound identification using the mass spectral library maintained by the National Institute of Standards and Technology. The results demonstrate that the optimal weight factors for fragment ion peak intensity and m/z value found by the developed approach outperform the current weight factors for compound identification.
引用
收藏
页码:1158 / 1163
页数:6
相关论文
共 13 条
[1]   RELIABILITY RANKING AND SCALING IMPROVEMENTS TO THE PROBABILITY BASED MATCHING SYSTEM FOR UNKNOWN MASS-SPECTRA [J].
ATWATER, BL ;
STAUFFER, DB ;
MCLAFFERTY, FW ;
PETERSON, DW .
ANALYTICAL CHEMISTRY, 1985, 57 (04) :899-903
[2]   Improving large-scale proteomics by clustering of mass spectrometry data [J].
Beer, I ;
Barnea, E ;
Ziv, T ;
Admon, A .
PROTEOMICS, 2004, 4 (04) :950-960
[3]   Using annotated peptide mass spectrum libraries for protein identification [J].
Craig, R. ;
Cortens, J. C. ;
Fenyo, D. ;
Beavis, R. C. .
JOURNAL OF PROTEOME RESEARCH, 2006, 5 (08) :1843-1849
[4]   Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries [J].
Frewen, Barbara E. ;
Merrihew, Gennifer E. ;
Wu, Christine C. ;
Noble, William Stafford ;
MacCoss, Michael J. .
ANALYTICAL CHEMISTRY, 2006, 78 (16) :5678-5684
[5]  
Gustafson P, 1978, 2 FINN CORP
[6]   IDENTIFICATION OF MASS SPECTRA BY COMPUTER-SEARCHING A FILE OF KNOWN SPECTRA [J].
HERTZ, HS ;
HITES, RA ;
BIEMANN, K .
ANALYTICAL CHEMISTRY, 1971, 43 (06) :681-&
[7]   MassBank: a public repository for sharing mass spectral data for life sciences [J].
Horai, Hisayuki ;
Arita, Masanori ;
Kanaya, Shigehiko ;
Nihei, Yoshito ;
Ikeda, Tasuku ;
Suwa, Kazuhiro ;
Ojima, Yuya ;
Tanaka, Kenichi ;
Tanaka, Satoshi ;
Aoshima, Ken ;
Oda, Yoshiya ;
Kakazu, Yuji ;
Kusano, Miyako ;
Tohge, Takayuki ;
Matsuda, Fumio ;
Sawada, Yuji ;
Hirai, Masami Yokota ;
Nakanishi, Hiroki ;
Ikeda, Kazutaka ;
Akimoto, Naoshige ;
Maoka, Takashi ;
Takahashi, Hiroki ;
Ara, Takeshi ;
Sakurai, Nozomu ;
Suzuki, Hideyuki ;
Shibata, Daisuke ;
Neumann, Steffen ;
Iida, Takashi ;
Tanaka, Ken ;
Funatsu, Kimito ;
Matsuura, Fumito ;
Soga, Tomoyoshi ;
Taguchi, Ryo ;
Saito, Kazuki ;
Nishioka, Takaaki .
JOURNAL OF MASS SPECTROMETRY, 2010, 45 (07) :703-714
[8]   A method for quantitatively differentiating crude natural extracts using high-performance liquid chromatography electrospray mass spectrometry [J].
Julian, RK ;
Higgs, RE ;
Gygi, JD ;
Hilton, MD .
ANALYTICAL CHEMISTRY, 1998, 70 (15) :3249-3254
[9]   Wavelet- and Fourier-Transform-Based Spectrum Similarity Approaches to Compound Identification in Gas Chromatography/Mass Spectrometry [J].
Koo, Imhoi ;
Zhang, Xiang ;
Kim, Seongho .
ANALYTICAL CHEMISTRY, 2011, 83 (14) :5631-5638
[10]   MASS-SPECTRAL LIBRARY SEARCHES USING ION SERIES DATA COMPRESSION [J].
RASMUSSEN, GT ;
ISENHOUR, TL ;
MARSHALL, JC .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1979, 19 (02) :98-104