Rapid Prediction of Electron-Ionization Mass Spectrometry Using Neural Networks

被引:126
作者
Wei, Jennifer N. [1 ,2 ]
Belanger, David [1 ]
Adams, Ryan P. [3 ]
Sculley, D. [1 ]
机构
[1] Google Brain, Cambridge, MA 02142 USA
[2] Harvard Univ, Dept Chem & Chem Biol, Cambridge, MA 02138 USA
[3] Princeton Univ, Dept Comp Sci, Princeton, NJ 08540 USA
关键词
SPECTRA; IDENTIFICATION; RESOURCE;
D O I
10.1021/acscentsci.9b00085
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
When confronted with a substance of unknown identity, researchers often perform mass spectrometry on the sample and compare the observed spectrum to a library of previously collected spectra to identify the molecule. While popular, this approach will fail to identify molecules that are not in the existing library. In response, we propose to improve the library's coverage by augmenting it with synthetic spectra that are predicted from candidate molecules using machine learning. We contribute a lightweight neural network model that quickly predicts mass spectra for small molecules, averaging 5 ms per molecule with a recall-at-10 accuracy of 91.8%. Achieving high-accuracy predictions requires a novel neural network architecture that is designed to capture typical fragmentation patterns from electron ionization. We analyze the effects of our modeling innovations on library matching performance and compare our models to prior machine-learning-based work on spectrum prediction.
引用
收藏
页码:700 / 708
页数:9
相关论文
共 41 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Computational Prediction of Electron Ionization Mass Spectra to Assist in GC/MS Compound Identification [J].
Allen, Felicity ;
Pon, Allison ;
Greiner, Russ ;
Wishart, David .
ANALYTICAL CHEMISTRY, 2016, 88 (15) :7689-7697
[3]  
[Anonymous], 2015, ARXIV15120338
[4]   Quantum chemical calculation of electron ionization mass spectra for general organic and inorganic molecules [J].
Asgeirsson, Vilhjalmur ;
Bauer, Christoph A. ;
Grimme, Stefan .
CHEMICAL SCIENCE, 2017, 8 (07) :4879-4895
[5]   How to Compute Electron Ionization Mass Spectra from First Principles [J].
Bauer, Christoph Alexander ;
Grimme, Stefan .
JOURNAL OF PHYSICAL CHEMISTRY A, 2016, 120 (21) :3755-3766
[6]  
Buchanan B. G., 1981, READINGS ARTIFICIAL, P313
[7]  
Curry B., 1990, TETRAHED COMP METHOD, V3, P213, DOI [10.1016/0898-5529(90)90053-B, DOI 10.1016/0898-5529(90)90053-B]
[8]  
Duvenaud David K, 2015, C NEUR INF PROC SYST, P2224
[9]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[10]  
Gilmer J., 2017, ARXIV17040121