Deep MS/MS-Aided Structural-Similarity Scoring for Unknown Metabolite Identification

被引:61
作者
Ji, Hongchao [1 ]
Xu, Yamei [1 ]
Lu, Hongmei [1 ]
Zhang, Zhimin [1 ]
机构
[1] Cent South Univ, Coll Chem & Chem Engn, Changsha 410083, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
MOLECULAR-STRUCTURE DATABASES; MASS-SPECTROMETRY DATA; NEURAL-NETWORK; SPECTRA; PREDICTION; ALGORITHM; DESCRIPTOR; LIBRARY; TOOL;
D O I
10.1021/acs.analchem.8b05405
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Tandem mass spectrometry (MS/MS) is the workhorse for structural annotation of metabolites, because it can provide abundance of structural information. Currently, metabolite identification mainly relies on querying experimental spectra against public or in-house spectral databases. The identification is severely limited by the available spectra in the databases. Although, the metabolome consists of a huge number of different functional metabolites, the whole metabolome derives from a limited number of initial metabolites via bioreactions. In each bioreaction, the reactant and the product often change some substructures but are still structurally related. These structurally related metabolites often have related MS/MS spectra, which provide the possibility to identify unknown metabolites through known ones. However, it is challenging to explore the internal relationship between MS/MS spectra and structural similarity. In this study, we present the deep-learning-based approach for MS/MS-aided structural-similarity scoring (DeepMASS), which can score the structural similarity of unknown metabolite against the known one with MS/MS spectra and deep neural networks. We evaluated DeepMASS with leave-one-out cross-validation on MS/MS spectra of 662 compounds in KEGG and an external test on the biomarkers from male infertility study measured on Shimadzu LC-ESI-IT-TOF and Bruker Compact LC-ESI-QTOF. Results show that the identification of unknown compound is valid if its structure-related metabolite is available in the database. It provides an effective approach to extend the identification range of metabolites for existing MS/MS databases.
引用
收藏
页码:5629 / 5637
页数:9
相关论文
共 55 条
[1]   iMet: A Network-Based Computational Tool To Assist in the Annotation of Metabolites from Tandem Mass Spectra [J].
Aguilar-Mogas, Antoni ;
Sales-Pardo, Marta ;
Navarro, Miriam ;
Guimera, Roger ;
Yanes, Oscar .
ANALYTICAL CHEMISTRY, 2017, 89 (06) :3474-3482
[2]   Rhea-a manually curated resource of biochemical reactions [J].
Alcantara, Rafael ;
Axelsen, Kristian B. ;
Morgat, Anne ;
Belda, Eugeni ;
Coudert, Elisabeth ;
Bridge, Alan ;
Cao, Hong ;
de Matos, Paula ;
Ennis, Marcus ;
Turner, Steve ;
Owen, Gareth ;
Bougueleret, Lydie ;
Xenarios, Ioannis ;
Steinbeck, Christoph .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D754-D760
[3]   CFM-ID: a web server for annotation, spectrum prediction and metabolite identification from tandem mass spectra [J].
Allen, Felicity ;
Pon, Allison ;
Wilson, Michael ;
Greiner, Russ ;
Wishart, David .
NUCLEIC ACIDS RESEARCH, 2014, 42 (W1) :W94-W99
[4]   Application of ensemble deep neural network to metabolomics studies [J].
Asakura, Taiga ;
Date, Yasuhiro ;
Kikuchi, Jun .
ANALYTICA CHIMICA ACTA, 2018, 1037 :230-236
[5]   Quantitative Comparison of Tandem Mass Spectra Obtained on Various Instruments [J].
Bazso, Fanni Laura ;
Ozohanics, Oliver ;
Schlosser, Gitta ;
Ludanyi, Krisztina ;
Vekey, Karoly ;
Drahos, Laszlo .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2016, 27 (08) :1357-1365
[6]   Deep learning for tumor classification in imaging mass spectrometry [J].
Behrmann, Jens ;
Etmann, Christian ;
Boskamp, Tobias ;
Casadonte, Rita ;
Kriegsmann, Joerg ;
Maass, Peter .
BIOINFORMATICS, 2018, 34 (07) :1215-1223
[7]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[8]   XCMS2:: Processing tandem mass spectrometry data for metabolite identification and structural characterization [J].
Benton, H. P. ;
Wong, D. M. ;
Trauger, S. A. ;
Siuzdak, G. .
ANALYTICAL CHEMISTRY, 2008, 80 (16) :6382-6389
[9]   Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy [J].
Blazenovic, Ivana ;
Kind, Tobias ;
Torbasinovic, Hrvoje ;
Obrenovic, Slobodan ;
Mehta, Sajjan S. ;
Tsugawa, Hiroshi ;
Wermuth, Tobias ;
Schauer, Nicolas ;
Jahn, Martina ;
Biedendieck, Rebekka ;
Jahn, Dieter ;
Fiehn, Oliver .
JOURNAL OF CHEMINFORMATICS, 2017, 9
[10]   Searching molecular structure databases using tandem MS data: are we there yet? [J].
Boecker, Sebastian .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2017, 36 :1-6