Mass Spectral Feature List Optimizer (MS-FLO): A Tool To Minimize False Positive Peak Reports in Untargeted Liquid Chromatography-Mass Spectroscopy (LC-MS) Data Processing

被引:139
作者
DeFelice, Brian C. [1 ]
Mehta, Sajjan Singh [1 ]
Samra, Stephanie [1 ]
Cajka, Tomas [1 ]
Wancewicz, Benjamin [1 ]
Fahrmann, Johannes F. [1 ,2 ]
Fiehn, Oliver [1 ,3 ]
机构
[1] Univ Calif Davis, West Coast Metabol Ctr, 451 E Hlth Sci Dr,Rm 1300, Davis, CA 95616 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Clin Canc Prevent, 6767 Bertner Ave, Houston, TX 77030 USA
[3] King Abdulaziz Univ, Dept Biochem, Fac Sci, Jeddah 21589, Saudi Arabia
关键词
METABOLOMICS DATA; SPECTROMETRY DATA; STRATEGY; DESIGN;
D O I
10.1021/acs.analchem.6b04372
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Untargeted metabolomics by liquid chromatography-mass spectrometry generates data-rich chromatograms in the form of m/z-retention time features. Managing such datasets is a bottleneck. Many popular data processing tools, including XCMS-online and MZmine2, yield numerous false-positive peak detections. Flagging and removing such false peaks manually is a time-consuming task and prone to human error. We present a web application, Mass Spectral Feature List Optimizer (MS-FLO), to improve the quality of feature lists after initial processing to expedite the process of data curation. The tool utilizes retention time alignments, accurate mass tolerances, Pearson's correlation analysis, and peak height similarity to identify ion adducts, duplicate peak reports, and isotopic features of the main monoisotopic metabolites. Removing such erroneous peaks reduces the overall number of metabolites in data reports and improves the quality of subsequent statistical investigations. To demonstrate the effectiveness of MS-FLO, we processed 28 biological studies and uploaded raw and results data to the Metabolomics Workbench website (www.metabolomicsworkbench.org), encompassing 1481 chromatograms produced by two different data processing programs used in-house (MZmine2 and later MS-DIAL). Post-processing of datasets with MS-FLO yielded a 7.8% automated reduction of total peak features and flagged an additional 7.9% of features, per dataset, for review by the user. When manually curated, 87% of these additional flagged features were verified false positives. MS-FLO is an open source web application that is freely available for use at http://msflo.fiehnlab.ucdavis.edu.
引用
收藏
页码:3250 / 3255
页数:6
相关论文
共 11 条
[1]   Toward Merging Untargeted and Targeted Methods in Mass Spectrometry-Based Metabolomics and Lipidomics [J].
Cajka, Tomas ;
Fiehn, Oliver .
ANALYTICAL CHEMISTRY, 2016, 88 (01) :524-545
[2]   Algorithms and tools for the preprocessing of LC-MS metabolomics data [J].
Castillo, Sandra ;
Gopalacharyulu, Peddinti ;
Yetukuri, Laxman ;
Oresic, Matej .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2011, 108 (01) :23-32
[3]   Strategy for Optimizing LC-MS Data Processing in Metabolomics: A Design of Experiments Approach [J].
Eliasson, Mattias ;
Rannar, Stefan ;
Madsen, Rasmus ;
Donten, Magdalena A. ;
Marsden-Edwards, Emma ;
Moritz, Thomas ;
Shockcor, John P. ;
Johansson, Erik ;
Trygg, Johan .
ANALYTICAL CHEMISTRY, 2012, 84 (15) :6869-6876
[4]   ALLocator: An Interactive Web Platform for the Analysis of Metabolomic LC-ESI-MS Datasets, Enabling Semi-Automated, User-Revised Compound Annotation and Mass Isotopomer Ratio Analysis [J].
Kessler, Nikolas ;
Walter, Frederik ;
Persicke, Marcus ;
Albaum, Stefan P. ;
Kalinowski, Joern ;
Goesmann, Alexander ;
Niehaus, Karsten ;
Nattkemper, Tim W. .
PLOS ONE, 2014, 9 (11)
[5]   CAMERA: An Integrated Strategy for Compound Spectra Extraction and Annotation of Liquid Chromatography/Mass Spectrometry Data Sets [J].
Kuhl, Carsten ;
Tautenhahn, Ralf ;
Boettcher, Christoph ;
Larson, Tony R. ;
Neumann, Steffen .
ANALYTICAL CHEMISTRY, 2012, 84 (01) :283-289
[6]   MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data [J].
Pluskal, Tomas ;
Castillo, Sandra ;
Villar-Briones, Alejandro ;
Oresic, Matej .
BMC BIOINFORMATICS, 2010, 11
[7]   XCMS: Processing mass spectrometry data for metabolite profiling using Nonlinear peak alignment, matching, and identification [J].
Smith, CA ;
Want, EJ ;
O'Maille, G ;
Abagyan, R ;
Siuzdak, G .
ANALYTICAL CHEMISTRY, 2006, 78 (03) :779-787
[8]   Metabolomics Workbench: An international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools [J].
Sud, Manish ;
Fahy, Eoin ;
Cotter, Dawn ;
Azam, Kenan ;
Vadivelu, Ilango ;
Burant, Charles ;
Edison, Arthur ;
Fiehn, Oliver ;
Higashi, Richard ;
Nair, K. Sreekumaran ;
Sumner, Susan ;
Subramaniam, Shankar .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D463-D470
[9]   MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis [J].
Tsugawa, Hiroshi ;
Cajka, Tomas ;
Kind, Tobias ;
Ma, Yan ;
Higgins, Brendan ;
Ikeda, Kazutaka ;
Kanazawa, Mitsuhiro ;
VanderGheynst, Jean ;
Fiehn, Oliver ;
Arita, Masanori .
NATURE METHODS, 2015, 12 (06) :523-+
[10]   Ion Fusion of High-Resolution LC MS-Based Metabolomics Data to Discover More Reliable Biomarkers [J].
Zeng, Zhongda ;
Liu, Xinyu ;
Dai, Weidong ;
Yin, Peiyuan ;
Zhou, Lina ;
Huang, Qiang ;
Lin, Xiaohui ;
Xu, Guowang .
ANALYTICAL CHEMISTRY, 2014, 86 (08) :3793-3800