Binary Simplification as an Effective Tool in Metabolomics Data Analysis

被引:8
作者
Traquete, Francisco [1 ]
Luz, Joao [1 ]
Cordeiro, Carlos [1 ]
Sousa Silva, Marta [1 ]
Ferreira, Antonio E. N. [1 ]
机构
[1] Univ Lisbon, MARE Marine & Environm Sci Ctr, Lab FTICR & Espectrometria Massa Estrutural, Fac Ciencias, P-1749016 Lisbon, Portugal
基金
瑞典研究理事会; 欧盟地平线“2020”;
关键词
metabolomics; data treatment; data analysis; Fourier Transform Ion Cyclotron Resonance mass spectrometry; multivariate analysis;
D O I
10.3390/metabo11110788
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Metabolomics aims to perform a comprehensive identification and quantification of the small molecules present in a biological system. Due to metabolite diversity in concentration, structure, and chemical characteristics, the use of high-resolution methodologies, such as mass spectrometry (MS) or nuclear magnetic resonance (NMR), is required. In metabolomics data analysis, suitable data pre-processing, and pre-treatment procedures are fundamental, with subsequent steps aiming at highlighting the significant biological variation between samples over background noise. Traditional data analysis focuses primarily on the comparison of the features' intensity values. However, intensity data are highly variable between experimental batches, instruments, and pre-processing methods or parameters. The aim of this work was to develop a new pre-treatment method for MS-based metabolomics data, in the context of sample profiling and discrimination, considering only the occurrence of spectral features, encoding feature presence as 1 and absence as 0. This "Binary Simplification " encoding (BinSim) was used to transform several benchmark datasets before the application of clustering and classification methods. The performance of these methods after the BinSim pre-treatment was consistently as good as and often better than after different combinations of traditional, intensity-based, pre-treatments. Binary Simplification is, therefore, a viable pre-treatment procedure that effectively simplifies metabolomics data-analysis pipelines.
引用
收藏
页数:23
相关论文
共 39 条
  • [1] Analytical methods in untargeted metabolomics: state of the art in 2015
    Alonso, Arnald
    Marsal, Sara
    Julia, Antonio
    [J]. FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2015, 3
  • [2] A roadmap of clustering algorithms: finding a match for a biomedical application
    Andreopoulos, Bill
    An, Aijun
    Wang, Xiaogang
    Schroeder, Michael
    [J]. BRIEFINGS IN BIOINFORMATICS, 2009, 10 (03) : 297 - 314
  • [3] STABILITY OF 2 HIERARCHICAL GROUPING TECHNIQUES CASE 1 - SENSITIVITY TO DATA ERRORS
    BAKER, FB
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1974, 69 (346) : 440 - 445
  • [4] Statistical methods for the analysis of high-throughput metabolomics data
    Bartel, Joerg
    Krumsiek, Jan
    Theis, Fabian J.
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2013, 4 (05):
  • [5] Analysis of metabolomic data: tools, current strategies and future challenges for omics data integration
    Cambiaghi, Alice
    Ferrario, Manuela
    Masseroli, Marco
    [J]. BRIEFINGS IN BIOINFORMATICS, 2017, 18 (03) : 498 - 510
  • [6] Preoperative Metabolic Signatures of Prostate Cancer Recurrence Following Radical Prostatectomy
    Clendinen, Chaevien S.
    Gaul, David A.
    Eugenia Monge, Maria
    Arnold, Rebecca S.
    Edison, Arthur S.
    Petros, John A.
    Fernandez, Facundo M.
    [J]. JOURNAL OF PROTEOME RESEARCH, 2019, 18 (03) : 1316 - 1327
  • [7] Non-targeted UHPLC-MS metabolomic data processing methods: a comparative investigation of normalisation, missing value imputation, transformation and scaling
    Di Guida, Riccardo
    Engel, Jasper
    Allwood, J. William
    Weber, Ralf J. M.
    Jones, Martin R.
    Sommer, Ulf
    Viant, Mark R.
    Dunn, Warwick B.
    [J]. METABOLOMICS, 2016, 12 (05)
  • [8] Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures.: Application in 1H NMR metabonomics
    Dieterle, Frank
    Ross, Alfred
    Schlotterbeck, Gotz
    Senn, Hans
    [J]. ANALYTICAL CHEMISTRY, 2006, 78 (13) : 4281 - 4290
  • [9] Ferreira, 2020, DATASET, DOI [10.6084/m9.figshare.12357314.v1, DOI 10.6084/M9.FIGSHARE.12357314.V1]
  • [10] dendextend: an R package for visualizing, adjusting and comparing trees of hierarchical clustering
    Galili, Tal
    [J]. BIOINFORMATICS, 2015, 31 (22) : 3718 - 3720