Discriminant analysis and feature selection in mass spectrometry imaging using constrained repeated random sampling - Cross validation (CORRS-CV)

Cited by: 13
Authors
Perez-Guaita, David [1]
Quintas, Guillermo [2, 3]
Kuligowski, Julia [4]
Affiliations
[1] FOCAS Res Inst, Dublin, Ireland
[2] LEITAT Technol Ctr, Hlth & Biomed, Barcelona, Spain
[3] Hlth Res Inst Hosp La Fe, Unidad Analit, Valencia, Spain
[4] Hlth Res Inst Hosp La Fe, Neonatal Res Unit, Valencia, Spain
Funding
European Research Council
Keywords
Mass spectrometry imaging (MSI); Cross validation (CV); Constrained repeated random sampling-cross validation (CORRS-CV); Partial least squares-discriminant analysis (PLS-DA); Feature selection
DOI
10.1016/j.aca.2019.10.039
Chinese Library Classification
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
The identification of biomarkers through mass spectrometry imaging (MSI) is gaining popularity in the clinical field. However, given the complexity of the spectral and spatial variables involved, data mining of the hyperspectral images can be troublesome. The discovery of markers generally depends on the creation of classification models, which should be validated to ensure the statistical significance of the discriminant m/z values detected. Internal validation using resampling methods such as cross validation (CV) is widely used for model selection, estimation of generalization performance, and biomarker discovery when sample sizes are limited and an independent test set is not available. Here, we introduce for the first time the use of Constrained Repeated Random Subsampling CV (CORRS-CV) on multiple images for the validation of classification models in MSI. Although several aspects must be taken into account (e.g. image size, the CORRS-CV a value, the similarity across spatially close pixels, the total computation time), CORRS-CV provides more accurate estimates of model performance than k-fold CV that uses biological replicates to define the data split, when biological replicates are scarce and holding images back for testing would waste valuable information. In addition, the combined use of CORRS-CV and rank products increases the robustness of the selection of discriminant features as candidate biomarkers, which is an important issue given the increased biological, environmental and technical variability when analysing multiple images, especially from human tissues collected in clinical studies. (C) 2019 Elsevier B.V. All rights reserved.
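The rank-product step mentioned in the abstract can be illustrated with a minimal sketch. It assumes the standard definition of a rank product (the geometric mean of a feature's ranks across replicates) and assumes that each CV repeat yields one importance score per m/z feature. The function name `rank_product` and the toy scores are hypothetical and are not taken from the paper.

```python
import numpy as np

def rank_product(scores):
    """Geometric mean of per-repeat ranks for each feature.

    scores: array of shape (n_repeats, n_features); a larger score is
    assumed to mean a more discriminant feature (rank 1 = best).
    Ties are broken by column order, which is a simplification.
    """
    scores = np.asarray(scores, dtype=float)
    n_repeats, n_features = scores.shape
    # Column indices sorted from highest to lowest score within each repeat.
    order = np.argsort(-scores, axis=1, kind="stable")
    # Invert the permutation so ranks[i, j] is the rank of feature j in repeat i.
    ranks = np.empty_like(order)
    rows = np.arange(n_repeats)[:, None]
    ranks[rows, order] = np.arange(1, n_features + 1)
    # Geometric mean across repeats, computed in log space for stability.
    return np.exp(np.log(ranks).mean(axis=0))

# Toy example: 3 hypothetical CV repeats, 3 hypothetical m/z features.
toy_scores = [[0.9, 0.1, 0.5],
              [0.8, 0.2, 0.6],
              [0.7, 0.3, 0.1]]
rp = rank_product(toy_scores)
print(rp)  # a smaller rank product means a more consistently top-ranked feature
```

A feature that ranks near the top in every repeat gets a rank product close to 1, so sorting features by ascending rank product gives a candidate list that is less sensitive to any single data split.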
Pages: 30-36
Number of pages: 7
Related papers
20 in total
  • [1] Spatial Segmentation of Imaging Mass Spectrometry Data with Edge-Preserving Image Denoising and Clustering
    Alexandrov, Theodore
    Becker, Michael
    Deininger, Soren-Oliver
    Ernst, Gunther
    Wehder, Liane
    Grasmair, Markus
    von Eggeling, Ferdinand
    Thiele, Herbert
    Maass, Peter
    [J]. JOURNAL OF PROTEOME RESEARCH, 2010, 9 (12) : 6535 - 6546
  • [2] [Anonymous], 2009, CHEMOMETRICS PATTERN
  • [3] Cardinal: an R package for statistical analysis of mass spectrometry-based imaging experiments
    Bemis, Kyle D.
    Harry, April
    Eberlin, Livia S.
    Ferreira, Christina
    van de Ven, Stephanie M.
    Mallick, Parag
    Stolowitz, Mark
    Vitek, Olga
    [J]. BIOINFORMATICS, 2015, 31 (14) : 2418 - 2420
  • [4] Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments
    Breitling, R
    Armengaud, P
    Amtmann, A
    Herzyk, P
    [J]. FEBS LETTERS, 2004, 573 (1-3) : 83 - 92
  • [5] Mass Spectrometry Imaging: A Review of Emerging Advancements and Future Insights
    Buchberger, Amanda Rae
    DeLaney, Kellen
    Johnson, Jillian
    Li, Lingjun
    [J]. ANALYTICAL CHEMISTRY, 2018, 90 (01) : 240 - 265
  • [6] Mass spectrometry imaging: How will it affect clinical research in the future?
    Dilillo, Marialaura
    Heijs, Bram
    McDonnell, Liam A.
    [J]. EXPERT REVIEW OF PROTEOMICS, 2018, 15 (09) : 709 - 716
  • [7] Multivariate statistical differentiation of renal cell carcinomas based on lipidomic analysis by ambient ionization imaging mass spectrometry
    Dill, Allison L.
    Eberlin, Livia S.
    Zheng, Cheng
    Costa, Anthony B.
    Ifa, Demian R.
    Cheng, Liang
    Masterson, Timothy A.
    Koch, Michael O.
    Vitek, Olga
    Cooks, R. Graham
    [J]. ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2010, 398 (7-8) : 2969 - 2978
  • [8] Past-in-the-Future. Peak detection improves targeted mass spectrometry imaging
    Falcetta, Francesca
    Morosi, Lavinia
    Ubezio, Paolo
    Giordano, Silvia
    Decio, Alessandra
    Giavazzi, Raffaella
    Frapolli, Roberta
    Prasad, Mridula
    Franceschi, Pietro
    D'Incalci, Maurizio
    Davoli, Enrico
    [J]. ANALYTICA CHIMICA ACTA, 2018, 1042 : 1 - 10
  • [9] Comparison of the variable importance in projection (VIP) and of the selectivity ratio (SR) methods for variable selection and interpretation
    Farres, Mireia
    Platikanov, Stefan
    Tsakovski, Stefan
    Tauler, Roma
    [J]. JOURNAL OF CHEMOMETRICS, 2015, 29 (10) : 528 - 536
  • [10] Friedman J., 2017, ELEMENTS STAT LEARNI