Compression strategies for the chemometric analysis of mass spectrometry imaging data

被引:25
作者
Bedia, Carmen [1 ]
Tauler, Roma [1 ]
Jaumot, Joaquim [1 ]
机构
[1] IDAEA CSIC, Dept Environm Chem, Jordi Girona 18-26, Barcelona 08034, Catalonia, Spain
基金
欧洲研究理事会;
关键词
data compression; mass spectrometry imaging; preprocessing; MULTIVARIATE CURVE RESOLUTION; DATA SETS; MCR-ALS; METABONOMICS; METABOLOMICS; IMAGES; INFORMATION; PROTEOMICS;
D O I
10.1002/cem.2821
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Application of chemometric methods to mass spectrometry imaging (MSI) data faces a bottleneck concerning the vast size of the experimental data sets. This drawback is critical when considering high-resolution mass spectrometry data, which provide several thousand points for each considered pixel. In this work, different approaches have been tested to reduce the size of the analyzed data with the aim to allow the subsequent application of typical chemometric methods for image analysis. The standard approach for MSI data compression consists in binning mass spectra for each pixel to reduce the number of m/z values. In this work, a method is proposed to handle the huge size of MSI data based on the adaptation of a liquid chromatography-mass spectrometry data compression method by the detection of regions of interest. Results showed that both approaches achieved high compression rates, although the proposed regions of interest-based method attains this reduction requiring lower computational requirements and keeping utter spectral information. For instance, typical compression rate reached values higher than 90% without loss of information in images and spectra. Application of chemometric methods to high-resolution mass spectrometry images requires a preliminary compression of experimental data. An algorithm for the detection of relevant variables for the analysis of images obtained by mass spectrometry is presented. Evaluation of different approaches is performed considering achieved compression rate, mass spectrometry information loss, and computational resources needed.
引用
收藏
页码:575 / 588
页数:14
相关论文
共 39 条
[1]   Statistical methods for the analysis of high-throughput metabolomics data [J].
Bartel, Joerg ;
Krumsiek, Jan ;
Theis, Fabian J. .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2013, 4 (05)
[2]   Spectral pre-treatments of hyperspectral near infrared images: analysis of diffuse reflectance scattering [J].
Burger, James ;
Geladi, Paul .
JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2007, 15 (01) :29-37
[3]   Data handling in hyperspectral image analysis [J].
Burger, James ;
Gowen, Aoife .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2011, 108 (01) :13-22
[4]   Mass Spectrometric Imaging for Biomedical Tissue Analysis [J].
Chughtai, Kamila ;
Heeren, Ron M. A. .
CHEMICAL REVIEWS, 2010, 110 (05) :3237-3277
[5]   MALDI imaging mass spectrometry: molecular snapshots of biochemical systems [J].
Cornett, Dale S. ;
Reyzer, Michelle L. ;
Chaurand, Pierre ;
Caprioli, Richard M. .
NATURE METHODS, 2007, 4 (10) :828-833
[6]  
desJuan A, 2009, INFRARED RAMAN SPECT, P65
[7]   Using chemometrics for navigating in the large data sets of genomics, proteomics, and metabonomics (gpm) [J].
Eriksson, L ;
Antti, H ;
Gottfries, J ;
Holmes, E ;
Johansson, E ;
Lindgren, F ;
Long, I ;
Lundstedt, T ;
Trygg, J ;
Wold, S .
ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2004, 380 (03) :419-429
[8]   Vibrational spectroscopic image analysis of biological material using multivariate curve resolution-alternating least squares (MCR-ALS) [J].
Felten, Judith ;
Hall, Hardy ;
Jaumot, Joaquim ;
Tauler, Roma ;
de Juan, Anna ;
Gorzsas, Andras .
NATURE PROTOCOLS, 2015, 10 (02) :217-240
[9]   Robust Data Processing and Normalization Strategy for MALDI Mass Spectrometric Imaging [J].
Fonville, Judith M. ;
Carter, Claire ;
Cloarec, Olivier ;
Nicholson, Jeremy K. ;
Lindon, John C. ;
Bunch, Josephine ;
Holmes, Elaine .
ANALYTICAL CHEMISTRY, 2012, 84 (03) :1310-1319
[10]   SINGULAR VALUE DECOMPOSITION AND LEAST SQUARES SOLUTIONS [J].
GOLUB, GH ;
REINSCH, C .
NUMERISCHE MATHEMATIK, 1970, 14 (05) :403-&