Concise Representation of Mass Spectrometry Images by Probabilistic Latent Semantic Analysis

被引:94
作者
Hanselmann, Michael [1 ]
Kirchner, Marc [1 ]
Renard, Bernhard Y. [1 ]
Amstalden, Erika R. [2 ]
Glunde, Kristine [3 ]
Heeren, Ron M. A. [2 ]
Hamprecht, Fred A. [1 ]
机构
[1] Heidelberg Univ, Interdisciplinary Ctr Sci Comp IWR, Heidelberg, Germany
[2] FOM, Inst Atom & Mol Phys, AMOLF, Amsterdam, Netherlands
[3] Johns Hopkins Univ, Dept Radiol & Radiol Sci, Sch Med, Baltimore, MD 21205 USA
关键词
D O I
10.1021/ac801303x
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Imaging mass spectrometry (IMS) is a promising technology which allows for detailed analysis of spatial distributions of (bio)molecules in organic samples. In many current applications, IMS relies heavily on (semi)automated exploratory data analysis procedures to decompose the data into characteristic component spectra and corresponding abundance maps, visualizing spectral and spatial structure. The most commonly used techniques are principal component analysis (PCA) and independent component analysis (ICA). Both methods operate in an unsupervised manner. However, their decomposition estimates usually feature negative counts and are not amenable to direct physical interpretation. We propose probabilistic latent semantic analysis (pLSA) for non-negative decomposition and the elucidation of interpretable component spectra and abundance maps. We compare this algorithm to PCA, ICA, and non-negative PARAFAC (parallel factors analysis) and show on simulated and real-world data that pLSA and non-negative PARAFAC are superior to PCA or ICA in terms of complementarity of the resulting components and reconstruction accuracy. We further combine pLSA decomposition with a statistical complexity estimation scheme based on the Akaike information criterion (AIC) to automatically estimate the number of components present in a tissue sample data set and show that this results in sensible complexity estimates.
引用
收藏
页码:9649 / 9658
页数:10
相关论文
共 30 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
[Anonymous], P 28 INT ACM SIGIR C
[3]   TOF-SIMS characterization and imaging of controlled-release drug delivery systems [J].
Belu, AM ;
Davies, MC ;
Newton, JM ;
Patel, N .
ANALYTICAL CHEMISTRY, 2000, 72 (22) :5625-5638
[4]   The study of an iterative method for the reconstruction of images corrupted by Poisson and Gaussian noise [J].
Benvenuto, F. ;
La Camera, A. ;
Theys, C. ;
Ferrari, A. ;
Lanteri, H. ;
Bertero, M. .
INVERSE PROBLEMS, 2008, 24 (03)
[5]  
Bro R, 1998, THESIS U AMSTERDAM N
[6]  
Broersen A, 2005, PROCEEDINGS OF THE FIFTH IASTED INTERNATIONAL CONFERENCE ON VISUALIZATION, IMAGING, AND IMAGE PROCESSING, P540
[7]  
Burnham K.P., 2002, Model selection and multimodel inference: a practical information-theoretic approach, DOI 10.1007/978-1-4757-2917-7_3
[8]   Molecular imaging of biological samples: Localization of peptides and proteins using MALDI-TOF MS [J].
Caprioli, RM ;
Farmer, TB ;
Gile, J .
ANALYTICAL CHEMISTRY, 1997, 69 (23) :4751-4760
[9]   ANALYSIS OF INDIVIDUAL DIFFERENCES IN MULTIDIMENSIONAL SCALING VIA AN N-WAY GENERALIZATION OF ECKART-YOUNG DECOMPOSITION [J].
CARROLL, JD ;
CHANG, JJ .
PSYCHOMETRIKA, 1970, 35 (03) :283-&
[10]   Imaging mass spectrometry: a new tool to investigate the spatial organization of peptides and proteins in mammalian tissue sections [J].
Chaurand, P ;
Schwartz, SA ;
Caprioli, RM .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2002, 6 (05) :676-681