Metabolite fingerprinting: detecting biological features by independent component analysis

被引:183
作者
Scholz, M [1 ]
Gatzek, S
Sterling, A
Fiehn, O
Selbig, J
机构
[1] Max Planck Inst Mol Plant Physiol, D-14424 Potsdam, Germany
[2] Advion BioSci Ltd, Norwich NR9 3DB, Norfolk, England
关键词
D O I
10.1093/bioinformatics/bth270
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Metabolite fingerprinting is a technology for providing information from spectra of total compositions of metabolites. Here, spectra acquisitions by microchip-based nanoflow-direct-infusion QTOF mass spectrometry, a simple and high throughput technique, is tested for its informative power. As a simple test case we are using Arabidopsis thaliana crosses. The question is how metabolite fingerprinting reflects the biological background. In many applications the classical principal component analysis (PCA) is used for detecting relevant information. Here a modern alternative is introduced-the independent component analysis (ICA). Due to its independence condition, ICA is more suitable for our questions than PCA. However, ICA has not been developed for a small number of high-dimensional samples, therefore a strategy is needed to overcome this limitation. Results: To apply ICA successfully it is essential first to reduce the high dimension of the dataset, by using PCA. The number of principal components determines the quality of ICA significantly, therefore we propose a criterion for estimating the optimal dimension automatically. The kurtosis measure is used to order the extracted components to our interest. Applied to our A. thaliana data, ICA detects three relevant factors, two biological and one technical, and clearly outperforms the PCA.
引用
收藏
页码:2447 / 2454
页数:8
相关论文
共 21 条
[1]  
[Anonymous], 2002, Adaptive Blind Signal and Image Processing: Learning Algorithms and Applications
[2]  
[Anonymous], P INT C ART NEUR NET
[3]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[4]   INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT [J].
COMON, P .
SIGNAL PROCESSING, 1994, 36 (03) :287-314
[5]  
Diamantaras KI, 1996, Principal Component Neural Networks: Theory and Applications
[6]   Combining genomics, metabolome analysis, and biochemical modelling to understand metabolic networks [J].
Fiehn, O .
COMPARATIVE AND FUNCTIONAL GENOMICS, 2001, 2 (03) :155-168
[7]   Chemometric discrimination of unfractionated plant extracts analyzed by electrospray mass spectrometry [J].
Goodacre, R ;
York, EV ;
Heald, JK ;
Scott, IM .
PHYTOCHEMISTRY, 2003, 62 (06) :859-863
[8]  
Hyvärinen A, 2001, INDEPENDENT COMPONENT ANALYSIS: PRINCIPLES AND PRACTICE, P71
[9]   Independent component analysis:: algorithms and applications [J].
Hyvärinen, A ;
Oja, E .
NEURAL NETWORKS, 2000, 13 (4-5) :411-430
[10]   Metabolic fingerprinting of salt-stressed tomatoes [J].
Johnson, HE ;
Broadhurst, D ;
Goodacre, R ;
Smith, AR .
PHYTOCHEMISTRY, 2003, 62 (06) :919-928