Fusion of metabolomics and proteomics data for biomarkers discovery: case study on the experimental autoimmune encephalomyelitis

Cited by: 60
Authors
Blanchet, Lionel [1 ]
Smolinska, Agnieszka [1 ]
Attali, Amos [2 ]
Stoop, Marcel P. [3 ]
Ampt, Kirsten A. M. [1 ]
van Aken, Hans [2 ]
Suidgeest, Ernst [2 ]
Tuinstra, Tinka [2 ]
Wijmenga, Sybren S. [1 ]
Luider, Theo [3 ]
Buydens, Lutgarde M. C. [1 ]
Affiliations
[1] Radboud Univ Nijmegen, Inst Mol & Mat, NL-6524 NP Nijmegen, Netherlands
[2] Abbott Healthcare Pharmaceut Nederland BV, NL-1381 CP Weesp, Netherlands
[3] Erasmus Univ, Med Ctr Rotterdam, Dept Neurol, NL-3015 GE Rotterdam, Netherlands
Source
BMC BIOINFORMATICS | 2011 / Vol. 12
Keywords
MULTIPLE-SCLEROSIS; CEREBROSPINAL-FLUID; PROTEIN EXPRESSION; METABONOMIC DATA; DISEASE; PLASMA; SERUM;
DOI
10.1186/1471-2105-12-254
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Analysis of Cerebrospinal Fluid (CSF) samples holds great promise to diagnose neurological pathologies and gain insight into the molecular background of these pathologies. Proteomics and metabolomics methods provide invaluable information on the biomolecular content of CSF and thereby on the possible status of the central nervous system, including neurological pathologies. The combined information provides a more complete description of CSF content. Extracting the full combined information requires a combined analysis of different datasets i.e. fusion of the data.
Results: A novel fusion method is presented and applied to proteomics and metabolomics data from a pre-clinical model of multiple sclerosis: an Experimental Autoimmune Encephalomyelitis (EAE) model in rats. The method follows a mid-level fusion architecture. The relevant information is extracted per platform using extended canonical variates analysis. The results are subsequently merged in order to be analyzed jointly. We find that the combined proteome and metabolome data allow for the efficient and reliable discrimination between healthy, peripherally inflamed rats, and rats at the onset of the EAE. The predicted accuracy reaches 89% on a test set. The important variables (metabolites and proteins) in this model are known to be linked to EAE and/or multiple sclerosis.
Conclusions: Fusion of proteomics and metabolomics data is possible. The main issues of high-dimensionality and missing values are overcome. The outcome leads to higher accuracy in prediction and more exhaustive description of the disease profile. The biological interpretation of the involved variables validates our fusion approach.
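Illustrative sketch (not the authors' implementation): the mid-level fusion architecture described in the abstract can be outlined roughly as follows. Features are extracted from each platform separately, the per-platform features are concatenated, and a single classifier is trained on the fused features. Everything here is an assumption made for illustration: the data are synthetic, the three class labels 0/1/2 merely stand in for the three animal groups, PCA replaces the extended canonical variates analysis used in the paper, and a nearest-centroid rule replaces whatever classifier the authors applied. The function names are hypothetical.

```python
import numpy as np

def block_scores(X_train, X_test, n_components):
    """Project one data block onto its top principal axes (fitted on training data).
    Assumption: PCA is a simple stand-in for the paper's extended canonical variates analysis."""
    mean = X_train.mean(axis=0)
    _, _, vt = np.linalg.svd(X_train - mean, full_matrices=False)
    axes = vt[:n_components].T
    return (X_train - mean) @ axes, (X_test - mean) @ axes

def fuse(blocks_train, blocks_test, n_components=2):
    """Mid-level fusion: extract features per block, then concatenate them column-wise."""
    pairs = [block_scores(a, b, n_components)
             for a, b in zip(blocks_train, blocks_test)]
    return np.hstack([p[0] for p in pairs]), np.hstack([p[1] for p in pairs])

def nearest_centroid_predict(F_train, y_train, F_test):
    """Assign each test sample to the class with the closest training centroid."""
    classes = np.unique(y_train)
    centroids = np.stack([F_train[y_train == c].mean(axis=0) for c in classes])
    dist = np.linalg.norm(F_test[:, None, :] - centroids[None, :, :], axis=2)
    return classes[dist.argmin(axis=1)]

# Synthetic demo data (hypothetical): 3 classes x 20 samples, two "platforms".
rng = np.random.default_rng(0)
y = np.repeat([0, 1, 2], 20)

def make_block(n_vars, shift):
    """Random block in which the first three variables carry a class-dependent shift."""
    X = rng.normal(size=(y.size, n_vars))
    X[:, :3] += shift * y[:, None]
    return X

proteomics = make_block(50, 3.0)    # stands in for a proteomics table
metabolomics = make_block(30, 3.0)  # stands in for a metabolomics table

# Alternate samples between training and test sets.
train = np.arange(y.size) % 2 == 0
test = ~train

F_train, F_test = fuse([proteomics[train], metabolomics[train]],
                       [proteomics[test], metabolomics[test]])
y_pred = nearest_centroid_predict(F_train, y[train], F_test)
accuracy = float((y_pred == y[test]).mean())
print(f"fused feature matrix: {F_train.shape}, test accuracy: {accuracy:.2f}")
```

The point of the sketch is the ordering of steps: dimensionality is reduced within each platform before merging, so neither high-dimensional block dominates the fused matrix purely by its number of variables.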
Pages: 12
Related papers
55 entries in total
  • [1] The N-way Toolbox for MATLAB
    Andersson, CA
    Bro, R
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2000, 52 (01) : 1 - 4
  • [2] Improving the speed of multi-way algorithms: Part I. Tucker3
    Andersson, CA
    Bro, R
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1998, 42 (1-2) : 93 - 103
  • [3] [Anonymous], Graphviz - Graph Visualization Software
  • [4] [Anonymous], 2000, Principles of multivariate analysis
  • [5] Complement and demyelinating disease: No MAC needed?
    Barnum, Scott R.
    Szalai, Alexander J.
    [J]. BRAIN RESEARCH REVIEWS, 2006, 52 (01) : 58 - 68
  • [6] The origin and application of experimental autoimmune encephalomyelitis
    Baxter, Alan G.
    [J]. NATURE REVIEWS IMMUNOLOGY, 2007, 7 (11) : 904 - 912
  • [7] BLOEMBERG TG, CHEMOM INTELL LAB SY
  • [8] Brown SD, 2009, COMPREHENSIVE CHEMOMETRICS: CHEMICAL AND BIOCHEMICAL DATA ANALYSIS, VOLS 1-4, P1
  • [9] Chambers G, 2000, J PATHOL, V192, P280, DOI 10.1002/1096-9896(200011)192:3<280::AID-PATH748>3.0.CO;2-L