NMR metabolic analysis of samples using fuzzy K-means clustering

Times cited: 26
Authors
Cuperlovic-Culf, Miroslava [1 ]
Belacel, Nabil [1 ]
Culf, Adrian S. [2 ]
Chute, Ian C. [2 ]
Ouellette, Rodney J. [2 ]
Burton, Ian W. [3 ]
Karakach, Tobias K. [3 ]
Walter, John A. [3 ]
Affiliations
[1] Natl Res Council Canada, Inst Informat Technol, Moncton, NB E1A 7R1, Canada
[2] Atlantic Canc Res Inst, Moncton, NB, Canada
[3] Natl Res Council Canada, Atlantic Reg Lab, Inst Marine Biosci, Halifax, NS B3H 3Z1, Canada
Keywords
fuzzy clustering; sample classification; metabolomics; metabolic profiling; mixture analysis; sample subtypes; H-1; NMR; phenotype analysis; CANCER CELL-LINES; H-1-NMR METABONOMICS; C-MEANS; CLASSIFICATION; GUILT;
DOI
10.1002/mrc.2502
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The global analysis of metabolites can be used to define the phenotypes of cells, tissues or organisms. Classifying groups of samples based on their metabolic profile is one of the main topics of metabolomics research. Crisp clustering methods assign each feature to one cluster, thereby omitting information about the multiplicity of sample subtypes. Here, we present the application of the fuzzy K-means clustering method to the classification of samples based on metabolomics 1D H-1 NMR fingerprints. The sample classification was performed on NMR spectra of cancer cell line extracts and of urine samples from type 2 diabetes patients and animal models. The cell line data set included NMR spectra of lipophilic cell extracts for two normal and three cancer cell lines, the cancer cell lines comprising two invasive and one non-invasive line. The second data set included previously published NMR spectra of urine samples from human type 2 diabetics and healthy controls, wild-type and diabetes-model mice, and obese and lean rats. The fuzzy K-means clustering method allowed more accurate sample classification in both data sets than the other tested methods, including principal component analysis (PCA), hierarchical clustering (HCL) and K-means clustering. In the cell line samples, fuzzy clustering provided a clear separation of individual cell lines, of the groups of cancer and normal cell lines, and of non-invasive and invasive tumour cell lines. In the diabetes data set, clear separation of healthy controls and diabetics in all three models was possible only by using the fuzzy clustering method. Copyright (C) 2009 Crown in the right of Canada. Published by John Wiley & Sons, Ltd.
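The contrast between crisp and fuzzy assignment described in the abstract can be illustrated with a minimal sketch of the standard fuzzy c-means (fuzzy K-means) update rules. This is not the authors' implementation: the function name `fuzzy_kmeans`, the fuzzifier value `m = 2`, the random initialisation and the synthetic "spectra" are all assumptions made for illustration only.

```python
import numpy as np

def fuzzy_kmeans(X, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means on X of shape (n_samples, n_features).

    Returns (centroids, U), where U[i, k] is the degree of membership
    of sample i in cluster k and each row of U sums to 1.
    """
    rng = np.random.default_rng(seed)
    # Random initial memberships, normalised so each row sums to 1.
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        # Centroids are means of the samples weighted by membership**m.
        W = U ** m
        centroids = (W.T @ X) / W.sum(axis=0)[:, None]
        # Euclidean distance of every sample to every centroid.
        d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        d = np.maximum(d, 1e-12)  # guard against division by zero
        # Membership update: u_ik proportional to d_ik ** (-2 / (m - 1)).
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centroids, U

# Synthetic stand-in for binned spectra (assumption, not real NMR data):
# two tight groups of 10 samples plus one sample halfway between them.
rng = np.random.default_rng(1)
X = np.vstack([
    rng.normal(0.0, 0.1, (10, 5)),
    rng.normal(1.0, 0.1, (10, 5)),
    np.full((1, 5), 0.5),
])
centroids, U = fuzzy_kmeans(X, n_clusters=2)
crisp_labels = U.argmax(axis=1)  # what a crisp method would report
print(np.round(U[[0, 10, 20]], 3))
```

A crisp method would have to place the intermediate sample in one cluster, whereas the membership matrix `U` records that it belongs partly to both, which is the kind of subtype information the abstract says crisp clustering omits.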
Pages: S96-S104
Page count: 9