NMR metabolic analysis of samples using fuzzy K-means clustering

被引:26
作者
Cuperlovic-Culf, Miroslava [1 ]
Belacel, Nabil [1 ]
Cuif, Adrian S. [2 ]
Chute, Ian C. [2 ]
Ouellette, Rodney J. [2 ]
Burton, Ian W. [3 ]
Karakach, Tobias K. [3 ]
Walter, John A. [3 ]
机构
[1] Natl Res Council Canada, Inst Informat Technol, Moncton, NB E1A 7R1, Canada
[2] Atlantic Canc Res Inst, Moncton, NB, Canada
[3] Natl Res Council Canada, Atlantic Reg Lab, Inst Marine Biosci, Halifax, NS B3H 3Z1, Canada
关键词
fuzzy clustering; sample classification; metabolomics; metabolic profiling; mixture analysis; sample subtypes; H-1; NMR; phenotype analysis; CANCER CELL-LINES; H-1-NMR METABONOMICS; C-MEANS; CLASSIFICATION; GUILT;
D O I
10.1002/mrc.2502
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The global analysis of metabolites can be used to define the phenotypes of cells, tissues or organisms. Classifying groups of samples based on their metabolic profile is one of the main topics of metabolomics research. Crisp clustering methods assign each feature to one cluster, thereby omitting information about the multiplicity of sample subtypes. Here, we present the application of fuzzy K-means clustering method for the classification of samples based on metabolomics 1D H-1 NMR fingerprints. The sample classification was performed on NMR spectra of cancer cell line extracts and of urine samples of type 2 diabetes patients and animal models. The cell line data set included NMR spectra of lipophilic cell extracts for two normal and three cancer cell lines with cancer cell lines including two invasive and one non-invasive cancers. The second data set included previously published NMR spectra of urine samples of human type 2 diabetics and healthy controls, mouse wild type and diabetes model and rat obese and lean phenotypes. The fuzzy K-means clustering method allowed more accurate sample classification in both data sets relative to the other tested methods including principal component analysis (PCA), hierarchical clustering (HCL) and K-means clustering. In the cell line samples, fuzzy clustering provided a clear separation of individual cell lines, groups of cancer and normal cell lines as well as non-invasive and invasive tumour cell lines. In the diabetes data set, clear separation of healthy controls and diabetics in all three models was possible only by using the fuzzy clustering method. Copyright (C) 2009 Crown in the right of Canada. Published by John Wiley & Sons, Ltd.
引用
收藏
页码:S96 / S104
页数:9
相关论文
共 29 条
[11]   Metabolic profiles of cancer cells [J].
Griffin, JL ;
Shockcor, JP .
NATURE REVIEWS CANCER, 2004, 4 (07) :551-561
[12]   Bagged K-means clustering of metabolome data [J].
Hageman, J. A. ;
van den Berg, R. A. ;
Westerhuis, J. A. ;
Hoefsloot, H. C. J. ;
Smilde, A. K. .
CRITICAL REVIEWS IN ANALYTICAL CHEMISTRY, 2006, 36 (3-4) :211-220
[13]   Negative impact of noise on the principal component analysis of NMR data [J].
Halouska, S ;
Powers, R .
JOURNAL OF MAGNETIC RESONANCE, 2006, 178 (01) :88-95
[14]   Metabolic phenotyping in health and disease [J].
Holmes, Elaine ;
Wilson, Ian D. ;
Nicholson, Jeremy K. .
CELL, 2008, 134 (05) :714-717
[15]   Relevance of breast cancer cell lines as models for breast tumours: an update [J].
Lacroix, M ;
Leclercq, G .
BREAST CANCER RESEARCH AND TREATMENT, 2004, 83 (03) :249-289
[16]   Human disease classification in the postgenomic era: A complex systems approach to human pathobiology [J].
Loscalzo, Joseph ;
Kohane, Isaac ;
Barabasi, Albert-Laszlo .
MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
[17]   1H NMR metabonomics approach to the disease continuum of diabetic complications and premature death [J].
Makinen, Ville-Petteri ;
Soininen, Pasi ;
Forsblom, Carol ;
Parkkonen, Maija ;
Ingman, Petri ;
Kaski, Kimmo ;
Groop, Per-Henrik ;
Ala-Korpela, Mika .
MOLECULAR SYSTEMS BIOLOGY, 2008, 4 (1)
[18]   A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes [J].
Neve, Richard M. ;
Chin, Koei ;
Fridlyand, Jane ;
Yeh, Jennifer ;
Baehner, Frederick L. ;
Fevr, Tea ;
Clark, Laura ;
Bayani, Nora ;
Coppe, Jean-Philippe ;
Tong, Frances ;
Speed, Terry ;
Spellman, Paul T. ;
DeVries, Sandy ;
Lapuk, Anna ;
Wang, Nick J. ;
Kuo, Wen-Lin ;
Stilwell, Jackie L. ;
Pinkel, Daniel ;
Albertson, Donna G. ;
Waldman, Frederic M. ;
McCormick, Frank ;
Dickson, Robert B. ;
Johnson, Michael D. ;
Lippman, Marc ;
Ethier, Stephen ;
Gazdar, Adi ;
Gray, Joe W. .
CANCER CELL, 2006, 10 (06) :515-527
[19]   Smoothly distributed fuzzy c-means:: a new self-organizing map [J].
Pascual-Marqui, RD ;
Pascual-Montano, AD ;
Kochi, K ;
Carazo, JM .
PATTERN RECOGNITION, 2001, 34 (12) :2395-2402
[20]   Microarrays - Guilt by association [J].
Quackenbush, J .
SCIENCE, 2003, 302 (5643) :240-241