Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data

Cited by: 50
Authors
Wang, JB [1 ]
Bo, TH
Jonassen, I
Myklebost, O
Hovig, E
Affiliations
[1] Norwegian Radium Hosp, Dept Tumor Biol, N-0310 Oslo, Norway
[2] Univ Bergen, HIB, Dept Informat, N-5020 Bergen, Norway
[3] Univ Bergen, Bergen Ctr Computat Sci, Computat Biol Unit, N-5020 Bergen, Norway
[4] Univ Oslo, Dept Mol Biosci, N-0316 Oslo, Norway
Keywords
DOI
10.1186/1471-2105-4-60
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Using DNA microarrays, we have developed two novel models for tumor classification and target gene prediction. First, gene expression profiles are summarized by optimally selected Self-Organizing Maps (SOMs), followed by tumor sample classification by Fuzzy C-means clustering. Then, the prediction of marker genes is accomplished by either manual feature selection (visualizing the weighted/mean SOM component plane) or automatic feature selection (by pair-wise Fisher's linear discriminant). Results: The proposed models were tested on four published datasets: (1) leukemia, (2) colon cancer, (3) brain tumors, and (4) NCI cancer cell lines. The models gave class predictions with markedly reduced error rates compared to other class prediction approaches, and the importance of feature selection in microarray data analysis was also emphasized. Conclusions: Our models identify marker genes with predictive potential, often better than other available methods in the literature. The models are potentially useful for medical diagnostics and may reveal some insights into cancer classification. Additionally, we illustrated two limitations in tumor classification from microarray data related to the biology underlying the data, in terms of (1) the class size of the data, and (2) the internal structure of classes. These limitations are not specific to the classification models used.
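The abstract names two generic building blocks, fuzzy c-means clustering and a pair-wise Fisher criterion for feature ranking. The following is a minimal sketch of the textbook versions of both, not the authors' implementation: the SOM summarization step is omitted, the data are synthetic, and all names (`fuzzy_c_means`, `fisher_score`, the fuzzifier `m=2.0`) are assumptions chosen for illustration.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=100, tol=1e-5, seed=0):
    """Textbook fuzzy c-means. X has shape (n_samples, n_features).

    Returns (centers, U), where U[i, k] is the membership of sample i
    in cluster k and each row of U sums to 1.
    """
    rng = np.random.default_rng(seed)
    # Random initial memberships, normalized per sample.
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** m
        # Cluster centers are membership-weighted means of the samples.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # Euclidean distance from every sample to every center.
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)  # guard against division by zero
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U


def fisher_score(X, y, a, b):
    """Per-feature Fisher criterion between classes a and b:
    squared difference of class means over the sum of class variances."""
    Xa, Xb = X[y == a], X[y == b]
    num = (Xa.mean(axis=0) - Xb.mean(axis=0)) ** 2
    den = Xa.var(axis=0) + Xb.var(axis=0) + 1e-12
    return num / den


# Synthetic stand-in for expression data (assumption): 40 samples,
# 5 features, with only feature 0 separating the two groups.
rng = np.random.default_rng(1)
X = rng.normal(0.0, 1.0, size=(40, 5))
X[20:, 0] += 8.0
y = np.repeat([0, 1], 20)

centers, U = fuzzy_c_means(X, n_clusters=2)
labels = U.argmax(axis=1)            # harden the fuzzy memberships
scores = fisher_score(X, y, 0, 1)    # rank features for the class pair
top_feature = int(scores.argmax())
```

In this sketch the cluster indices returned by `fuzzy_c_means` are arbitrary, so any comparison with known classes has to allow for label permutation. With more than two classes, `fisher_score` would be evaluated once per class pair, which is one plausible reading of "pair-wise" in the abstract.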
Pages: 12
Related papers
50 in total
  • [31] Automatic nasal tumor detection by grey prediction and Fuzzy C-means clustering
    Huang Wenchen
    Chang Chunpin
    JOURNAL OF GREY SYSTEM, 2008, 20 (03): : 205 - 218
  • [32] Classification via Deep Fuzzy c-Means Clustering
    Yeganejou, Mojtaba
    Dick, Scott
    2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [33] Automatic nasal tumor detection by grey prediction and fuzzy C-means clustering
    Wen-Chen Huang
    Chun-Pin Chang
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 542 - +
  • [34] Optimization of Fuzzy C-Means Algorithm Using Feature Selection Strategies
    Maheshwari, Kanika
    Sharma, Vivek
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 368 - 379
  • [35] Modified fuzzy C-means algorithm for feature selection
    Frosini, Graziano
    Lazzerini, Beatrice
    Marcelloni, Francesco
    Annual Conference of the North American Fuzzy Information Processing Society - NAFIPS, 2000, : 148 - 152
  • [36] A modified fuzzy C-means algorithm for feature selection
    Frosini, G
    Lazzerini, B
    Marcelloni, F
    PEACHFUZZ 2000 : 19TH INTERNATIONAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 2000, : 148 - 152
  • [37] Microarray Filtering-Based Fuzzy C-Means Clustering and Classification in Genomic Signal Processing
    Purnendu Mishra
    Nilamani Bhoi
    Arabian Journal for Science and Engineering, 2019, 44 : 9381 - 9395
  • [38] Microarray Filtering-Based Fuzzy C-Means Clustering and Classification in Genomic Signal Processing
    Mishra, Purnendu
    Bhoi, Nilamani
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) : 9381 - 9395
  • [39] Clustering of COVID-19 data for knowledge discovery using c-means and fuzzy c-means
    Afzal, Asif
    Ansari, Zahid
    Alshahrani, Saad
    Raj, Arun K.
    Kuruniyan, Mohamed Saheer
    Saleel, C. Ahamed
    Nisar, Kottakkaran Sooppy
    RESULTS IN PHYSICS, 2021, 29
  • [40] Prediction of Depth of Seawater Using Fuzzy C-Means Clustering Algorithm of Crowdsourced SONAR Data
    Kamolov, Ahmadhon Akbarkhonovich
    Park, Suhyun
    SUSTAINABILITY, 2021, 13 (11)