A kernel-based clustering method for gene selection with gene expression data

被引:49
作者
Chen, Huihui [1 ]
Zhang, Yusen [1 ]
Gutman, Ivan [2 ]
机构
[1] Shandong Univ Weihai, Sch Math & Stat, Weihai 264209, Peoples R China
[2] Univ Kragujevac, Fac Sci, POB 60, Kragujevac 34000, Serbia
关键词
Gene expression data; Kernel-based clustering; Adaptive distance; Gene selection; Cancer classification; CANCER CLASSIFICATION; PREDICTION; ALGORITHM; DISCOVERY;
D O I
10.1016/j.jbi.2016.05.007
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Gene selection is important for cancer classification based on gene expression data, because of high dimensionality and small sample size. In this paper, we present a new gene selection method based on clustering, in which dissimilarity measures are obtained through kernel functions. It searches for best weights of genes iteratively at the same time to optimize the clustering objective function. Adaptive distance is used in the process, which is suitable to learn the weights of genes during the clustering process, improving the performance of the algorithm. The proposed algorithm is simple and does not require any modification or parameter optimization for each dataset. We tested it on eight publicly available datasets, using two classifiers (support vector machine, k-nearest neighbor), compared with other six competitive feature selectors. The results show that the proposed algorithm is capable of achieving better accuracies and may be an efficient tool for finding possible biomarkers from gene expression data. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:12 / 20
页数:9
相关论文
共 50 条
  • [31] ANFIS-Based Wrapper Model Gene Selection for Cancer Classification on Microarray Gene Expression Data
    Mahmoudi, Sina
    Lahijan, Biyuk Sadeghi
    Kanan, Hamidreza Rashidy
    2013 13TH IRANIAN CONFERENCE ON FUZZY SYSTEMS (IFSC), 2013,
  • [32] Initial points selection for clustering gene expression data: A spatial contiguity analysis-based approach
    Yi, Hui
    Bo, Cuimei
    Song, Xiaofeng
    Yuan, Yuhao
    BIO-MEDICAL MATERIALS AND ENGINEERING, 2014, 24 (06) : 3709 - 3717
  • [33] An Agent-Based Clustering Approach for Gene Selection in Gene Expression Microarray
    Ramos, Juan
    Castellanos-Garzon, Jose A.
    Gonzalez-Briones, Alfonso
    de Paz, Juan F.
    Corchado, Juan M.
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2017, 9 (01) : 1 - 13
  • [34] An Agent-Based Clustering Approach for Gene Selection in Gene Expression Microarray
    Juan Ramos
    José A. Castellanos-Garzón
    Alfonso González-Briones
    Juan F. de Paz
    Juan M. Corchado
    Interdisciplinary Sciences: Computational Life Sciences, 2017, 9 : 1 - 13
  • [35] A Novel Recursive Gene Selection Method Based on Least Square Kernel Extreme Learning Machine
    Ding, Xiaojian
    Yang, Fan
    Zhong, Yaoyi
    Cao, Jie
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (04) : 2026 - 2038
  • [36] An efficient statistical feature selection approach for classification of gene expression data
    Chandra, B.
    Gupta, Manish
    JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (04) : 529 - 535
  • [37] A model for gene selection and classification of gene expression data
    Mohamad M.S.
    Omatu S.
    Deris S.
    Hashim S.Z.M.
    Artificial Life and Robotics, 2007, 11 (2) : 219 - 222
  • [38] A Comprehensive Survey of Recent Hybrid Feature Selection Methods in Cancer Microarray Gene Expression Data
    Almazrua, Halah
    Alshamlan, Hala
    IEEE ACCESS, 2022, 10 : 71427 - 71449
  • [39] FEED: a feature selection method based on gene expression decomposition for single cell clustering
    Zhang, Chao
    Duan, Zhi-Wei
    Xu, Yun-Pei
    Liu, Jin
    Li, Hong-Dong
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [40] Gene selection and classification using non-linear kernel support vector machines based on gene expression data
    Zhang Qizhong
    2007 IEEE/ICME INTERNATIONAL CONFERENCE ON COMPLEX MEDICAL ENGINEERING, VOLS 1-4, 2007, : 1606 - 1611