Ensemble classification for gene expression data based on parallel clustering

被引:1
|
作者
Meng, Jun [1 ]
Jiang, Dingling [1 ]
Zhang, Jing [1 ]
Luan, Yushi [2 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Life Sci & Biotechnol, Dalian, Peoples R China
基金
中国国家自然科学基金;
关键词
ensemble classification; microarray data; MapReduce programming model; parallel information fusion; MICROARRAY; SELECTION; STRESS; CANCER; MODEL;
D O I
10.1504/IJDMB.2018.094779
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Analysis of large-scale gene expression data is a research hotspot in the field of bioinformatics, which can be used to study abnormal phenomenon in plant growth process. This paper proposes a biological knowledge integration method based on parallel clustering to select gene subsets effectively. Gene ontology is utilised to obtain the biological functional similarity, and combined with gene expression data. Parallelised affinity propagation algorithm is used to cluster data since it can not only obtain more biologically meaningful subsets, but also avoid the loss of some potential value in genes from simple gene primary selection. The algorithm is verified with four typical plant datasets and compared with other well-known integration methods. Experimental results on plant stress response datasets demonstrate that the proposed method can select genes with stronger classification ability.
引用
收藏
页码:213 / 229
页数:17
相关论文
共 50 条
  • [1] Ensemble biclustering gene expression data based on the spectral clustering
    Yin, Lu
    Liu, Yongguo
    NEURAL COMPUTING & APPLICATIONS, 2018, 30 (08): : 2403 - 2416
  • [2] Ensemble biclustering gene expression data based on the spectral clustering
    Lu Yin
    Yongguo Liu
    Neural Computing and Applications, 2018, 30 : 2403 - 2416
  • [3] Study on Ensemble based Clustering Algorithm for Gene Expression Data
    Chu, Zhenfang
    Cao, Buyang
    Yu, Fang
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2018), 2018, 1069
  • [4] Clustering cancer gene expression data by projective clustering ensemble
    Yu, Xianxue
    Yu, Guoxian
    Wang, Jun
    PLOS ONE, 2017, 12 (02):
  • [5] Ensemble clustering method based on the resampling similarity measure for gene expression data
    Kim, Seo Young
    Lee, Jae Won
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2007, 16 (06) : 539 - 564
  • [6] A clustering and ensemble based classifier for data stream classification
    Wankhade, Kapil K.
    Jondhale, Kalpana C.
    Dongre, Snehlata S.
    APPLIED SOFT COMPUTING, 2021, 102
  • [7] Clustering analysis of microarray gene expression data with new clustering ensemble method
    Luo, Fei
    Liu, Juan
    PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 500 - 504
  • [8] An ensemble approach for phenotype classification based on fuzzy partitioning of gene expression data
    Dragomir, A.
    Maraziotis, I.
    Bezerianos, A.
    2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 1930 - +
  • [9] Spatial clustering based gene selection for gene expression analysis in microarray data classification
    Dhas, P. Edwin
    Lalitha, S.
    Govindaraj, Annalakshmi
    Jyoshna, B.
    AUTOMATIKA, 2024, 65 (01) : 152 - 158
  • [10] Dissimilarity based ensemble of extreme learning machine for gene expression data classification
    Lu, Hui-juan
    An, Chun-lin
    Zheng, En-hui
    Lu, Yi
    NEUROCOMPUTING, 2014, 128 : 22 - 30