Ensemble classification for gene expression data based on parallel clustering

被引:1
|
作者
Meng, Jun [1 ]
Jiang, Dingling [1 ]
Zhang, Jing [1 ]
Luan, Yushi [2 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Life Sci & Biotechnol, Dalian, Peoples R China
基金
中国国家自然科学基金;
关键词
ensemble classification; microarray data; MapReduce programming model; parallel information fusion; MICROARRAY; SELECTION; STRESS; CANCER; MODEL;
D O I
10.1504/IJDMB.2018.094779
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Analysis of large-scale gene expression data is a research hotspot in the field of bioinformatics, which can be used to study abnormal phenomenon in plant growth process. This paper proposes a biological knowledge integration method based on parallel clustering to select gene subsets effectively. Gene ontology is utilised to obtain the biological functional similarity, and combined with gene expression data. Parallelised affinity propagation algorithm is used to cluster data since it can not only obtain more biologically meaningful subsets, but also avoid the loss of some potential value in genes from simple gene primary selection. The algorithm is verified with four typical plant datasets and compared with other well-known integration methods. Experimental results on plant stress response datasets demonstrate that the proposed method can select genes with stronger classification ability.
引用
收藏
页码:213 / 229
页数:17
相关论文
共 50 条
  • [31] Chronological Pattern Exploration Algorithm for Gene Expression Data Clustering and Classification
    L. Sharmila
    U. Sakthi
    Wireless Personal Communications, 2018, 102 : 1503 - 1519
  • [32] Chronological Pattern Exploration Algorithm for Gene Expression Data Clustering and Classification
    Sharmila, L.
    Sakthi, U.
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 102 (02) : 1503 - 1519
  • [33] A novel combined ica and clustering technique for the classification of gene expression data
    Kapoor, A
    Bowles, T
    Chambers, J
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 621 - 624
  • [34] Fuzzy clustering-based discretization for gene expression classification
    Keivan Kianmehr
    Mohammed Alshalalfa
    Reda Alhajj
    Knowledge and Information Systems, 2010, 24 : 441 - 465
  • [35] Fuzzy clustering-based discretization for gene expression classification
    Kianmehr, Keivan
    Alshalalfa, Mohammed
    Alhajj, Reda
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 24 (03) : 441 - 465
  • [36] Fuzzy Rule Based Clustering for Gene Expression Data
    Sinaee, Mehrnoosh
    Mansoori, Eghbal G.
    FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION (ISMS 2013), 2013, : 7 - 11
  • [37] Clustering of Gene Expression Data Based on Shape Similarity
    Hestilow, Travis J.
    Huang, Yufei
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2009, (01)
  • [38] DYNAMIC CORE BASED CLUSTERING OF GENE EXPRESSION DATA
    Bocicor, Maria-Iuliana
    Sirbu, Adela
    Czibula, Gabriela
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (03): : 1051 - 1069
  • [39] Dynamic core based clustering of gene expression data
    1600, ICIC International (10):
  • [40] Ensemble dependence model for classification and prediction of cancer and normal gene expression data
    Qiu, P
    Wang, ZJ
    Liu, KJR
    BIOINFORMATICS, 2005, 21 (14) : 3114 - 3121