Ensemble classification for gene expression data based on parallel clustering

被引:1
|
作者
Meng, Jun [1 ]
Jiang, Dingling [1 ]
Zhang, Jing [1 ]
Luan, Yushi [2 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Life Sci & Biotechnol, Dalian, Peoples R China
基金
中国国家自然科学基金;
关键词
ensemble classification; microarray data; MapReduce programming model; parallel information fusion; MICROARRAY; SELECTION; STRESS; CANCER; MODEL;
D O I
10.1504/IJDMB.2018.094779
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Analysis of large-scale gene expression data is a research hotspot in the field of bioinformatics, which can be used to study abnormal phenomenon in plant growth process. This paper proposes a biological knowledge integration method based on parallel clustering to select gene subsets effectively. Gene ontology is utilised to obtain the biological functional similarity, and combined with gene expression data. Parallelised affinity propagation algorithm is used to cluster data since it can not only obtain more biologically meaningful subsets, but also avoid the loss of some potential value in genes from simple gene primary selection. The algorithm is verified with four typical plant datasets and compared with other well-known integration methods. Experimental results on plant stress response datasets demonstrate that the proposed method can select genes with stronger classification ability.
引用
收藏
页码:213 / 229
页数:17
相关论文
共 50 条
  • [21] Attribute clustering for grouping, selection, and classification of gene expression data
    Au, WH
    Chan, KCC
    Wong, AKC
    Wang, Y
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2005, 2 (02) : 83 - 101
  • [22] Projection Based Clustering of Gene Expression Data
    Tasoulis, Sotiris K.
    Plagianakos, Vassilis P.
    Tasoulis, Dimitris K.
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2010, 6160 : 228 - +
  • [23] Cancer Classification Ensemble System Based on Gene Expression Profiles
    Tarek, Sara
    Elwahab, Reda Abd
    Shoman, Mahmoud
    2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,
  • [24] GECC: Gene Expression Based Ensemble Classification of Colon Samples
    Rathore, Saima
    Hussain, Mutawarra
    Khan, Asifullah
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (06) : 1131 - 1145
  • [25] Individual clustering and homogeneous cluster ensemble approaches applied to gene expression data
    Silva, SCM
    de Araujo, DSA
    Paradeda, RB
    Severiano-Sobrinho, VS
    de Souto, MCP
    AI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3809 : 930 - 933
  • [26] Gene-Ontology-based clustering of gene expression data
    Adryan, B
    Schuh, R
    BIOINFORMATICS, 2004, 20 (16) : 2851 - 2852
  • [27] Parallel Point Symmetry Based Clustering for Gene Microarray Data
    Sarkar, Anasua
    Maulik, Ujjwa
    ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 351 - 354
  • [28] A fuzzy approach to clustering and selecting features for classification of gene expression data
    Chitsaz, Elham
    Taheri, Mohammad
    Katebi, Seraj D.
    WORLD CONGRESS ON ENGINEERING 2008, VOLS I-II, 2008, : 1650 - 1655
  • [29] Impact of missing data imputation methods on gene expression clustering and classification
    de Souto, Marcilio C. P.
    Jaskowiak, Pablo A.
    Costa, Ivan G.
    BMC BIOINFORMATICS, 2015, 16
  • [30] Impact of missing data imputation methods on gene expression clustering and classification
    Marcilio CP de Souto
    Pablo A Jaskowiak
    Ivan G Costa
    BMC Bioinformatics, 16