Data mining for feature selection in gene expression autism data

被引:43
|
作者
Latkowski, Tomasz [1 ]
Osowski, Stanislaw [1 ,2 ]
机构
[1] Mil Univ Technol, Fac Elect, PL-00908 Warsaw, Poland
[2] Warsaw Univ Technol, Fac Elect Engn, Warsaw, Poland
关键词
Gene expression microarrays; Feature selection; Clustering; Classification; Autism; CLASSIFICATION; CANCER;
D O I
10.1016/j.eswa.2014.08.043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper presents application of data mining methods for recognizing the most significant genes and gene sequences (treated as features) stored in a dataset of gene expression microarray. The investigations are performed for autism data. Few chosen methods of feature selection have been applied and their results integrated in the final outcome. In this way we find the contents of small set of the most important genes associated with autism. They have been applied in the classification procedure aimed on recognition of autism from reference group members. The results of numerical experiments concerning selection of the most important genes and classification of the cases on the basis of the selected genes will be discussed. The main contribution of the paper is in developing the fusion system of the results of many selection approaches into the final set, most closely associated with autism. We have also proposed special procedure of estimating the number of highest rank genes used in classification procedure. 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:864 / 872
页数:9
相关论文
共 50 条
  • [1] Feature selection and gene clustering from gene expression data
    Mitra, P
    Majumder, DD
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 343 - 346
  • [2] Informative Feature Clustering and Selection for Gene Expression Data
    Yang, Yuqi
    Yin, Pengshuai
    Luo, Zhihang
    Gu, Wenwen
    Chen, Renjie
    Wu, Qingyao
    IEEE ACCESS, 2019, 7 : 169174 - 169184
  • [3] Efficient Feature Selection Model for Gene Expression Data
    Saengsiri, Patharawut
    Wichian, Sageemas Na
    Meesad, Phayung
    MECHANICAL AND AEROSPACE ENGINEERING, PTS 1-7, 2012, 110-116 : 1948 - +
  • [4] A Review on Feature Selection Techniques for Gene Expression Data
    Vanjimalar, S.
    Ramyachitra, D.
    Manikandan, P.
    2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC 2018), 2018, : 26 - 29
  • [5] The painter's feature selection for gene expression data
    Apiletti, Daniele
    Baralis, Elena
    Bruno, Giulia
    Fiori, Alessandro
    2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 4227 - 4230
  • [6] Feature Selection and Classification in gene expression cancer data
    Pavithra, D.
    Lakshmanan, B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [7] An Efficient Feature Selection Technique for Gene Expression Data
    Chandra, B.
    2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2018, : 132 - 137
  • [8] Evolutionary rough feature selection in gene expression data
    Banerjee, Mohua
    Mitra, Sushmita
    Banka, Haider
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2007, 37 (04): : 622 - 632
  • [9] Feature Selection and Extraction in Data mining
    Aparna, U. R.
    Paul, Shaiju
    PROCEEDINGS OF 2016 ONLINE INTERNATIONAL CONFERENCE ON GREEN ENGINEERING AND TECHNOLOGIES (IC-GET), 2016,
  • [10] DATA MINING METHODS FOR GENE SELECTION ON THE BASIS OF GENE EXPRESSION ARRAYS
    Muszynski, Michal
    Osowski, Stanislaw
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2014, 24 (03) : 657 - 668