Identifying Genes Relevant to Specific Biological Conditions in Time Course Microarray Experiments

被引:0
|
作者
Singh, Nitesh Kumar [1 ]
Repsilber, Dirk [2 ]
Liebscher, Volkmar [3 ]
Taher, Leila [1 ]
Fuellen, Georg [1 ]
机构
[1] Univ Rostock, Dept Med, Inst Biostat & Informat Med & Ageing Res, D-18055 Rostock, Germany
[2] Leibniz Inst Farm Anim Biol, Inst Genet & Biometry, Dummerstorf, Germany
[3] Ernst Moritz Arndt Univ Greifswald, Inst Math & Informat, Greifswald, Germany
来源
PLOS ONE | 2013年 / 8卷 / 10期
关键词
EXPRESSION; CLASSIFICATION; ASSOCIATIONS; PREDICTION; SELECTION; NETWORKS; PROTEINS; DATABASE; MODEL;
D O I
10.1371/journal.pone.0076561
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microarrays have been useful in understanding various biological processes by allowing the simultaneous study of the expression of thousands of genes. However, the analysis of microarray data is a challenging task. One of the key problems in microarray analysis is the classification of unknown expression profiles. Specifically, the often large number of non-informative genes on the microarray adversely affects the performance and efficiency of classification algorithms. Furthermore, the skewed ratio of sample to variable poses a risk of overfitting. Thus, in this context, feature selection methods become crucial to select relevant genes and, hence, improve classification accuracy. In this study, we investigated feature selection methods based on gene expression profiles and protein interactions. We found that in our setup, the addition of protein interaction information did not contribute to any significant improvement of the classification results. Furthermore, we developed a novel feature selection method that relies exclusively on observed gene expression changes in microarray experiments, which we call "relative Signal-to-Noise ratio" (rSNR). More precisely, the rSNR ranks genes based on their specificity to an experimental condition, by comparing intrinsic variation, i.e. variation in gene expression within an experimental condition, with extrinsic variation, i.e. variation in gene expression across experimental conditions. Genes with low variation within an experimental condition of interest and high variation across experimental conditions are ranked higher, and help in improving classification accuracy. We compared different feature selection methods on two time-series microarray datasets and one static microarray dataset. We found that the rSNR performed generally better than the other methods.
引用
收藏
页数:12
相关论文
共 12 条
  • [1] Microarray Time Course Experiments: Finding Profiles
    Irigoien, Itziar
    Vives, Sergi
    Arenas, Concepcion
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (02) : 464 - 475
  • [2] Gene profiling for determining pluripotent genes in a time course microarray experiment
    Tuke, J.
    Glonek, G. F. V.
    Solomon, P. J.
    BIOSTATISTICS, 2009, 10 (01) : 80 - 93
  • [3] Retrieving relevant time-course experiments: a study on Arabidopsis microarrays
    Sener, Duygu Dede
    Ogul, Hasan
    IET SYSTEMS BIOLOGY, 2016, 10 (03) : 87 - 93
  • [4] A Novel Approach for Identifying Relevant Genes for Breast Cancer Survivability on Specific Therapies
    Abou Tabl, Ashraf
    Alkhateeb, Abedalrhman
    Huy Quang Pham
    Rueda, Luis
    ElMaraghy, Waguih
    Ngom, Alioune
    EVOLUTIONARY BIOINFORMATICS, 2018, 14
  • [5] Systems Approach to Identifying Relevant Pathways from Phenotype Information in Dose-Dependent Time Series Microarray Data
    Dymacek, Julian
    Guo, Nancy Lan
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 290 - 293
  • [6] An integrated approach to identifying sex-specific genes, transcription factors, and pathways relevant to Alzheimer's disease
    Lopez-Cerdan, Adolfo
    Andreu, Zoraida
    Hidalgo, Marta R.
    Soler-Saez, Irene
    de la Iglesia-Vaya, Maria
    Mikozami, Akiko
    Guerini, Franca R.
    Garcia-Garcia, Francisco
    NEUROBIOLOGY OF DISEASE, 2024, 199
  • [7] An Order Estimation Based Approach to Identify Response Genes for Microarray Time Course Data
    Lu, Zhiheng K.
    Allen, O. Brian
    Desmond, Anthony F.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2012, 11 (06)
  • [8] Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments
    Liu, Tianqing
    Lin, Nan
    Shi, Ningzhong
    Zhang, Baoxue
    BMC BIOINFORMATICS, 2009, 10
  • [9] Assessing co-regulation of directly linked genes in biological networks using microarray time series analysis
    Del Sorbo, Maria Rosaria
    Balzano, Walter
    Donato, Michele
    Draghici, Sorin
    BIOSYSTEMS, 2013, 114 (02) : 149 - 154
  • [10] A time-course microarray data analysis reveals consistent dysregulated genes and upstream microRNAs in autoantibody-mediated arthritis
    Wang, Xinwen
    Bai, Jie
    Jia, Zhen
    Zhu, Yangjun
    Liu, Jijun
    Zhang, Kun
    Hao, Dingjun
    Heng, Lisong
    JOURNAL OF ORTHOPAEDIC SURGERY AND RESEARCH, 2017, 12