Assessment of gene set analysis methods based on microarray data

Cited by: 6
Authors
Alavi-Majd, Hamid [1]
Khodakarim, Soheila [2]
Zayeri, Farid [3]
Rezaei-Tavirani, Mostafa [3]
Tabatabaei, Seyyed Mohammad [4]
Heydarpour-Meymeh, Maryam [4]
Affiliations
[1] Shahid Beheshti Univ Med Sci, Fac Paramed Sci, Dept Biostat, Tehran, Iran
[2] Shahid Beheshti Univ Med Sci, Fac Publ Hlth, Dept Epidemiol, Tehran, Iran
[3] Shahid Beheshti Univ Med Sci, Prote Res Ctr, Tehran, Iran
[4] Shahid Beheshti Univ Med Sci, Fac Paramed Sci, Tehran, Iran
Keywords
Gene set; Category; Hotelling's T²; Globaltest; ACUTE LYMPHOBLASTIC-LEUKEMIA; ENRICHMENT ANALYSIS; EXPRESSION DATA; ASSOCIATION; EXPLORATION; BIOLOGY; PURINE; ALPHA; TESTS; CELLS
DOI
10.1016/j.gene.2013.08.063
Chinese Library Classification number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
Gene set analysis (GSA) incorporates biological information into statistical knowledge to identify gene sets differentially expressed between two or more phenotypes. It offers insight into the functional working mechanisms of cells beyond the detection of differentially expressed gene sets. To evaluate the competence of GSA approaches, three self-contained GSA approaches based on different statistical methods were chosen: Category, Globaltest and Hotelling's T². Their power to identify differential expression was assessed using simulated and real microarray data. Category does not account for the correlation structure, while the other two methods do. The methods were run in R and Bioconductor. The real microarray data came from venous thromboembolism and acute lymphoblastic leukemia studies. The results of the three GSA methods showed that their competence depends on the distribution of gene expression in a dataset. It is therefore very important to examine the distribution of gene expression data before choosing a GSA method to identify gene sets differentially expressed between phenotypes. In addition, assessment of the genes shared among significant gene sets indicated significant agreement between the GSA results and the findings of biologists. (C) 2013 Elsevier B.V. All rights reserved.
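The abstract names Hotelling's T² as one of the correlation-aware tests but gives no formula. As a rough illustration only, the textbook two-sample form of the statistic for a single gene set can be sketched in Python with NumPy. This is not the authors' R/Bioconductor code; the function name `hotelling_t2` and the samples-by-genes input layout are assumptions made for this sketch, and it presumes more samples than genes in the set so that the pooled covariance matrix is invertible.

```python
import numpy as np


def hotelling_t2(x, y):
    """Two-sample Hotelling's T^2 for one gene set (illustrative sketch).

    x, y: arrays of shape (samples, genes), one per phenotype.
    Returns (t2, f_stat, df1, df2); under multivariate normality with a
    common covariance, f_stat follows an F(df1, df2) distribution.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n1, p = x.shape
    n2 = y.shape[0]
    if y.shape[1] != p:
        raise ValueError("both groups must measure the same genes")
    df2 = n1 + n2 - p - 1
    if df2 <= 0:
        raise ValueError("need more samples than genes in the set")

    # Difference between the per-gene mean expression of the two groups.
    diff = x.mean(axis=0) - y.mean(axis=0)

    # Pooled covariance: this is where gene-gene correlation enters.
    s_pooled = (
        (n1 - 1) * np.atleast_2d(np.cov(x, rowvar=False))
        + (n2 - 1) * np.atleast_2d(np.cov(y, rowvar=False))
    ) / (n1 + n2 - 2)

    # T^2 = n1*n2/(n1+n2) * diff' S^-1 diff, via a linear solve.
    t2 = (n1 * n2) / (n1 + n2) * diff @ np.linalg.solve(s_pooled, diff)

    # Rescale T^2 to an F statistic with (p, n1+n2-p-1) degrees of freedom.
    f_stat = t2 * df2 / (p * (n1 + n2 - 2))
    return float(t2), float(f_stat), p, df2
```

With a single gene, the statistic reduces to the square of the pooled two-sample t statistic, which gives a quick sanity check. A p-value would come from the F distribution (for example through a statistics library) or from permuting phenotype labels; neither is shown here.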
Pages: 383-389
Number of pages: 7
Related papers
50 in total
  • [11] A Unified Mixed Effects Model for Gene Set Analysis of Time Course Microarray Experiments
    Wang, Lily
    Chen, Xi
    Wolfinger, Russell D.
    Franklin, Jeffrey L.
    Coffey, Robert J.
    Zhang, Bing
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2009, 8 (01)
  • [12] Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods
    Fridley, Brooke L.
    Jenkins, Gregory D.
    Biernacka, Joanna M.
    PLOS ONE, 2010, 5 (09) : 1 - 9
  • [13] Impact of DNA microarray data transformation on gene expression analysis - comparison of two normalization methods
    Schmidt, Marcin T.
    Handschuh, Luiza
    Zyprych, Joanna
    Szabelska, Alicja
    Olejnik-Schmidt, Agnieszka K.
    Siatkowski, Idzi
    Figlerowicz, Marek
    ACTA BIOCHIMICA POLONICA, 2011, 58 (04) : 573 - 580
  • [14] GOing Bayesian: model-based gene set analysis of genome-scale data
    Bauer, Sebastian
    Gagneur, Julien
    Robinson, Peter N.
    NUCLEIC ACIDS RESEARCH, 2010, 38 (11) : 3523 - 3532
  • [15] Gene Set Based Integrated Data Analysis Reveals Phenotypic Differences in a Brain Cancer Model
    Petersen, Kjell
    Rajcevic, Uros
    Rahim, Siti Aminah Abdul
    Jonassen, Inge
    Kalland, Karl-Henning
    Jimenez, Connie R.
    Bjerkvig, Rolf
    Niclou, Simone P.
    PLOS ONE, 2013, 8 (07)
  • [16] Detection of Differentially Expressed Gene Sets in a Partially Paired Microarray Data Set
    Lim, Johan
    Kim, Jayoun
    Kim, Sang-cheol
    Yu, Donghyeon
    Kim, Kyunga
    Kim, Byung Soo
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2012, 11 (03)
  • [17] Gene set analysis methods: a systematic comparison
    Mathur, Ravi
    Rotroff, Daniel
    Ma, Jun
    Shojaie, Ali
    Motsinger-Reif, Alison
    BIODATA MINING, 2018, 11
  • [18] Framework for knowledge-based integrative analysis of microarray data
    Shi, Jiantao
    Wang, Kankan
    Zhang, Ji
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 56 - +
  • [19] Rough set based maximum relevance-maximum significance criterion and gene selection from microarray data
    Maji, Pradipta
    Paul, Sushmita
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2011, 52 (03) : 408 - 426
  • [20] Effect of the absolute statistic on gene-sampling gene-set analysis methods
    Nam, Dougu
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (03) : 1248 - 1260