Assessment of gene set analysis methods based on microarray data

被引:6
|
作者
Alavi-Majd, Hamid [1 ]
Khodakarim, Soheila [2 ]
Zayeri, Farid [3 ]
Rezaei-Tavirani, Mostafa [3 ]
Tabatabaei, Seyyed Mohammad [4 ]
Heydarpour-Meymeh, Maryam [4 ]
机构
[1] Shahid Beheshti Univ Med Sci, Fac Paramed Sci, Dept Biostat, Tehran, Iran
[2] Shahid Beheshti Univ Med Sci, Fac Publ Hlth, Dept Epidemiol, Tehran, Iran
[3] Shahid Beheshti Univ Med Sci, Prote Res Ctr, Tehran, Iran
[4] Shahid Beheshti Univ Med Sci, Fac Paramed Sci, Tehran, Iran
关键词
Gene set; Category; Hotelling's T-2; Globaltest; ACUTE LYMPHOBLASTIC-LEUKEMIA; ENRICHMENT ANALYSIS; EXPRESSION DATA; ASSOCIATION; EXPLORATION; BIOLOGY; PURINE; ALPHA; TESTS; CELLS;
D O I
10.1016/j.gene.2013.08.063
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Gene set analysis (GSA) incorporates biological information into statistical knowledge to identify gene sets differently expressed between two or more phenotypes. It allows us to gain an insight into the functional working mechanism of cells beyond the detection of differently expressed gene sets. In order to evaluate the competence of GSA approaches, three self-contained GSA approaches with different statistical methods were chosen; Category, Globaltest and Hotelling's T-2 together with their assayed power to identify the differences expressed via simulation and real microarray data. The Category does not take care of the correlation structure, while the other two deal with correlations. In order to perform these methods, Rand Bioconductor were used. Furthermore, venous thromboembolism and acute lymphoblastic leukemia microarray data were applied. The results of three GSAs showed that the competence of these methods depends on the distribution of gene expression in a dataset It is very important to assay the distribution of gene expression data before choosing the GSA method to identify gene sets differently expressed between phenotypes. On the other hand, assessment of common genes among significant gene sets indicated that there was a significant agreement between the result of GSA and the findings of biologists. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:383 / 389
页数:7
相关论文
共 50 条
  • [1] GSVA: gene set variation analysis for microarray and RNA-Seq data
    Haenzelmann, Sonja
    Castelo, Robert
    Guinney, Justin
    BMC BIOINFORMATICS, 2013, 14
  • [2] Investigating the effect of paralogs on microarray gene-set analysis
    Faure, Andre J.
    Seoighe, Cathal
    Mulder, Nicola J.
    BMC BIOINFORMATICS, 2011, 12
  • [3] Gene set analysis methods: statistical models and methodological differences
    Maciejewski, Henryk
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (04) : 504 - 518
  • [4] Enriching the gene set analysis of genome-wide data by incorporating directionality of gene expression and combining statistical hypotheses and methods
    Varemo, Leif
    Nielsen, Jens
    Nookaew, Intawat
    NUCLEIC ACIDS RESEARCH, 2013, 41 (08) : 4378 - 4391
  • [5] Time-Course Gene Set Analysis for Longitudinal Gene Expression Data
    Hejblum, Boris P.
    Skinner, Jason
    Thiebaut, Rodolphe
    PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (06)
  • [6] Gene-set activity toolbox (GAT): A platform for microarray-based cancer diagnosis using an integrative gene-set analysis approach
    Engchuan, Worrawat
    Meechai, Asawin
    Tongsima, Sissades
    Doungpan, Narumol
    Chan, Jonathan H.
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2016, 14 (04)
  • [7] Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
    Mi Hyeon Kim
    Hwa Jeong Seo
    Je-Gun Joung
    Ju Han Kim
    BMC Bioinformatics, 12
  • [8] Discriminant Analysis Methods for Microarray Data Classification
    Chen, Chuanliang
    Gong, Yun-Chao
    Bie, Rongfang
    AI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5360 : 268 - +
  • [9] Gene set analysis methods for the functional interpretation of non-mRNA data-Genomic range and ncRNA data
    Mora, Antonio
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1495 - 1508
  • [10] GSAASeqSP: A Toolset for Gene Set Association Analysis of RNA-Seq Data
    Xiong, Qing
    Mukherjee, Sayan
    Furey, Terrence S.
    SCIENTIFIC REPORTS, 2014, 4