Assessment of gene set analysis methods based on microarray data

Cited by: 6
Authors
Alavi-Majd, Hamid [1]
Khodakarim, Soheila [2]
Zayeri, Farid [3]
Rezaei-Tavirani, Mostafa [3]
Tabatabaei, Seyyed Mohammad [4]
Heydarpour-Meymeh, Maryam [4]
Affiliations
[1] Shahid Beheshti Univ Med Sci, Fac Paramed Sci, Dept Biostat, Tehran, Iran
[2] Shahid Beheshti Univ Med Sci, Fac Publ Hlth, Dept Epidemiol, Tehran, Iran
[3] Shahid Beheshti Univ Med Sci, Prote Res Ctr, Tehran, Iran
[4] Shahid Beheshti Univ Med Sci, Fac Paramed Sci, Tehran, Iran
Keywords
Gene set; Category; Hotelling's T²; Globaltest; ACUTE LYMPHOBLASTIC-LEUKEMIA; ENRICHMENT ANALYSIS; EXPRESSION DATA; ASSOCIATION; EXPLORATION; BIOLOGY; PURINE; ALPHA; TESTS; CELLS
DOI
10.1016/j.gene.2013.08.063
Chinese Library Classification
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
Gene set analysis (GSA) combines biological knowledge with statistical methods to identify gene sets that are differentially expressed between two or more phenotypes. It offers insight into the functional mechanisms of cells beyond the detection of differentially expressed gene sets. To evaluate GSA approaches, three self-contained methods based on different statistical techniques were chosen (Category, Globaltest, and Hotelling's T²), and their power to identify differential expression was assessed using simulated and real microarray data. Category does not account for the correlation structure among genes, whereas the other two do. The methods were run in R and Bioconductor. Venous thromboembolism and acute lymphoblastic leukemia microarray datasets were analyzed. The results showed that the performance of these methods depends on the distribution of gene expression in a dataset. It is therefore important to examine the distribution of the gene expression data before choosing a GSA method to identify gene sets differentially expressed between phenotypes. In addition, assessment of the genes shared among significant gene sets indicated significant agreement between the GSA results and the findings of biologists. (C) 2013 Elsevier B.V. All rights reserved.
Pages: 383-389
Page count: 7
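As an illustration of the kind of self-contained test the abstract refers to, the following is a minimal Python sketch of a two-sample Hotelling's T² statistic for one gene set, with a sample-label permutation p-value. It is not the authors' implementation (the abstract states that R and Bioconductor were used). All function names are hypothetical, and the permutation reference distribution is an assumption made here to keep the sketch dependency-free. Rows are samples and columns are the genes in the set. The pooled covariance must be invertible, which requires more samples than genes.

```python
import random


def mean_vector(rows):
    """Column-wise mean of a list of equal-length sample rows."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def pooled_covariance(x, y):
    """Pooled within-group covariance matrix (divisor n1 + n2 - 2)."""
    p = len(x[0])
    s = [[0.0] * p for _ in range(p)]
    for rows in (x, y):
        m = mean_vector(rows)
        for r in rows:
            d = [v - mu for v, mu in zip(r, m)]
            for i in range(p):
                for j in range(p):
                    s[i][j] += d[i] * d[j]
    dof = len(x) + len(y) - 2
    return [[v / dof for v in row] for row in s]


def solve(a, b):
    """Solve a z = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [bi] for row, bi in zip(a, b)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(m[r][c]))
        if abs(m[piv][c]) < 1e-12:
            raise ValueError("singular covariance: need more samples than genes")
        m[c], m[piv] = m[piv], m[c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= f * m[c][k]
    z = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(m[r][k] * z[k] for k in range(r + 1, n))
        z[r] = (m[r][n] - tail) / m[r][r]
    return z


def hotelling_t2(x, y):
    """T^2 = n1*n2/(n1+n2) * d' S^-1 d, with d the difference of group means."""
    n1, n2 = len(x), len(y)
    d = [a - b for a, b in zip(mean_vector(x), mean_vector(y))]
    z = solve(pooled_covariance(x, y), d)
    return n1 * n2 / (n1 + n2) * sum(di * zi for di, zi in zip(d, z))


def permutation_p_value(x, y, n_perm=500, seed=0):
    """Share of label permutations whose T^2 reaches the observed value."""
    rng = random.Random(seed)
    observed = hotelling_t2(x, y)
    pooled = x + y  # new list, so shuffling leaves x and y untouched
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        t2 = hotelling_t2(pooled[:len(x)], pooled[len(x):])
        if t2 >= observed - 1e-12:
            hits += 1
    return (hits + 1) / (n_perm + 1)
```

Because the statistic uses the inverse of the pooled covariance matrix, correlations among the genes in the set enter the test directly. With a single gene, T² reduces to the square of the pooled two-sample t statistic.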