A shortcut for multiple testing on the directed acyclic graph of gene ontology

Cited by: 5
Authors
Saunders, Garrett [1 ,3 ]
Stevens, John R. [1 ]
Isom, S. Clay [2 ]
Affiliations
[1] Utah State Univ, Dept Math & Stat, Logan, UT 84322 USA
[2] Utah State Univ, Dept Anim Dairy & Vet Sci, Logan, UT 84322 USA
[3] Brigham Young Univ, Dept Math, Rexburg, ID USA
Source
BMC BIOINFORMATICS | 2014 / Vol. 15
Keywords
Bonferroni; Holm; Gene ontology; Multiple testing; EXPRESSION DATA; RNA-SEQ; MICROARRAY; SETS; TOOL;
DOI
10.1186/s12859-014-0349-3
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Gene set testing has become an important analysis technique in high throughput microarray and next generation sequencing studies for uncovering patterns of differential expression of various biological processes. Often, the large number of gene sets tested simultaneously requires some form of multiplicity correction. This work provides a substantial computational improvement, which we call the Short Focus Level, to an existing familywise error rate controlling multiplicity approach (the Focus Level method) for gene set testing in high throughput microarray and next generation sequencing studies using Gene Ontology graphs. Results: The Short Focus Level procedure, which performs a shortcut of the full Focus Level procedure, is achieved by extending the reach of graphical weighted Bonferroni testing to closed testing situations where restricted hypotheses are present, such as in the Gene Ontology graphs. The Short Focus Level multiplicity adjustment can perform the full top-down approach of the original Focus Level procedure, overcoming a significant disadvantage of the otherwise powerful Focus Level multiplicity adjustment. The computational and power differences of the Short Focus Level procedure as compared to the original Focus Level procedure are demonstrated both through simulation and using real data. Conclusions: The Short Focus Level procedure shows a significant increase in computation speed over the original Focus Level procedure (up to approximately 15,000 times faster). The Short Focus Level should be used in place of the Focus Level procedure whenever the logical assumptions of the Gene Ontology graph structure are appropriate for the study objectives and when either no a priori focus level of interest can be specified or the focus level is selected at a higher level of the graph, where the Focus Level procedure is computationally intractable.
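For readers unfamiliar with the idea of a "shortcut" to closed testing, the keywords Bonferroni and Holm point to the simplest well-known instance: Holm's step-down procedure gives the same decisions as a full closed test with Bonferroni local tests, but without enumerating every intersection hypothesis. The sketch below illustrates only that textbook case on an unstructured list of p-values. It is not the Short Focus Level procedure, it ignores the Gene Ontology graph entirely, and the function names `holm_adjust` and `bonferroni_adjust` are hypothetical, chosen for this illustration.

```python
def bonferroni_adjust(p_values):
    """Single-step Bonferroni adjustment: multiply each p-value by m, cap at 1."""
    m = len(p_values)
    return [min(1.0, m * p) for p in p_values]


def holm_adjust(p_values):
    """Holm step-down adjusted p-values, returned in the input order.

    Illustrative only: this is the classic shortcut for closed testing
    with Bonferroni local tests on an unstructured family of hypotheses,
    not the graph-based procedure described in the abstract.
    """
    m = len(p_values)
    # Indices of the hypotheses, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # The k-th smallest p-value (rank = k - 1) is multiplied by m - k + 1.
        candidate = min(1.0, (m - rank) * p_values[idx])
        # Enforce monotonicity so adjusted values never decrease down the ordering.
        running_max = max(running_max, candidate)
        adjusted[idx] = running_max
    return adjusted


if __name__ == "__main__":
    raw = [0.01, 0.04, 0.03, 0.005]  # made-up p-values for illustration
    print("Bonferroni:", bonferroni_adjust(raw))
    print("Holm:      ", holm_adjust(raw))
```

Holm's adjusted p-values are never larger than the Bonferroni ones, which is why the step-down version is uniformly at least as powerful while still controlling the familywise error rate.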
Pages: 16
Related papers
48 in total
  • [1] A shortcut for multiple testing on the directed acyclic graph of gene ontology
    Garrett Saunders
    John R Stevens
    S Clay Isom
    BMC Bioinformatics, 15
  • [2] Multiple testing on the directed acyclic graph of gene ontology
    Goeman, Jelle J.
    Mansmann, Ulrich
    BIOINFORMATICS, 2008, 24 (04) : 537 - 544
  • [3] A multiple testing method for hypotheses structured in a directed acyclic graph
    Meijer, Rosa J.
    Goeman, Jelle J.
    BIOMETRICAL JOURNAL, 2015, 57 (01) : 123 - 143
  • [4] DAGViz: a directed acyclic graph browser that supports analysis of Gene Ontology annotation
    Yano, Kentaro
    Aoki, Koh
    Suzuki, Hideyuki
    Shibata, Daisuke
    PLANT BIOTECHNOLOGY, 2009, 26 (01) : 9 - 13
  • [5] A Hidden Markov Model Approach to Testing Multiple Hypotheses on a Tree-Transformed Gene Ontology Graph
    Liang, Kun
    Nettleton, Dan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (492) : 1444 - 1454
  • [6] Multiple Testing of Gene Sets from Gene Ontology: Possibilities and Pitfalls
    Meijer, Rosa J.
    Goeman, Jelle J.
    BRIEFINGS IN BIOINFORMATICS, 2016, 17 (05) : 808 - 818
  • [7] Gene Ontology analysis in multiple gene clusters under multiple hypothesis testing framework
    Zhong, Sheng
    Xie, Dan
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 41 (02) : 105 - 115
  • [8] Comparative analysis of gene sets in the gene ontology space under the multiple hypothesis testing framework
    Zhong, S
    Tian, L
    Li, C
    Storch, KF
    Wong, WH
    2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004, : 425 - 435
  • [9] Smoothed nested testing on directed acyclic graphs
    Loper, J. H.
    Lei, L.
    Fithian, W.
    Tansey, W.
    BIOMETRIKA, 2022, 109 (02) : 457 - 471
  • [10] Handling multiple testing while interpreting microarrays with the Gene Ontology Database
    Michael V Osier
    Hongyu Zhao
    Kei-Hoi Cheung
    BMC Bioinformatics, 5