Improved identification of concordant and discordant gene expression signatures using an updated rank-rank hypergeometric overlap approach

被引:145
作者
Cahill, Kelly M. [1 ]
Huo, Zhiguang [1 ,2 ]
Tseng, George C. [1 ,3 ]
Logan, Ryan W. [4 ,5 ,6 ]
Seney, Marianne L. [4 ,5 ]
机构
[1] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Biostat, Pittsburgh, PA 15261 USA
[2] Univ Florida, Coll Med, Coll Publ Hlth & Hlth Profess, Dept Biostat, Gainesville, FL USA
[3] Univ Pittsburgh, Sch Med, Dept Computat & Syst Biol, Pittsburgh, PA USA
[4] Univ Pittsburgh, Sch Med, Dept Psychiat, Pittsburgh, PA 15260 USA
[5] Univ Pittsburgh, Sch Med, Translat Neurosci Program, Pittsburgh, PA 15260 USA
[6] Jackson Lab, Ctr Syst Neurogenet Addict, 600 Main St, Bar Harbor, ME 04609 USA
来源
SCIENTIFIC REPORTS | 2018年 / 8卷
关键词
CANCER; FRAMEWORK; REVEALS;
D O I
10.1038/s41598-018-27903-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent advances in large-scale gene expression profiling necessitate concurrent development of biostatistical approaches to reveal meaningful biological relationships. Most analyses rely on significance thresholds for identifying differentially expressed genes. We use an approach to compare gene expression datasets using 'threshold-free' comparisons. Significance cut-offs to identify genes shared between datasets may be too stringent and may miss concordant patterns of gene expression with potential biological relevance. A threshold-free approach gaining popularity in several research areas, including neuroscience, is Rank-Rank Hypergeometric Overlap (RRHO). Genes are ranked by their p-value and effect size direction, and ranked lists are compared to identify significantly overlapping genes across a continuous significance gradient rather than at a single arbitrary cut-off. We have updated the previous RRHO analysis by accurately detecting overlap of genes changed in the same and opposite directions between two datasets. Here, we use simulated and real data to show the drawbacks of the previous algorithm as well as the utility of our new algorithm. For example, we show the power of detecting discordant transcriptional patterns in the postmortem brain of subjects with psychiatric disorders. The new R package, RRHO2, offers a new, more intuitive visualization of concordant and discordant gene overlap.
引用
收藏
页数:11
相关论文
共 13 条
  • [11] HYPOTHESIS SETTING AND ORDER STATISTIC FOR ROBUST GENOMIC META-ANALYSIS
    Song, Chi
    Tseng, George C.
    [J]. ANNALS OF APPLIED STATISTICS, 2014, 8 (02) : 777 - 800
  • [12] A Quantitative Framework to Evaluate Modeling of Cortical Development by Neural Stem Cells
    Stein, Jason L.
    de la Torre-Ubieta, Luis
    Tian, Yuan
    Parikshak, Neelroop N.
    Hernandez, Israel A.
    Marchetto, Maria C.
    Baker, Dylan K.
    Lu, Daning
    Hinman, Cassidy R.
    Lowe, Jennifer K.
    Wexler, Eric M.
    Muotri, Alysson R.
    Gage, Fred H.
    Kosik, Kenneth S.
    Geschwind, Daniel H.
    [J]. NEURON, 2014, 83 (01) : 69 - 86
  • [13] Predictive genomics: A cancer hallmark network framework for predicting tumor clinical phenotypes using genome sequencing data
    Wang, Edwin
    Zaman, Naif
    Mcgee, Shauna
    Milanese, Jean-Sebastien
    Masoudi-Nejad, Ali
    O'Connor-McCourt, Maureen
    [J]. SEMINARS IN CANCER BIOLOGY, 2015, 30 : 4 - 12