Gene-set distance analysis (GSDA): a powerful tool for gene-set association analysis

被引:0
作者
Cao, Xueyuan [1 ]
Pounds, Stan [2 ]
机构
[1] Univ Tennessee, Hlth Sci Ctr, Dept Acute & Tertiary Care, Memphis, TN 38163 USA
[2] St Jude Childrens Res Hosp, Dept Biostat, 332 N Lauderdale St, Memphis, TN 38105 USA
关键词
Gene profiling; Gene set; Distance correlation; ACUTE MYELOID-LEUKEMIA; FALSE DISCOVERY RATE; FUNCTIONAL CATEGORIES; ENRICHMENT ANALYSIS; EXPRESSION; MICROARRAY;
D O I
10.1186/s12859-021-04110-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Identifying sets of related genes (gene sets) that are empirically associated with a treatment or phenotype often yields valuable biological insights. Several methods effectively identify gene sets in which individual genes have simple monotonic relationships with categorical, quantitative, or censored event-time variables. Some distance-based methods, such as distance correlations, may detect complex non-monotone associations of a gene-set with a quantitative variable that elude other methods. However, the distance correlations have yet to be generalized to associate gene-sets with categorical and censored event-time endpoints. Also, there is a need to determine which genes empirically drive the significance of an association of a gene set with an endpoint. Results: We develop gene-set distance analysis (GSDA) by generalizing distance correlations to evaluate the association of a gene set with categorical and censored event-time variables. We also develop a backward elimination procedure to identify a subset of genes that empirically drive significant associations. In simulation studies, GSDA more effectively identified complex non-monotone gene-set associations than did six other published methods. In the analysis of a pediatric acute myeloid leukemia (AML) data set, GSDA was the only method to discover that event-free survival (EFS) was associated with the 56-gene AML pathway gene-set, narrow that result down to 5 genes, and confirm the association of those 5 genes with EFS in a separate validation cohort. These results indicate that GSDA effectively identifies and characterizes complex non-monotonic gene-set associations that are missed by other methods. Conclusion: GSDA is a powerful and flexible method to detect gene-set association with categorical, quantitative, or censored event-time variables, especially to detect complex non-monotonic gene-set associations. Available at https://CRAN.R-project.org/package=GSDA..
引用
收藏
页数:22
相关论文
共 50 条
  • [31] Identification of key transcription factors associated with cerebral ischemia-reperfusion injury based on gene-set enrichment analysis
    Zhang, Ying-Ying
    Wang, Kai
    Liu, Yun-E
    Wang, Wei
    Liu, Ao-Fei
    Zhou, Ji
    Li, Chen
    Zhang, Yi-Qun
    Zhang, Ai-Ping
    Lv, Jin
    Jiang, Wei-Jian
    INTERNATIONAL JOURNAL OF MOLECULAR MEDICINE, 2019, 43 (06) : 2429 - 2439
  • [32] Development and validation of an immune gene-set based prognostic signature for soft tissue sarcoma
    Shen, Rui
    Liu, Bo
    Li, Xuesen
    Yu, Tengbo
    Xu, Kuishuai
    Ma, Jinfeng
    BMC CANCER, 2021, 21 (01)
  • [33] Beyond main effects of gene-sets: harsh parenting moderates the association between a dopamine gene-set and child externalizing behavior
    Windhorst, Dafna A.
    Mileva-Seitz, Viara R.
    Rippe, Ralph C. A.
    Tiemeier, Henning
    Jaddoe, Vincent W. V.
    Verhulst, Frank C.
    van IJzendoorn, Marinus H.
    Bakermans-Kranenburg, Marian J.
    BRAIN AND BEHAVIOR, 2016, 6 (08):
  • [34] Gene-Set Local Hierarchical Clustering (GSLHC)-A Gene Set-Based Approach for Characterizing Bioactive Compounds in Terms of Biological Functional Groups
    Chung, Feng-Hsiang
    Jin, Zhen-Hua
    Hsu, Tzu-Ting
    Hsu, Chueh-Lin
    Liu, Hsueh-Chuan
    Lee, Hoong-Chien
    PLOS ONE, 2015, 10 (10):
  • [35] Identification of Gene-Set Signature in Early-Stage Hepatocellular Carcinoma and Relevant Immune Characteristics
    Zhao, Qijie
    Wongpoomchai, Rawiwan
    Chariyakornkul, Arpamas
    Xiao, Zhangang
    Pilapong, Chalermchai
    FRONTIERS IN ONCOLOGY, 2021, 11
  • [36] Pathway and gene-set activation measurement from mRNA expression data: the tissue distribution of human pathways
    David M Levine
    David R Haynor
    John C Castle
    Sergey B Stepaniants
    Matteo Pellegrini
    Mao Mao
    Jason M Johnson
    Genome Biology, 7
  • [37] MeDiA: Mean Distance Association and Its Applications in Nonlinear Gene Set Analysis
    Peng, Hesen
    Ma, Junjie
    Bai, Yun
    Lu, Jianwei
    Yu, Tianwei
    PLOS ONE, 2015, 10 (04):
  • [38] Integrated gene set analysis for microRNA studies
    Garcia-Garcia, Francisco
    Panadero, Joaquin
    Dopazo, Joaquin
    Montaner, David
    BIOINFORMATICS, 2016, 32 (18) : 2809 - 2816
  • [39] Pairwise common variant meta-analyses of schizophrenia with other psychiatric disorders reveals shared and distinct gene and gene-set associations
    Reay, William R.
    Cairns, Murray J.
    TRANSLATIONAL PSYCHIATRY, 2020, 10 (01)
  • [40] Gene set meta-analysis with Quantitative Set Analysis for Gene Expression (QuSAGE)
    Meng, Hailong
    Yaari, Gur
    Bolen, Christopher R.
    Avey, Stefan
    Kleinstein, Steven H.
    PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (04) : 1 - 10