DeepGSEA: explainable deep gene set enrichment analysis for single-cell transcriptomic data

Cited by: 1
Authors
Xiong, Guangzhi [1]
Leroy, Nathan J. [2]
Bekiranov, Stefan [3]
Sheffield, Nathan C. [2]
Zhang, Aidong [1]
Affiliations
[1] Univ Virginia, Dept Comp Sci, 85 Engineers Way, Charlottesville, VA 22904 USA
[2] Univ Virginia, Ctr Publ Hlth Genom, Charlottesville, VA 22904 USA
[3] Univ Virginia, Dept Biochem & Mol Genet, Charlottesville, VA 22908 USA
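Funding
National Institutes of Health (USA); National Science Foundation (USA)
Keywords
REVEALS; WHETHER; CA1
DOI
10.1093/bioinformatics/btae434
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract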
Motivation: Gene set enrichment (GSE) analysis allows for an interpretation of gene expression through pre-defined gene set databases and is a critical step in understanding different phenotypes. With the rapid development of single-cell RNA sequencing (scRNA-seq) technology, GSE analysis can be performed on fine-grained gene expression data to gain a nuanced understanding of phenotypes of interest. However, with the cellular heterogeneity in single-cell gene profiles, current statistical GSE analysis methods sometimes fail to identify enriched gene sets. Meanwhile, deep learning has gained traction in applications like clustering and trajectory inference in single-cell studies due to its prowess in capturing complex data patterns. However, its use in GSE analysis remains limited, due to interpretability challenges.

Results: In this paper, we present DeepGSEA, an explainable deep gene set enrichment analysis approach which leverages the expressiveness of interpretable, prototype-based neural networks to provide an in-depth analysis of GSE. DeepGSEA learns the ability to capture GSE information through our designed classification tasks, and significance tests can be performed on each gene set, enabling the identification of enriched sets. The underlying distribution of a gene set learned by DeepGSEA can be explicitly visualized using the encoded cell and cellular prototype embeddings. We demonstrate the performance of DeepGSEA over commonly used GSE analysis methods by examining their sensitivity and specificity with four simulation studies. In addition, we test our model on three real scRNA-seq datasets and illustrate the interpretability of DeepGSEA by showing how its results can be explained.

Availability and implementation: https://github.com/Teddy-XiongGZ/DeepGSEA
Pages: 10