scGREAT: Transformer-based deep-language model for gene regulatory network inference from single-cell transcriptomics

被引:4
|
作者
Wang, Yuchen [1 ]
Chen, Xingjian [1 ,2 ]
Zheng, Zetian [1 ]
Huang, Lei [1 ]
Xie, Weidun [1 ]
Wang, Fuzhou [1 ]
Zhang, Zhaolei [4 ,5 ]
Wong, Ka -Chun [1 ,3 ,6 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon Tong, Hong Kong, Peoples R China
[2] Massachusetts Gen Hosp, Cutaneous Biol Res Ctr, Harvard Med Sch, Boston, MA USA
[3] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen, Peoples R China
[4] Univ Toronto, Dept Mol Genet, Toronto, ON, Canada
[5] Univ Toronto, Donnelly Ctr Cellular & Biomol Res, Toronto, ON, Canada
[6] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
基金
中国国家自然科学基金;
关键词
external validation; EXPRESSION; STAT3;
D O I
10.1016/j.isci.2024.109352
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene regulatory networks (GRNs) involve complex and multi -layer regulatory interactions between regulators and their target genes. Precise knowledge of GRNs is important in understanding cellular processes and molecular functions. Recent breakthroughs in single -cell sequencing technology made it possible to infer GRNs at single -cell level. Existing methods, however, are limited by expensive computations, and sometimes simplistic assumptions. To overcome these obstacles, we propose scGREAT, a framework to infer GRN using gene embeddings and transformer from single -cell transcriptomics. scGREAT starts by constructing gene expression and gene biotext dictionaries from scRNA-seq data and gene text information. The representation of TF gene pairs is learned through optimizing embedding space by transformer -based engine. Results illustrated scGREAT outperformed other contemporary methods on benchmarks. Besides, gene representations from scGREAT provide valuable gene regulation insights, and external validation on spatial transcriptomics illuminated the mechanism behind scGREAT annotation. Moreover, scGREAT identified several TF target regulations corroborated in studies.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Single-Cell Regulatory Network Inference and Clustering Identifies Cell-Type Specific Expression Pattern of Transcription Factors in Mouse Sciatic Nerve
    Li, Mingchao
    Min, Qing
    Banton, Matthew C.
    Dun, Xinpeng
    FRONTIERS IN CELLULAR NEUROSCIENCE, 2021, 15
  • [42] SIGNET: single-cell RNA-seq-based gene regulatory network prediction using multiple-layer perceptron bagging
    Luo, Qinhuan
    Yu, Yongzhen
    Lan, Xun
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [43] Advanced methods for gene network identification and noise decomposition from single-cell data
    Fang, Zhou
    Gupta, Ankit
    Kumar, Sant
    Khammash, Mustafa
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [44] A review on gene regulatory network reconstruction algorithms based on single cell RNA sequencing
    Kim, Hyeonkyu
    Choi, Hwisoo
    Lee, Daewon
    Kim, Junil
    GENES & GENOMICS, 2024, 46 (01) : 121 - 133
  • [45] Dynamical Systems Model of RNA Velocity Improves Inference of Single-cell Trajectory, Pseudo-time and Gene Regulation
    Liu, Ruishan
    Pisco, Angela Oliveira
    Braun, Emelie
    Linnarsson, Sten
    Zou, James
    JOURNAL OF MOLECULAR BIOLOGY, 2022, 434 (15)
  • [46] Identifying progressive gene network perturbation from single-cell RNA-seq data
    Mukherjee, Sumit
    Carignano, Alberto
    Seelig, Georg
    Lee, Su-In
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 5034 - 5040
  • [47] Gene set inference from single-cell sequencing data using a hybrid of matrix factorization and variational autoencoders
    Lukassen, Soeren
    Ten, Foo Wei
    Adam, Lukas
    Eils, Roland
    Conrad, Christian
    NATURE MACHINE INTELLIGENCE, 2020, 2 (12) : 800 - 809
  • [48] The Role of Immunocyte Infiltration Regulatory Network Based on hdWGCNA and Single-Cell Bioinformatics Analysis in Intervertebral Disc Degeneration
    Shao, Tuo
    Gao, Qichang
    Tang, Weilong
    Ma, Yiming
    Gu, Jiaao
    Yu, Zhange
    INFLAMMATION, 2024, 47 (06) : 1987 - 1999
  • [49] From Noise to Knowledge: Diffusion Probabilistic Model-Based Neural Inference of Gene Regulatory Networks
    Zhu, Hao
    Slonim, Donna
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2024, 31 (11) : 1087 - 1103
  • [50] Gene network inference from single-cell omics data and domain knowledge for constructing COVID-19-specific ICAM1-associated pathways
    Odaka, Mitsuhiro
    Magnin, Morgan
    Inoue, Katsumi
    FRONTIERS IN GENETICS, 2023, 14