Toward universal cell embeddings: integrating single-cell RNA-seq datasets across species with SATURN

被引:17
|
作者
Rosen, Yanay [1 ]
Brbic, Maria [2 ]
Roohani, Yusuf [3 ]
Swanson, Kyle [1 ]
Li, Ziang [4 ]
Leskovec, Jure [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Swiss Fed Inst Technol EPFL, Sch Comp & Commun Sci, Lausanne, Switzerland
[3] Stanford Univ, Dept Biomed Data Sci, Stanford, CA USA
[4] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
美国国家卫生研究院;
关键词
RESONANCE MASS-SPECTROMETRY; BRAIN; THROUGHPUT; ATLAS; METABOLITES; MICROSCOPY; SIGNALS;
D O I
10.1038/s41592-024-02191-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Analysis of single-cell datasets generated from diverse organisms offers unprecedented opportunities to unravel fundamental evolutionary processes of conservation and diversification of cell types. However, interspecies genomic differences limit the joint analysis of cross-species datasets to homologous genes. Here we present SATURN, a deep learning method for learning universal cell embeddings that encodes genes' biological properties using protein language models. By coupling protein embeddings from language models with RNA expression, SATURN integrates datasets profiled from different species regardless of their genomic similarity. SATURN can detect functionally related genes coexpressed across species, redefining differential expression for cross-species analysis. Applying SATURN to three species whole-organism atlases and frog and zebrafish embryogenesis datasets, we show that SATURN can effectively transfer annotations across species, even when they are evolutionarily remote. We also demonstrate that SATURN can be used to find potentially divergent gene functions between glaucoma-associated genes in humans and four other species. SATURN performs cross-species integration and analysis using both single-cell gene expression and protein representations generated by protein language models.
引用
收藏
页码:1492 / 1500
页数:29
相关论文
共 50 条
  • [41] Single-cell RNA-seq keeps cells alive
    Kotsiliti, Eleni
    NATURE BIOTECHNOLOGY, 2022, 40 (10) : 1432 - 1432
  • [42] Submodular sketches of single-cell RNA-seq measurements
    Yang, Wei
    Bilmes, Jeffrey
    Noble, William Stafford
    ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [43] Single-cell RNA-seq to decipher tumour architecture
    Cloney, Ross
    NATURE REVIEWS GENETICS, 2017, 18 (01) : 2 - 3
  • [44] Recent Developments in Single-Cell RNA-Seq of Microorganisms
    Zhang, Yi
    Gao, Jiaxin
    Huang, Yanyi
    Wang, Jianbin
    BIOPHYSICAL JOURNAL, 2018, 115 (02) : 173 - 180
  • [45] Embracing the dropouts in single-cell RNA-seq analysis
    Qiu, Peng
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [46] Single-cell RNA-seq to decipher tumour architecture
    Ross Cloney
    Nature Reviews Genetics, 2017, 18 : 2 - 3
  • [47] Quality control of single-cell RNA-seq by SinQC
    Jiang, Peng
    Thomson, James A.
    Stewart, Ron
    BIOINFORMATICS, 2016, 32 (16) : 2514 - 2516
  • [48] Single-Cell RNA-Seq in Human Lung Cancer
    Kim, J.
    Xu, Z.
    Marignani, P.
    JOURNAL OF THORACIC ONCOLOGY, 2018, 13 (10) : S911 - S911
  • [49] The Advances of Single-Cell RNA-Seq in Kidney Immunology
    Zeng, Honghui
    Yang, Xiaoqiang
    Luo, Siweier
    Zhou, Yiming
    FRONTIERS IN PHYSIOLOGY, 2021, 12
  • [50] Comparison of transformations for single-cell RNA-seq data
    Ahlmann-Eltze, Constantin
    Huber, Wolfgang
    NATURE METHODS, 2023, 20 (05) : 665 - +