moSCminer: a cell subtype classification framework based on the attention neural network integrating the single-cell multi-omics dataset on the cloud

Times cited: 0
Authors
Choi, Joung Min [1]
Park, Chaelin [2]
Chae, Heejoon [2]
Affiliations
[1] Virginia Polytech Inst & State Univ Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Sookmyung Womens Univ, Div Comp Sci, Seoul, South Korea
Source
PEERJ | 2024 / Volume 12
Funding
National Research Foundation, Singapore;
Keywords
Attention-based neural network; Cell subtype classification; Deep learning-based framework; Single-cell multi-omics; Self attention; Web platform; Cloud system; RNA; MATURATION;
DOI
10.7717/peerj.17006
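Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Subject classification code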
07 ; 0710 ; 09 ;
Abstract
Single-cell omics sequencing has rapidly advanced, enabling the quantification of diverse omics profiles at single-cell resolution. To facilitate comprehensive biological insights, such as cellular differentiation trajectories, precise annotation of cell subtypes is essential. Conventional methods involve clustering cells and manually assigning subtypes based on canonical markers, a labor-intensive and expert-dependent process. Hence, an automated computational prediction framework is crucial. While several classification frameworks for predicting cell subtypes from single-cell RNA sequencing datasets exist, these methods rely solely on single-omics data, offering insights at a single molecular level. They often miss inter-omic correlations and a holistic understanding of cellular processes. To address this, the integration of multi-omics datasets from individual cells is essential for accurate subtype annotation. This article introduces moSCminer, a novel framework for classifying cell subtypes that applies an attention-based neural network, operating at the omics level, to single-cell multi-omics sequencing datasets. By integrating three distinct omics datasets (gene expression, DNA methylation, and DNA accessibility) while accounting for their biological relationships, moSCminer learns the relative significance of each omics feature. It then transforms this knowledge into a novel representation for cell subtype classification. Comparative evaluations against standard machine learning-based classifiers demonstrate moSCminer's superior performance, consistently achieving the highest average performance on real datasets. The efficacy of multi-omics integration is further corroborated through an in-depth analysis of the omics-level attention module, which identifies potential markers for cell subtype annotation.
To enhance accessibility and scalability, moSCminer is provided as a user-friendly web-based platform connected to a cloud system and is publicly accessible at http://203.252.206.118:5568. Notably, this study marks the first integration of three single-cell multi-omics datasets for cell subtype identification.
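The abstract does not give the architecture in detail, so the following is only a minimal sketch of what omics-level attention could look like, not the authors' implementation. It assumes each cell already has one fixed-length embedding per omics type (gene expression, DNA methylation, DNA accessibility) and uses a generic additive attention pooling; the function name `omics_attention`, the parameters `w_hidden` and `w_score`, and all dimensions are hypothetical, and the random weights stand in for values that would be learned during training.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def omics_attention(embeddings, w_hidden, w_score):
    """Fuse per-omics embeddings into one vector per cell.

    embeddings: (n_cells, n_omics, d) array, one embedding per omics type
    w_hidden:   (d, h) projection matrix (hypothetical, normally learned)
    w_score:    (h,) scoring vector (hypothetical, normally learned)
    Returns the fused representation (n_cells, d) and the attention
    weights (n_cells, n_omics).
    """
    hidden = np.tanh(embeddings @ w_hidden)   # (n_cells, n_omics, h)
    scores = hidden @ w_score                 # (n_cells, n_omics)
    weights = softmax(scores, axis=1)         # sums to 1 across omics types
    fused = (weights[..., None] * embeddings).sum(axis=1)  # (n_cells, d)
    return fused, weights


# Toy input: 4 cells, 3 omics types, 8-dimensional embeddings (all assumed).
rng = np.random.default_rng(0)
n_cells, n_omics, d, h = 4, 3, 8, 5
embeddings = rng.normal(size=(n_cells, n_omics, d))
w_hidden = rng.normal(size=(d, h))
w_score = rng.normal(size=h)

fused, weights = omics_attention(embeddings, w_hidden, w_score)
```

In a sketch like this, `weights` is the quantity one would inspect to see how much each omics type contributes for a given cell, and `fused` is what a downstream classifier would consume.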
Pages: 19