moSCminer: a cell subtype classification framework based on the attention neural network integrating the single-cell multi-omics dataset on the cloud

被引:0
作者
Choi, Joung Min [1 ]
Park, Chaelin [2 ]
Chae, Heejoon [2 ]
机构
[1] Virginia Polytech Inst & State Univ Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Sookmyung Womens Univ, Div Comp Sci, Seoul, South Korea
来源
PEERJ | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
Attention-based neural network; Cell subtype classification; Deep learning-based framework; Single-cell multi-omics; Self attention; Web platform; Cloud system; RNA; MATURATION;
D O I
10.7717/peerj.17006
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Single-cell omics sequencing has rapidly advanced, enabling the quantification of diverse omics profiles at a single-cell resolution. To facilitate comprehensive biological insights, such as cellular differentiation trajectories, precise annotation of cell subtypes is essential. Conventional methods involve clustering cells and manually assigning subtypes based on canonical markers, a labor-intensive and expert-dependent process. Hence, an automated computational prediction framework is crucial. While several classification frameworks for predicting cell subtypes from single-cell RNA sequencing datasets exist, these methods solely rely on single-omics data, offering insights at a single molecular level. They often miss inter-omic correlations and a holistic understanding of cellular processes. To address this, the integration of multi-omics datasets from individual cells is essential for accurate subtype annotation. This article introduces moSCminer, a novel framework for classifying cell subtypes that harnesses the power of single-cell multi-omics sequencing datasets through an attention-based neural network operating at the omics level. By integrating three distinct omics datasets-gene expression, DNA methylation, and DNA accessibility-while accounting for their biological relationships, moSCminer excels at learning the relative significance of each omics feature. It then transforms this knowledge into a novel representation for cell subtype classification. Comparative evaluations against standard machine learning-based classifiers demonstrate moSCminer's superior performance, consistently achieving the highest average performance on real datasets. The efficacy of multi-omics integration is further corroborated through an in-depth analysis of the omics-level attention module, which identifies potential markers for cell subtype annotation. To enhance accessibility and scalability, moSCminer is accessible as a user-friendly web-based platform seamlessly connected to a cloud system, publicly accessible at http://203.252.206.118:5568. Notably, this study marks the pioneering integration of three single-cell multi-omics datasets for cell subtype identification.
引用
收藏
页数:19
相关论文
共 46 条
  • [1] Abadi M., 2015, TENSORFLOW LARGE SCA
  • [2] Computational strategies for single-cell multi-omics integration
    Adossa, Nigatu
    Khan, Sofia
    Rytkonen, Kalle T.
    Elo, Laura L.
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 2588 - 2596
  • [3] Advances in single-cell multi-omics profiling
    Bai, Dongsheng
    Peng, Jinying
    Yi, Chengqi
    [J]. RSC CHEMICAL BIOLOGY, 2021, 2 (02): : 441 - 449
  • [4] DeepTRIAGE: interpretable and individualised biomarker scores using attention mechanism for the classification of breast cancer sub-types
    Beykikhoshk, Adham
    Quinn, Thomas P.
    Lee, Samuel C.
    Truyen Tran
    Venkatesh, Svetha
    [J]. BMC MEDICAL GENOMICS, 2020, 13 (Suppl 3)
  • [5] Integrative single-cell multiomics analyses dissect molecular signatures of intratumoral heterogeneities and differentiation states of human gastric cancer
    Bian, Shuhui
    Wang, Yicheng
    Zhou, Yuan
    Wang, Wendong
    Guo, Limei
    Wen, Lu
    Fu, Wei
    Zhou, Xin
    Tang, Fuchou
    [J]. NATIONAL SCIENCE REVIEW, 2023, 10 (06)
  • [6] CHETAH: a selective, hierarchical cell type identification method for single-cell RNA sequencing
    de Kanter, Jurrian K.
    Lijnzaad, Philip
    Candelli, Tito
    Margaritis, Thanasis
    Holstege, Frank C. P.
    [J]. NUCLEIC ACIDS RESEARCH, 2019, 47 (16)
  • [7] CpG islands and the regulation of transcription
    Deaton, Aimee M.
    Bird, Adrian
    [J]. GENES & DEVELOPMENT, 2011, 25 (10) : 1010 - 1022
  • [8] scMoC: single-cell multi-omics clustering
    Eltager, Mostafa
    Abdelaal, Tamim
    Mahfouz, Ahmed
    Reinders, Marcel J. T.
    [J]. BIOINFORMATICS ADVANCES, 2022, 2 (01):
  • [9] Multi-omics integration method based on attention deep learning network for biomedical data classification
    Gong, Ping
    Cheng, Lei
    Zhang, Zhiyuan
    Meng, Ao
    Li, Enshuo
    Chen, Jie
    Zhang, Longzhen
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 231
  • [10] Single-cell multi-omics and lineage tracing to dissect cell fate decision-making
    Haghverdi, Laleh
    Ludwig, Leif S.
    [J]. STEM CELL REPORTS, 2023, 18 (01): : 13 - 25