scAAGA: Single cell data analysis framework using asymmetric autoencoder with gene attention

被引:64
作者
Meng, Rui [1 ]
Yin, Shuaidong [1 ]
Sun, Jianqiang [2 ]
Hu, Huan [3 ]
Zhao, Qi [1 ]
机构
[1] Univ Sci & Technol Liaoning, Sch Comp Sci & Software Engn, Anshan 114051, Peoples R China
[2] Linyi Univ, Sch Informat Sci & Engn, Linyi 276000, Peoples R China
[3] Fuzhou Univ, Inst Appl Genom, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
scRNA-seq; Deep learning; Gene attention; Data augmentation; COVID-19; RNA;
D O I
10.1016/j.compbiomed.2023.107414
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, single-cell RNA sequencing (scRNA-seq) has emerged as a powerful technique for investigating cellular heterogeneity and structure. However, analyzing scRNA-seq data remains challenging, especially in the context of COVID-19 research. Single-cell clustering is a key step in analyzing scRNA-seq data, and deep learning methods have shown great potential in this area. In this work, we propose a novel scRNA-seq analysis framework called scAAGA. Specifically, we utilize an asymmetric autoencoder with a gene attention module to learn important gene features adaptively from scRNA-seq data, with the aim of improving the clustering effect. We apply scAAGA to COVID19 peripheral blood mononuclear cell (PBMC) scRNA-seq data and compare its performance with state-of-the-art methods. Our results consistently demonstrate that scAAGA outperforms existing methods in terms of adjusted rand index (ARI), normalized mutual information (NMI), and adjusted mutual information (AMI) scores, achieving improvements ranging from 2.8% to 27.8% in NMI scores. Additionally, we discuss a data augmentation technology to expand the datasets and improve the accuracy of scAAGA. Overall, scAAGA presents a robust tool for scRNA-seq data analysis, enhancing the accuracy and reliability of clustering results in COVID-19 research.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] GRNUlar: A Deep Learning Framework for Recovering Single-Cell Gene Regulatory Networks
    Shrivastava, Harsh
    Zhang, Xiuwei
    Song, Le
    Aluru, Srinivas
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2022, 29 (01) : 27 - 44
  • [32] scAnnotatR: framework to accurately classify cell types in single-cell RNA-sequencing data
    Nguyen, Vy
    Griss, Johannes
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [33] Gene Data Analysis for Disease Detection Using Data Mining Algorithms
    Raman, Ramakrishnan
    CARDIOMETRY, 2022, (25): : 178 - 181
  • [34] Aspect based sentiment analysis of consumer reviews using unsupervised attention neural framework
    Dey, Atanu
    Jenamani, Mamata
    APPLIED SOFT COMPUTING, 2024, 167
  • [35] scAnnotatR: framework to accurately classify cell types in single-cell RNA-sequencing data
    Vy Nguyen
    Johannes Griss
    BMC Bioinformatics, 23
  • [36] Topological and geometric analysis of cell states in single-cell transcriptomic data
    Huynh, Tram
    Cang, Zixuan
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [37] Computational Cell Cycle Analysis of Single Cell RNA-Seq Data
    Moussa, Marmar
    Mandoiu, Ion I.
    COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES, 2021, 12686 : 71 - 87
  • [38] Interpretable generative deep learning: an illustration with single cell gene expression data
    Martin Treppner
    Harald Binder
    Moritz Hess
    Human Genetics, 2022, 141 : 1481 - 1498
  • [39] Identification of Potential Prognostic Biomarkers for ESCC Using Single-Cell RNA Sequencing Data Analysis
    Patowary, Pallabi
    Bhattacharyya, Dhruba K.
    Barah, Pankaj
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2023, 2023, 14301 : 853 - 861
  • [40] Identifying gene expression programs in single-cell RNA-seq data using linear correlation explanation
    Nussbaum, Yulia I.
    Hossain, K. S. M. Tozammel
    Kaifi, Jussuf
    Warren, Wesley C.
    Shyu, Chi-Ren
    Mitchem, Jonathan B.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 154