Mugen-UMAP: UMAP visualization and clustering of mutated genes in single-cell DNA sequencing data

被引:1
|
作者
Li, Teng [1 ,2 ]
Zou, Yiran [2 ]
Li, Xianghan [2 ]
Wong, Thomas K. F. [2 ,3 ]
Rodrigo, Allen G. [1 ,2 ]
机构
[1] Univ Auckland, Sch Biol Sci, Auckland, New Zealand
[2] Australian Natl Univ, Res Sch Biol, Canberra, ACT, Australia
[3] Australian Natl Univ, Sch Comp, Canberra, ACT, Australia
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
UMAP; Visualization; Clustering; Single-cell DNA sequencing; Gene mutation; CANCER; EVOLUTION; PATTERNS;
D O I
10.1186/s12859-024-05928-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundThe application of Uniform Manifold Approximation and Projection (UMAP) for dimensionality reduction and visualization has revolutionized the analysis of single-cell RNA expression and population genetics. However, its potential in single-cell DNA sequencing data analysis, particularly for visualizing gene mutation information, has not been fully explored.ResultsWe introduce Mugen-UMAP, a novel Python-based program that extends UMAP's utility to single-cell DNA sequencing data. This innovative tool provides a comprehensive pipeline for processing gene annotation files of single-cell somatic single-nucleotide variants and metadata to the visualization of UMAP projections for identifying clusters, along with various statistical analyses. Employing Mugen-UMAP, we analyzed whole-exome sequencing data from 365 single-cell samples across 12 non-small cell lung cancer (NSCLC) patients, revealing distinct clusters associated with histological subtypes of NSCLC. Moreover, to demonstrate the general utility of Mugen-UMAP, we applied the program to 9 additional single-cell WES datasets from various cancer types, uncovering interesting patterns of cell clusters that warrant further investigation. In summary, Mugen-UMAP provides a quick and effective visualization method to uncover cell cluster patterns based on the gene mutation information from single-cell DNA sequencing data.ConclusionsThe application of Mugen-UMAP demonstrates its capacity to provide valuable insights into the visualization and interpretation of single-cell DNA sequencing data. Mugen-UMAP can be found at https://github.com/tengchn/Mugen-UMAP
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Somatic variant calling from single-cell DNA sequencing data
    Valecha, Monica
    Posada, David
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 2978 - 2985
  • [22] Effective Clustering for Single Cell Sequencing Cancer Data
    Ciccolella, Simone
    Patterson, Murray
    Bonizzoni, Paola
    Della Vedova, Gianluca
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (11) : 4068 - 4078
  • [23] SCSilicon: a tool for synthetic single-cell DNA sequencing data generation
    Feng, Xikang
    Chen, Lingxi
    BMC GENOMICS, 2022, 23 (SUPPL 4)
  • [24] SIEVE: joint inference of single-nucleotide variants and cell phylogeny from single-cell DNA sequencing data
    Kang, Senbai
    Borgsmueller, Nico
    Valecha, Monica
    Kuipers, Jack
    Alves, Joao M.
    Prado-Lopez, Sonia
    Chantada, Debora
    Beerenwinkel, Niko
    Posada, David
    Szczurek, Ewa
    GENOME BIOLOGY, 2022, 23 (01)
  • [25] Clustering Single-cell RNA-sequencing Data based on Matching Clusters Structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (01): : 89 - 95
  • [26] scConsensus: combining supervised and unsupervised clustering for cell type identification in single-cell RNA sequencing data
    Ranjan, Bobby
    Schmidt, Florian
    Sun, Wenjie
    Park, Jinyu
    Honardoost, Mohammad Amin
    Tan, Joanna
    Arul Rayan, Nirmala
    Prabhakar, Shyam
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [27] scConsensus: combining supervised and unsupervised clustering for cell type identification in single-cell RNA sequencing data
    Bobby Ranjan
    Florian Schmidt
    Wenjie Sun
    Jinyu Park
    Mohammad Amin Honardoost
    Joanna Tan
    Nirmala Arul Rayan
    Shyam Prabhakar
    BMC Bioinformatics, 22
  • [28] Reference-free inference of tumor phylogenies from single-cell sequencing data
    Subramanian, Ayshwarya
    Schwartz, Russell
    BMC GENOMICS, 2015, 16
  • [29] Single-Cell RNA Sequencing Data Clustering by Low-Rank Subspace Ensemble Framework
    Wang, ChuanYuan
    Gao, Ying-Lian
    Liu, Jin-Xing
    Kong, Xiong-Zhen
    Zheng, Chun-Hou
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) : 1154 - 1164
  • [30] Spectral Clustering of Single-Cell RNA-Sequencing Data by Multiple Feature Sets Affinity
    Liu, Yang
    Li, Feng
    Shang, Junliang
    Ge, Daohui
    Ren, Qianqian
    Li, Shengjun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT III, 2023, 14088 : 268 - 278