A HIERARCHICAL BAYESIAN MODEL FOR SINGLE-CELL CLUSTERING USING RNA-SEQUENCING DATA

Cited by: 0
Authors
Liu, Yiyi [1]
Warren, Joshua L. [1]
Zhao, Hongyu [1]
Affiliations
[1] Yale Univ, Dept Biostat, Sch Publ Hlth, New Haven, CT 06520 USA
Keywords
Bayesian hierarchical model; clustering; Dirichlet process; Gaussian mixture model; missing data; single-cell RNA-sequencing; transcriptomes; heterogeneity; visualization; challenges
DOI
10.1214/19-AOAS1250
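Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract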
Understanding the heterogeneity of cells is an important biological question. The development of single-cell RNA-sequencing (scRNA-seq) technology provides high resolution data for such inquiry. A key challenge in scRNA-seq analysis is the high variability of measured RNA expression levels and frequent dropouts (missing values) due to limited input RNA compared to bulk RNA-seq measurement. Existing clustering methods do not perform well for these noisy and zero-inflated scRNA-seq data. In this manuscript we propose a Bayesian hierarchical model, called BasClu, to appropriately characterize important features of scRNA-seq data in order to more accurately cluster cells. We demonstrate the effectiveness of our method with extensive simulation studies and applications to three real scRNA-seq datasets.
Pages: 1733-1752
Number of pages: 20
Related papers (50 in total)
  • [1] Clustering and classification methods for single-cell RNA-sequencing data
    Qi, Ren
    Ma, Anjun
    Ma, Qin
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1196 - 1208
  • [2] Machine learning and statistical methods for clustering single-cell RNA-sequencing data
    Petegrosso, Raphael
    Li, Zhuliu
    Kuang, Rui
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1209 - 1223
  • [3] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Tian, Yu
    Zheng, Ruiqing
    Liang, Zhenlan
    Li, Suning
    Wu, Fang-Xiang
    Li, Min
    TSINGHUA SCIENCE AND TECHNOLOGY, 2021, 26 (05) : 772 - 789
  • [4] Accounting for technical noise in Bayesian graphical models of single-cell RNA-sequencing data
    Oh, Jihwan
    Chang, Changgee
    Long, Qi
    BIOSTATISTICS, 2022, 24 (01) : 161 - 176
  • [5] Missing data and technical variability in single-cell RNA-sequencing experiments
    Hicks, Stephanie C.
    Townes, F. William
    Teng, Mingxiang
    Irizarry, Rafael A.
    BIOSTATISTICS, 2018, 19 (04) : 562 - 578
  • [6] Single-cell RNA-sequencing of the brain
    Duran, Raquel Cuevas-Diaz
    Wei, Haichao
    Wu, Jia Qian
    CLINICAL AND TRANSLATIONAL MEDICINE, 2017, 6
  • [7] scGAAC: A graph attention autoencoder for clustering single-cell RNA-sequencing data
    Zhang, Lin
    Xiang, Haiping
    Wang, Feng
    Chen, Zepeng
    Shen, Mo
    Ma, Jiani
    Liu, Hui
    Zheng, Hongdang
    METHODS, 2024, 229 : 115 - 124
  • [8] Clustering Single-cell RNA-sequencing Data based on Matching Clusters Structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (01) : 89 - 95
  • [9] Clustering methods for single-cell RNA-sequencing expression data: performance evaluation with varying sample sizes and cell compositions
    Suner, Asli
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2019, 18 (05)
  • [10] Single-Cell RNA Sequencing Data Interpretation by Evolutionary Multiobjective Clustering
    Li, Xiangtao
    Wong, Ka-Chun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (05) : 1773 - 1784