A HIERARCHICAL BAYESIAN MODEL FOR SINGLE-CELL CLUSTERING USING RNA-SEQUENCING DATA

被引:0
|
作者
Liu, Yiyi [1 ]
Warren, Joshua L. [1 ]
Zhao, Hongyu [1 ]
机构
[1] Yale Univ, Dept Biostat, Sch Publ Hlth, New Haven, CT 06520 USA
来源
ANNALS OF APPLIED STATISTICS | 2019年 / 13卷 / 03期
关键词
Bayesian hierarchical model; clustering; Dirichlet process; Gaussian mixture model; missing data; single-cell RNA-sequencing; TRANSCRIPTOMES; HETEROGENEITY; VISUALIZATION; CHALLENGES;
D O I
10.1214/19-AOAS1250
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Understanding the heterogeneity of cells is an important biological question. The development of single-cell RNA-sequencing (scRNA-seq) technology provides high resolution data for such inquiry. A key challenge in scRNA-seq analysis is the high variability of measured RNA expression levels and frequent dropouts (missing values) due to limited input RNA compared to bulk RNA-seq measurement. Existing clustering methods do not perform well for these noisy and zero-inflated scRNA-seq data. In this manuscript we propose a Bayesian hierarchical model, called BasClu, to appropriately characterize important features of scRNA-seq data in order to more accurately cluster cells. We demonstrate the effectiveness of our method with extensive simulation studies and applications to three real scRNA-seq datasets.
引用
收藏
页码:1733 / 1752
页数:20
相关论文
共 50 条
  • [41] The Poisson distribution model fits UMI-based single-cell RNA-sequencing data
    Yue Pan
    Justin T. Landis
    Razia Moorad
    Di Wu
    J. S. Marron
    Dirk P. Dittmer
    BMC Bioinformatics, 24
  • [42] scMODD: A model-driven algorithm for doublet identification in single-cell RNA-sequencing data
    Zhao, Xinye
    Du, Alexander
    Qiu, Peng
    FRONTIERS IN SYSTEMS BIOLOGY, 2023, 2
  • [43] The Poisson distribution model fits UMI-based single-cell RNA-sequencing data
    Pan, Yue
    Landis, Justin T.
    Moorad, Razia
    Wu, Di
    Marron, J. S.
    Dittmer, Dirk P.
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [44] A data denoising approach to optimize functional clustering of single cell RNA-sequencing data
    Wan, Changlin
    Jia, Dongya
    Zhao, Yue
    Chang, Wennan
    Cao, Sha
    Wang, Xiao
    Zhang, Chi
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 217 - 222
  • [45] Constructing Simulation Data with Dependence Structure for Unreliable Single-Cell RNA-Sequencing Data Using Copulas
    Fuetterer, Cornelia
    Schollmeyer, Georg
    Augustin, Thomas
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITIES: THEORIES AND APPLICATIONS (ISIPTA 2019), 2019, 103 : 216 - 224
  • [46] Characterization of iCell cardiomyocytes using single-cell RNA-sequencing methods
    Schmid, Christina
    Wohnhaas, Christian T.
    Hildebrandt, Tobias
    Baum, Patrick
    Rast, Georg
    JOURNAL OF PHARMACOLOGICAL AND TOXICOLOGICAL METHODS, 2020, 106
  • [47] Clustering methods for single-cell RNA-sequencing expression data: performance evaluation with varying sample sizes and cell compositions
    Suner, Asli
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2019, 18 (05)
  • [48] CELL SUBCLASS IDENTIFICATION IN SINGLE-CELL RNA-SEQUENCING DATA USING ORTHOGONAL NONNEGATIVE MATRIX FACTORIZATION
    Wang, Shuai
    Wu, Peng
    Zhou, Manqi
    Chang, Tsung-Hui
    Wu, Song
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 876 - 880
  • [49] Quantitative assessment of single-cell RNA-sequencing methods
    Angela R Wu
    Norma F Neff
    Tomer Kalisky
    Piero Dalerba
    Barbara Treutlein
    Michael E Rothenberg
    Francis M Mburu
    Gary L Mantalas
    Sopheak Sim
    Michael F Clarke
    Stephen R Quake
    Nature Methods, 2014, 11 : 41 - 46
  • [50] Reusability report: Learning the transcriptional grammar in single-cell RNA-sequencing data using transformers
    Sumeer Ahmad Khan
    Alberto Maillo
    Vincenzo Lagani
    Robert Lehmann
    Narsis A. Kiani
    David Gomez-Cabrero
    Jesper Tegner
    Nature Machine Intelligence, 2023, 5 : 1437 - 1446