M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data

被引:8
作者
Zhang, Yu [1 ,2 ]
Wan, Changlin [2 ,3 ]
Wang, Pengcheng [4 ]
Chang, Wennan [2 ,3 ]
Huo, Yan [2 ,5 ]
Chen, Jian [6 ]
Ma, Qin [7 ]
Cao, Sha [2 ,8 ]
Zhang, Chi [2 ,3 ,9 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, MOE Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
[2] Indiana Univ Sch Med, Ctr Computat Biol & Bioinformat, Indianapolis, IN 46202 USA
[3] Purdue Univ, Dept Elect Comp Engn, W Lafayette, IN 47907 USA
[4] Indiana Univ Purdue Univ, Dept Comp Sci, Indianapolis, IN 46202 USA
[5] China Med Univ, Sch Fundamental Sci, Shenyang 110122, Peoples R China
[6] Tongji Univ, Shanghai Pulm Hosp, Sch Med, Shanghai 200082, Peoples R China
[7] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[8] Indiana Univ Sch Med, Dept Biostat, Indianapolis, IN 46202 USA
[9] Dept Med & Mol Genet, Indianapolis, IN 46202 USA
基金
中国国家自然科学基金;
关键词
Single cell RNA-seq; Multimodality; Differential gene expression analysis; Drop-seq; Left truncated mixture Gaussian;
D O I
10.1186/s12859-019-3243-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Various statistical models have been developed to model the single cell RNA-seq expression profiles, capture its multimodality, and conduct differential gene expression test. However, for expression data generated by different experimental design and platforms, there is currently lack of capability to determine the most proper statistical model. Results: We developed an R package, namely Multi-Modal Model Selection (M3S), for gene-wise selection of the most proper multi-modality statistical model and downstream analysis, useful in a single-cell or large scale bulk tissue transcriptomic data. M3S is featured with (1) gene-wise selection of the most parsimonious model among 11 most commonly utilized ones, that can best fit the expression distribution of the gene, (2) parameter estimation of a selected model, and (3) differential gene expression test based on the selected model. Conclusion: A comprehensive evaluation suggested that M3S can accurately capture the multimodality on simulated and real single cell data. An open source package and is available through GitHub at https://github.com/zy26/M3S.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] MultiK: an automated tool to determine optimal cluster numbers in single-cell RNA sequencing data
    Liu, Siyao
    Thennavan, Aatish
    Garay, Joseph P.
    Marron, J. S.
    Perou, Charles M.
    GENOME BIOLOGY, 2021, 22 (01)
  • [22] STRUCTURE LEARNING FOR ZERO-INFLATED COUNTS WITH AN APPLICATION TO SINGLE-CELL RNA SEQUENCING DATA
    Nguyen, Thi Kim Hue
    van den Berge, Koen
    Chiogna, Monica
    Risso, Davide
    ANNALS OF APPLIED STATISTICS, 2023, 17 (03) : 2555 - 2573
  • [23] The effect of methanol fixation on single-cell RNA sequencing data (vol 22, 420, 2021)
    Wang, Xinlei
    Yu, Lei
    Wu, Angela Ruohao
    BMC GENOMICS, 2021, 22 (01)
  • [24] The heterogeneity of breast cancer metastasis: a bioinformatics analysis utilizing single-cell RNA sequencing data
    Sanjaya, Ardo
    Ratnawati, Hana
    Adhika, Oeij Anindita
    Rahmatilah, Faiz Rizqy
    BREAST CANCER RESEARCH AND TREATMENT, 2024, 208 (02) : 379 - 390
  • [25] MultiK: an automated tool to determine optimal cluster numbers in single-cell RNA sequencing data
    Siyao Liu
    Aatish Thennavan
    Joseph P. Garay
    J. S. Marron
    Charles M. Perou
    Genome Biology, 22
  • [26] Pooling across cells to normalize single-cell RNA sequencing data with many zero counts
    Lun, Aaron T. L.
    Bach, Karsten
    Marioni, John C.
    GENOME BIOLOGY, 2016, 17
  • [27] CELL SUBCLASS IDENTIFICATION IN SINGLE-CELL RNA-SEQUENCING DATA USING ORTHOGONAL NONNEGATIVE MATRIX FACTORIZATION
    Wang, Shuai
    Wu, Peng
    Zhou, Manqi
    Chang, Tsung-Hui
    Wu, Song
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 876 - 880
  • [28] Surveying brain tumor heterogeneity by single-cell RNA-sequencing of multi-sector biopsies
    Yu, Kai
    Hu, Yuqiong
    Wu, Fan
    Guo, Qiufang
    Qian, Zenghui
    Hu, Waner
    Chen, Jing
    Wang, Kuanyu
    Fan, Xiaoying
    Wu, Xinglong
    Rasko, John E. J.
    Fan, Xiaolong
    Iavarone, Antonio
    Jiang, Tao
    Tang, Fuchou
    Su, Xiao-Dong
    NATIONAL SCIENCE REVIEW, 2020, 7 (08) : 1306 - 1318
  • [29] Integration of single-cell and bulk RNA sequencing data reveals key cell types and regulators in traumatic brain injury
    Zheng, Rui-zhe
    Xing, Jin
    Huang, Qiong
    Yang, Xi-tao
    Zhao, Chang-yi
    Li, Xin-yuan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (02) : 1201 - 1214
  • [30] CB2 improves power of cell detection in droplet-based single-cell RNA sequencing data
    Ni, Zijian
    Chen, Shuyang
    Brown, Jared
    Kendziorski, Christina
    GENOME BIOLOGY, 2020, 21 (01)