M3S: a comprehensive model selection for multi-modal single-cell RNA sequencing data

被引:8
作者
Zhang, Yu [1 ,2 ]
Wan, Changlin [2 ,3 ]
Wang, Pengcheng [4 ]
Chang, Wennan [2 ,3 ]
Huo, Yan [2 ,5 ]
Chen, Jian [6 ]
Ma, Qin [7 ]
Cao, Sha [2 ,8 ]
Zhang, Chi [2 ,3 ,9 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, MOE Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
[2] Indiana Univ Sch Med, Ctr Computat Biol & Bioinformat, Indianapolis, IN 46202 USA
[3] Purdue Univ, Dept Elect Comp Engn, W Lafayette, IN 47907 USA
[4] Indiana Univ Purdue Univ, Dept Comp Sci, Indianapolis, IN 46202 USA
[5] China Med Univ, Sch Fundamental Sci, Shenyang 110122, Peoples R China
[6] Tongji Univ, Shanghai Pulm Hosp, Sch Med, Shanghai 200082, Peoples R China
[7] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[8] Indiana Univ Sch Med, Dept Biostat, Indianapolis, IN 46202 USA
[9] Dept Med & Mol Genet, Indianapolis, IN 46202 USA
基金
中国国家自然科学基金;
关键词
Single cell RNA-seq; Multimodality; Differential gene expression analysis; Drop-seq; Left truncated mixture Gaussian;
D O I
10.1186/s12859-019-3243-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Various statistical models have been developed to model the single cell RNA-seq expression profiles, capture its multimodality, and conduct differential gene expression test. However, for expression data generated by different experimental design and platforms, there is currently lack of capability to determine the most proper statistical model. Results: We developed an R package, namely Multi-Modal Model Selection (M3S), for gene-wise selection of the most proper multi-modality statistical model and downstream analysis, useful in a single-cell or large scale bulk tissue transcriptomic data. M3S is featured with (1) gene-wise selection of the most parsimonious model among 11 most commonly utilized ones, that can best fit the expression distribution of the gene, (2) parameter estimation of a selected model, and (3) differential gene expression test based on the selected model. Conclusion: A comprehensive evaluation suggested that M3S can accurately capture the multimodality on simulated and real single cell data. An open source package and is available through GitHub at https://github.com/zy26/M3S.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Analysis of cell-cell interaction between mural granulosa cells and cumulus granulosa cells during ovulation using single-cell RNA sequencing data of mouse ovary
    Shirafuta, Yuichiro
    Tamura, Isao
    Shiroshita, Amon
    Fujimura, Taishi
    Maekawa, Ryo
    Taketani, Toshiaki
    Sugino, Norihiro
    [J]. REPRODUCTIVE MEDICINE AND BIOLOGY, 2024, 23 (01)
  • [42] Single-cell RNA sequencing of neurofibromas reveals a tumor microenvironment favorable for neural regeneration and immune suppression in a neurofibromatosis type 1 porcine model
    Mclean, Dalton T.
    Meudt, Jennifer J.
    Rivera, Loren D. Lopez
    Schomberg, Dominic T.
    Pavelec, Derek M.
    Duellman, Tyler T.
    Buehler, Darya G.
    Schwartz, Patrick B.
    Graham, Melissa
    Lee, Laura M.
    Graff, Keri D.
    Reichert, Jamie L.
    Bon-Durant, Sandra S.
    Konsitzke, Charles M.
    Ronnekleiv-Kelly, Sean M.
    Shanmuganayagam, Dhanansayan
    Rubinstein, C. Dustin
    [J]. FRONTIERS IN ONCOLOGY, 2023, 13
  • [43] Advancing single-cell RNA-seq data analysis through the fusion of multi-layer perceptron and graph neural network
    Feng, Xiang
    Xiu, Yu-Han
    Long, Hai-Xia
    Wang, Zi-Tong
    Bilal, Anas
    Yang, Li-Ming
    [J]. BRIEFINGS IN BIOINFORMATICS, 2024, 25 (01)
  • [44] Single-cell RNA sequencing unveils Lrg1's role in cerebral ischemia-reperfusion injury by modulating various cells
    Ruan, Zhaohui
    Cao, Guosheng
    Qian, Yisong
    Fu, Longsheng
    Hu, Jinfang
    Xu, Tiantian
    Wu, Yaoqi
    Lv, Yanni
    [J]. JOURNAL OF NEUROINFLAMMATION, 2023, 20 (01)
  • [45] Integration of Single-Cell and Bulk RNA-seq Data to Identify the Cancer-Associated Fibroblast Subtypes and Risk Model in Glioma
    Yan, Xiuwei
    Gao, Xin
    Dong, Jiawei
    Wang, Fang
    Jiang, Xiaoyan
    Hu, Xueyan
    Zhang, Jiheng
    Wang, Nan
    Xu, Lei
    Liu, Zhihui
    Hu, Shaoshan
    Zhao, Hongtao
    [J]. BIOCHEMICAL GENETICS, 2024, 63 (2) : 1275 - 1297
  • [46] Deep Multi-Constraint Soft Clustering Analysis for Single-Cell RNA-Seq Data via Zero-Inflated Autoencoder Embedding
    He, Yezi
    Chen, Xiangtao
    Tu, Nguyen Hoang
    Luo, Jiawei
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (03) : 2254 - 2265
  • [47] Summary data-based Mendelian randomization and single-cell RNA sequencing analyses identify immune associations with low-level LGALS9 in sepsis
    Yang, Yongsan
    Dong, Lei
    Li, Yanguo
    Huang, Ye
    Zeng, Xiaoxi
    [J]. JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2024, 28 (14)
  • [48] Discrete distributional differential expression (D3E) - a tool for gene expression analysis of single-cell RNA-seq data
    Mihails Delmans
    Martin Hemberg
    [J]. BMC Bioinformatics, 17
  • [49] Discrete distributional differential expression (D3E) - a tool for gene expression analysis of single-cell RNA-seq data
    Delmans, Mihails
    Hemberg, Martin
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [50] Inferring microenvironmental regulation of gene expression from single-cell RNA sequencing data using scMLnet with an application to COVID-19 (vol 22, pg 988, 2021)
    Cheng, Jinyu
    Zhang, Ji
    Wu, Zhongdao
    Sun, Xiaoqiang
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (02) : 1511 - 1512