Single cell clustering based on cell-pair differentiability correlation and variance analysis

被引:70
作者
Jiang, Hao [1 ]
Sohn, Lydia L. [2 ]
Huan, Haiyan [3 ]
Chen, Luonan [4 ,5 ]
机构
[1] Renmin Univ China, Sch Informat, Dept Math, Beijing 100872, Peoples R China
[2] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[4] Chinese Acad Sci, CAS Ctr Excellence Mol Cell Sci, Inst Biochem & Cell Biol, Key Lab Syst Biol,Shanghai Inst Biol Sci, Shanghai 200031, Peoples R China
[5] Chinese Acad Sci, CAS Ctr Excellence Anim Evolut & Genet, Kunming 650223, Yunnan, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
EMBRYONIC STEM-CELLS; RNA-SEQ; MICROENVIRONMENT; CHALLENGES; GENES;
D O I
10.1093/bioinformatics/bty390
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The rapid advancement of single cell technologies has shed new light on the complex mechanisms of cellular heterogeneity. Identification of intercellular transcriptomic heterogeneity is one of the most critical tasks in single-cell RNA-sequencing studies. Results: We propose a new cell similarity measure based on cell-pair differentiability correlation, which is derived from gene differential pattern among all cell pairs. Through plugging into the frame-work of hierarchical clustering with this new measure, we further develop a variance analysis based clustering algorithm 'Corr' that can determine cluster number automatically and identify cell types accurately. The robustness and superiority of the proposed algorithm are compared with representative algorithms: shared nearest neighbor (SNN)-Cliq and several other state-of-the-art clustering methods, on many benchmark or real single cell RNA-sequencing datasets in terms of both internal criteria (clustering number and accuracy) and external criteria (purity, adjusted rand index, F1-measure). Moreover, differentiability vector with our new measure provides a new means in identifying potential biomarkers from cancer related single cell datasets even with strong noise. Prognosis analyses from independent datasets of cancers confirmed the effectiveness of our 'Corr' method.
引用
收藏
页码:3684 / 3694
页数:11
相关论文
共 32 条
  • [21] RNA-Seq of single prostate CTCs implicates noncanonical Wnt signaling in antiandrogen resistance
    Miyamoto, David T.
    Zheng, Yu
    Wittner, Ben S.
    Lee, Richard J.
    Zhu, Huili
    Broderick, Katherine T.
    Desai, Rushil
    Fox, Douglas B.
    Brannigan, Brian W.
    Trautwein, Julie
    Arora, Kshitij S.
    Desai, Niyati
    Dahl, Douglas M.
    Sequist, Lecia V.
    Smith, Matthew R.
    Kapur, Ravi
    Wu, Chin-Lee
    Shioda, Toshi
    Ramaswamy, Sridhar
    Ting, David T.
    Toner, Mehmet
    Maheswaran, Shyamala
    Haber, Daniel A.
    [J]. SCIENCE, 2015, 349 (6254) : 1351 - 1356
  • [22] Transdifferentiation of tadpole pancreatic acinar cells to duct cells mediated by Notch and stromelysin-3
    Mukhi, Sandeep
    Brown, Donald D.
    [J]. DEVELOPMENTAL BIOLOGY, 2011, 351 (02) : 311 - 317
  • [23] Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells
    Ramskold, Daniel
    Luo, Shujun
    Wang, Yu-Chieh
    Li, Robin
    Deng, Qiaolin
    Faridani, Omid R.
    Daniels, Gregory A.
    Khrebtukova, Irina
    Loring, Jeanne F.
    Laurent, Louise C.
    Schroth, Gary P.
    Sandberg, Rickard
    [J]. NATURE BIOTECHNOLOGY, 2012, 30 (08) : 777 - 782
  • [24] SILHOUETTES - A GRAPHICAL AID TO THE INTERPRETATION AND VALIDATION OF CLUSTER-ANALYSIS
    ROUSSEEUW, PJ
    [J]. JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 1987, 20 : 53 - 65
  • [25] SIDEseq: A Cell Similarity Measure Defined by Shared Identified Differentially Expressed Genes for Single-Cell RNA sequencing Data
    Schiffman C.
    Lin C.
    Shi F.
    Chen L.
    Sohn L.
    Huang H.
    [J]. Statistics in Biosciences, 2017, 9 (1) : 200 - 216
  • [26] Single-cell sequencing-based technologies will revolutionize whole-organism science
    Shapiro, Ehud
    Biezuner, Tamir
    Linnarsson, Sten
    [J]. NATURE REVIEWS GENETICS, 2013, 14 (09) : 618 - 630
  • [27] Computational and analytical challenges in single-cell transcriptomics
    Stegle, Oliver
    Teichmann, Sarah A.
    Marioni, John C.
    [J]. NATURE REVIEWS GENETICS, 2015, 16 (03) : 133 - 145
  • [28] Single-cell entropy for accurate estimation of differentiation potency from a cell's transcriptome
    Teschendorff, Andrew E.
    Enver, Tariq
    [J]. NATURE COMMUNICATIONS, 2017, 8
  • [29] Visualization and analysis of single-cell RNA-seq data by kernel-based similarity learning
    Wang, Bo
    Zhu, Junjie
    Pierson, Emma
    Ramazzotti, Daniele
    Batzoglou, Serafim
    [J]. NATURE METHODS, 2017, 14 (04) : 414 - +
  • [30] Identification of cell types from single-cell transcriptomes using a novel clustering method
    Xu, Chen
    Su, Zhengchang
    [J]. BIOINFORMATICS, 2015, 31 (12) : 1974 - 1980