Classification of Single-Cell Gene Expression Trajectories from Incomplete and Noisy Data

被引:6
|
作者
Karbalayghareh, Alireza [1 ]
Braga-Neto, Ulisses [1 ]
Dougherty, Edward R. [1 ]
机构
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
基金
美国国家科学基金会;
关键词
Gene regulatory network; probabilistic Boolean network; trajectory classification; Bayes classifier; expectation maximization; hidden Markov model; partially observed Boolean dynamical system; single-cell gene expression trajectory;
D O I
10.1109/TCBB.2017.2763946
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This paper studies classification of gene-expression trajectories coming from two classes, healthy and mutated (cancerous) using Boolean networks with perturbation (BNps) to model the dynamics of each class at the state level. Each class has its own BNp, which is partially known based on gene pathways. We employ a Gaussian model at the observation level to show the expression values of the genes given the hidden binary states at each time point. We use expectation maximization (EM) to learn the BNps and the unknown model parameters, derive closed-form updates for the parameters, and propose a learning algorithm. After learning, a plug-in Bayes classifier is used to classify unlabeled trajectories, which can have missing data. Measuring gene expressions at different times yields trajectories only when measurements come from a single cell. In multiple-cell scenarios, the expression values are averages over many cells with possibly different states. Via the central-limit theorem, we propose another model for expression data in multiple-cell scenarios. Simulations demonstrate that single-cell trajectory data can outperform multiple-cell average expression data relative to classification error, especially in high-noise situations. We also consider data generated via a mammalian cell-cycle network, both the wild-type and with a common mutation affecting p27.
引用
收藏
页码:193 / 207
页数:15
相关论文
共 50 条
  • [1] Cancer classification of single-cell gene expression data by neural network
    Kim, Bong-Hyun
    Yu, Kijin
    Lee, Peter C. W.
    BIOINFORMATICS, 2020, 36 (05) : 1360 - 1366
  • [2] Privacy of single-cell gene expression data
    Cho, Hyunghoon
    PATTERNS, 2024, 5 (11):
  • [3] Putative cell type discovery from single-cell gene expression data
    Miao, Zhichao
    Moreno, Pablo
    Huang, Ni
    Papatheodorou, Irene
    Brazma, Alvis
    Teichmann, Sarah A.
    NATURE METHODS, 2020, 17 (06) : 621 - +
  • [4] Putative cell type discovery from single-cell gene expression data
    Zhichao Miao
    Pablo Moreno
    Ni Huang
    Irene Papatheodorou
    Alvis Brazma
    Sarah A. Teichmann
    Nature Methods, 2020, 17 : 621 - 628
  • [5] Spatial reconstruction of single-cell gene expression data
    Rahul Satija
    Jeffrey A Farrell
    David Gennert
    Alexander F Schier
    Aviv Regev
    Nature Biotechnology, 2015, 33 : 495 - 502
  • [6] Spatial reconstruction of single-cell gene expression data
    Satija, Rahul
    Farrell, Jeffrey A.
    Gennert, David
    Schier, Alexander F.
    Regev, Aviv
    NATURE BIOTECHNOLOGY, 2015, 33 (05) : 495 - U206
  • [7] Synthesising Executable Gene Regulatory Networks from Single-Cell Gene Expression Data
    Fisher, Jasmin
    Koeksal, Ali Sinan
    Piterman, Nir
    Woodhouse, Steven
    COMPUTER AIDED VERIFICATION, PT I, 2015, 9206 : 544 - 560
  • [8] Joint learning of multiple gene networks from single-cell gene expression data
    Wu, Nuosi
    Yin, Fu
    Ou-Yang, Le
    Zhu, Zexuan
    Xie, Weixin
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 2583 - 2595
  • [9] Reconstructing developmental landscapes and trajectories from single-cell data
    Schiebinger, Geoffrey
    CURRENT OPINION IN SYSTEMS BIOLOGY, 2021, 27
  • [10] scINSIGHT for interpreting single-cell gene expression from biologically heterogeneous data
    Qian, Kun
    Fu, Shiwei
    Li, Hongwei
    Li, Wei Vivian
    GENOME BIOLOGY, 2022, 23 (01)