A bi-Poisson model for clustering gene expression profiles by RNA-seq

被引:6
|
作者
Wang, Ningtao [1 ]
Wang, Yaqun [1 ]
Hao, Han [1 ]
Wang, Luojun [1 ]
Wang, Zhong
Wang, Jianxin [2 ]
Wu, Rongling [1 ,3 ,4 ]
机构
[1] Penn State Univ, Hershey, PA 17033 USA
[2] Beijing Forestry Univ, Beijing, Peoples R China
[3] Penn State Univ, Ctr Stat Genet, Hershey, PA 17033 USA
[4] Beijing Forestry Univ, Ctr Computat Biol, Beijing, Peoples R China
关键词
RNA-seq; Poisson distribution; EM algorithm; breast cancer cell lines; DIFFERENTIAL EXPRESSION; TRANSCRIPTION FACTORS; DYNAMICS;
D O I
10.1093/bib/bbt029
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the availability of gene expression data by RNA-seq, powerful statistical approaches for grouping similar gene expression profiles across different environments have become increasingly important. We describe and assess a computational model for clustering genes into distinct groups based on the pattern of gene expression in response to changing environment. The model capitalizes on the Poisson distribution to capture the count property of RNA-seq data. A two-stage hierarchical expectation-maximization (EM) algorithm is implemented to estimate an optimal number of groups and mean expression amounts of each group across two environments. A procedure is formulated to test whether and how a given group shows a plastic response to environmental changes. The impact of gene-environment interactions on the phenotypic plasticity of the organism can also be visualized and characterized. The model was used to analyse an RNA-seq dataset measured from two cell lines of breast cancer that respond differently to an anti-cancer drug, from which genes associated with the resistance and sensitivity of the cell lines are identified. We performed simulation studies to validate the statistical behaviour of the model. The model provides a useful tool for clustering gene expression data by RNA-seq, facilitating our understanding of gene functions and networks.
引用
收藏
页码:534 / 541
页数:8
相关论文
共 50 条
  • [41] Transformation and model choice for RNA-seq co-expression analysis
    Rau, Andrea
    Maugis-Rabusseau, Cathy
    BRIEFINGS IN BIOINFORMATICS, 2018, 19 (03) : 425 - 436
  • [42] RNA-Seq in Mytilus galloprovincialis: comparative transcriptomics and expression profiles among different tissues
    Moreira, Rebeca
    Pereiro, Patricia
    Canchaya, Carlos
    Posada, David
    Figueras, Antonio
    Novoa, Beatriz
    BMC GENOMICS, 2015, 16
  • [43] RNA-Seq in Mytilus galloprovincialis: comparative transcriptomics and expression profiles among different tissues
    Rebeca Moreira
    Patricia Pereiro
    Carlos Canchaya
    David Posada
    Antonio Figueras
    Beatriz Novoa
    BMC Genomics, 16
  • [44] A Unified Model for Robust Differential Expression Analysis of RNA-Seq Data
    Liu, Kefei
    Shen, Li
    Jiang, Hui
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 437 - 442
  • [45] A comparison of methods for differential expression analysis of RNA-seq data
    Soneson, Charlotte
    Delorenzi, Mauro
    BMC BIOINFORMATICS, 2013, 14
  • [46] Gene expression profiling of non-polyadenylated RNA-seq across species
    Zhang, Xiao-Ou
    Yin, Qing-Fei
    Chen, Ling-Ling
    Yang, Li
    GENOMICS DATA, 2014, 2 : 237 - 241
  • [47] Empirical Bayes Analysis of RNA-seq Data for Detection of Gene Expression Heterosis
    Jarad Niemi
    Eric Mittman
    Will Landau
    Dan Nettleton
    Journal of Agricultural, Biological, and Environmental Statistics, 2015, 20 : 614 - 628
  • [48] Measuring differential gene expression with RNA-seq: challenges and strategies for data analysis
    Finotello, Francesca
    Di Camillo, Barbara
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2015, 14 (02) : 130 - 142
  • [49] RNA-Seq for Gene Expression Profiling of Human Necrotizing Enterocolitis: a Pilot Study
    Jung, Kyuwhan
    Koh, InSong
    Kim, Jeong-Hyun
    Cheong, Hyun Sub
    Park, Taejin
    Nam, So Hyun
    Jung, Soo-Min
    Sio, Cherry Ann
    Kim, Su Yeong
    Jung, Euiseok
    Lee, Byoungkook
    Kim, Hye-Rim
    Shin, Eun
    Jung, Sung-Eun
    Choi, Chang Won
    Kim, Beyong Il
    Jung, Eunyoung
    Shin, Hyoung Doo
    JOURNAL OF KOREAN MEDICAL SCIENCE, 2017, 32 (05) : 817 - 824
  • [50] Fully Bayesian Analysis of RNA-seq Counts for the Detection of Gene Expression Heterosis
    Landau, Will
    Niemi, Jarad
    Nettleton, Dan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) : 610 - 621