MULTISCALE POISSON PROCESS APPROACHES FOR DETECTING AND ESTIMATING DIFFERENCES FROM HIGH-THROUGHPUT SEQUENCING ASSAYS

Cited by: 0
Authors
Shim, Heejung [1]
Xing, Zhengrong [2]
Pantaleo, Ester [2]
Luca, Francesca [3, 4]
Pique-Regi, Roger [4, 5]
Stephens, Matthew [6]
Affiliations
[1] Univ Melbourne, Sch Math & Stat & Melbourne Integrat Genom, Melbourne, Australia
[2] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
[3] Wayne State Univ, Dept Obstet & Gynecol, Detroit, MI USA
[4] Wayne State Univ, Ctr Mol Med & Genet, Detroit, MI USA
[5] Wayne State Univ, Ctr Mol Med & Genet, Detroit, MI USA
[6] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
Keywords
Multiscale Poisson processes; wavelets; differential expression analysis; high-throughput sequencing assays; high-resolution; Bayesian inference; functional data; count data; RNA-seq; DNase-seq; ATAC-seq; chromatin accessibility; RNA-SEQ; EXPRESSION ANALYSIS; OPEN CHROMATIN; IN-VIVO; ASSOCIATION
DOI
10.1214/23-AOAS1828
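Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification code
020208; 070103; 0714
Abstract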
Estimating and testing for differences in molecular phenotypes (e.g., gene expression, chromatin accessibility, transcription factor binding) across conditions is an important part of understanding the molecular basis of gene regulation. These phenotypes are commonly measured using high-throughput, high-resolution count data that reflect how the phenotypes vary along the genome. Multiple methods have been proposed to exploit these high-resolution measurements for differential expression analysis. However, they ignore the count nature of the data, instead using normal distributions that work well only for data with large sample sizes or high counts. Here we develop count-based methods to address this problem. We model the data for each sample using an inhomogeneous Poisson process with a spatially structured underlying intensity function and then, building on multiscale models for the Poisson process, estimate and test for differences in the underlying intensity function across samples (or groups of samples). Using both simulation and real ATAC-seq data, we show that our method outperforms previous normal-based methods, especially in situations with small sample sizes or low counts.
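As a rough illustration of the multiscale idea mentioned in the abstract, the sketch below shows the standard recursive decomposition of a count vector: if the counts at each position are independent Poisson variables, then the total is Poisson and, given each parent count, the left-child count is binomial with a proportion set by the underlying intensities. This is not the authors' implementation; the function names `multiscale_decompose` and `left_proportions`, the power-of-two length restriction, and the example counts are all assumptions made for illustration.

```python
def multiscale_decompose(counts):
    """Decompose a count vector (length a power of two) into a total
    count plus, at each scale, (left_child_count, parent_count) pairs.

    Hypothetical helper: levels[0] is the coarsest split, levels[-1] the finest.
    """
    n = len(counts)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a positive power of two")
    levels = []
    current = list(counts)
    while len(current) > 1:
        # Sum adjacent pairs to get the counts at the next coarser scale.
        parents = [current[i] + current[i + 1] for i in range(0, len(current), 2)]
        # Record how each parent count splits into its left child.
        pairs = [(current[i], parents[i // 2]) for i in range(0, len(current), 2)]
        levels.append(pairs)
        current = parents
    levels.reverse()  # coarsest scale first
    return current[0], levels


def left_proportions(levels):
    """Empirical left-child proportions; None where the parent count is zero."""
    return [[(left / parent if parent > 0 else None) for left, parent in level]
            for level in levels]


# Made-up read counts at 8 positions for a single sample.
counts_a = [0, 1, 3, 8, 7, 2, 1, 0]
total, levels = multiscale_decompose(counts_a)
props = left_proportions(levels)
```

Under this decomposition, a difference in the shape of the intensity function between two groups shows up as a difference in binomial proportions at one or more scales and locations, while a difference in overall level shows up in the totals. How those proportions are modeled and tested across samples is the subject of the paper itself and is not reproduced here.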
Pages: 1773-1788
Number of pages: 16