A Generalized Linear Model for Peak Calling in ChIP-Seq Data

被引:4
|
作者
Xu, Jialin [1 ]
Zhang, Yu [1 ]
机构
[1] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
关键词
generalized linear model; ChIP-Seq; peak calling;
D O I
10.1089/cmb.2012.0023
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) has become a routine for detecting genome-wide protein-DNA interaction. The success of ChIP-Seq data analysis highly depends on the quality of peak calling (i.e., to detect peaks of tag counts at a genomic location and evaluate if the peak corresponds to a real protein-DNA interaction event). The challenges in peak calling include (1) how to combine the forward and the reverse strand tag data to improve the power of peak calling and (2) how to account for the variation of tag data observed across different genomic locations. We introduce a new peak calling method based on the generalized linear model (GLMNB) that utilizes negative binomial distribution to model the tag count data and account for the variation of background tags that may randomly bind to the DNA sequence at varying levels due to local genomic structures and sequence contents. We allow local shifting of peaks observed on the forward and the reverse stands, such that at each potential binding site, a binding profile representing the pattern of a real peak signal is fitted to best explain the observed tag data with maximum likelihood. Our method can also detect multiple peaks within a local region if there are multiple binding sites in the region.
引用
收藏
页码:826 / 838
页数:13
相关论文
共 50 条
  • [1] Normalization, bias correction, and peak calling for ChIP-seq
    Diaz, Aaron
    Park, Kiyoub
    Lim, Daniel A.
    Song, Jun S.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2012, 11 (03)
  • [2] NEXT-peak: a normal-exponential two-peak model for peak-calling in ChIP-seq data
    Kim, Nak-Kyeong
    Jayatillake, Rasika V.
    Spouge, John L.
    BMC GENOMICS, 2013, 14
  • [3] NEXT-peak: a normal-exponential two-peak model for peak-calling in ChIP-seq data
    Nak-Kyeong Kim
    Rasika V Jayatillake
    John L Spouge
    BMC Genomics, 14
  • [4] Features that define the best ChIP-seq peak calling algorithms
    Thomas, Reuben
    Thomas, Sean
    Holloway, Alisha K.
    Pollard, Katherine S.
    BRIEFINGS IN BIOINFORMATICS, 2017, 18 (03) : 441 - 450
  • [5] WACS: improving ChIP-seq peak calling by optimally weighting controls
    Awdeh, Aseel
    Turcotte, Marcel
    Perkins, Theodore J.
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [6] WACS: improving ChIP-seq peak calling by optimally weighting controls
    Aseel Awdeh
    Marcel Turcotte
    Theodore J. Perkins
    BMC Bioinformatics, 22
  • [7] Rapid innovation in ChIP-seq peak-calling algorithms is outdistancing benchmarking efforts
    Szalkowski, Adam M.
    Schmid, Christoph D.
    BRIEFINGS IN BIOINFORMATICS, 2011, 12 (06) : 626 - 633
  • [8] PeakRanger: A cloud-enabled peak caller for ChIP-seq data
    Xin Feng
    Robert Grossman
    Lincoln Stein
    BMC Bioinformatics, 12
  • [9] Peak Finder Metaserver - a novel application for finding peaks in ChIP-seq data
    Kruczyk, Marcin
    Umer, Husen M.
    Enroth, Stefan
    Komorowski, Jan
    BMC BIOINFORMATICS, 2013, 14
  • [10] Peak Finder Metaserver - a novel application for finding peaks in ChIP-seq data
    Marcin Kruczyk
    Husen M Umer
    Stefan Enroth
    Jan Komorowski
    BMC Bioinformatics, 14