ChIP-PaM: an algorithm to identify protein-DNA interaction using ChIP-Seq data

被引:14
|
作者
Wu, Song [1 ]
Wang, Jianmin [2 ]
Zhao, Wei [1 ]
Pounds, Stanley [1 ]
Cheng, Cheng [1 ]
机构
[1] St Jude Childrens Res Hosp, Dept Biostat, Memphis, TN 38105 USA
[2] St Jude Childrens Res Hosp, Bioinformat Ctr, Memphis, TN 38105 USA
关键词
BINDING-SITES;
D O I
10.1186/1742-4682-7-18
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: ChIP-Seq is a powerful tool for identifying the interaction between genomic regulators and their bound DNAs, especially for locating transcription factor binding sites. However, high cost and high rate of false discovery of transcription factor binding sites identified from ChIP-Seq data significantly limit its application. Results: Here we report a new algorithm, ChIP-PaM, for identifying transcription factor target regions in ChIP-Seq datasets. This algorithm makes full use of a protein-DNA binding pattern by capitalizing on three lines of evidence: 1) the tag count modelling at the peak position, 2) pattern matching of a specific tag count distribution, and 3) motif searching along the genome. A novel data-based two-step eFDR procedure is proposed to integrate the three lines of evidence to determine significantly enriched regions. Our algorithm requires no technical controls and efficiently discriminates falsely enriched regions from regions enriched by true transcription factor (TF) binding on the basis of ChIP-Seq data only. An analysis of real genomic data is presented to demonstrate our method. Conclusions: In a comparison with other existing methods, we found that our algorithm provides more accurate binding site discovery while maintaining comparable statistical power.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] BayesPeak: Bayesian analysis of ChIP-seq data
    Christiana Spyrou
    Rory Stark
    Andy G Lynch
    Simon Tavaré
    BMC Bioinformatics, 10
  • [42] A Statistical Framework for the Analysis of ChIP-Seq Data
    Kuan, Pei Fen
    Chung, Dongjun
    Pan, Guangjin
    Thomson, James A.
    Stewart, Ron
    Keles, Suenduez
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (495) : 891 - 903
  • [43] CistromeFinder for ChIP-seq and DNase-seq data reuse
    Sun, Hanfei
    Qin, Bo
    Liu, Tao
    Wang, Qixuan
    Liu, Jing
    Wang, Juan
    Lin, Xueqiu
    Yang, Yulin
    Taing, Len
    Rao, Prakash K.
    Brown, Myles
    Zhang, Yong
    Long, Henry W.
    Liu, X. Shirley
    BIOINFORMATICS, 2013, 29 (10) : 1352 - 1354
  • [44] Evaluation of Algorithm Performance in ChIP-Seq Peak Detection
    Wilbanks, Elizabeth G.
    Facciotti, Marc T.
    PLOS ONE, 2010, 5 (07):
  • [45] Identifying ChIP-seq enrichment using MACS
    Jianxing Feng
    Tao Liu
    Bo Qin
    Yong Zhang
    Xiaole Shirley Liu
    Nature Protocols, 2012, 7 : 1728 - 1740
  • [46] Defining bacterial regulons using ChIP-seq
    Myers, Kevin S.
    Park, Dan M.
    Beauchene, Nicole A.
    Kiley, Patricia J.
    METHODS, 2015, 86 : 80 - 88
  • [47] Identifying ChIP-seq enrichment using MACS
    Feng, Jianxing
    Liu, Tao
    Qin, Bo
    Zhang, Yong
    Liu, Xiaole Shirley
    NATURE PROTOCOLS, 2012, 7 (09) : 1728 - 1740
  • [48] Impact of artifact removal on ChIP quality metrics in ChIP-seq and ChIP-exo data
    Carroll, Thomas S.
    Liang, Ziwei
    Salama, Rafik
    Stark, Rory
    de Santiago, Ines
    FRONTIERS IN GENETICS, 2014, 5
  • [49] Inferring direct DNA binding from ChIP-seq
    Bailey, Timothy L.
    Machanick, Philip
    NUCLEIC ACIDS RESEARCH, 2012, 40 (17)
  • [50] iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data
    Chia-Chun Yang
    Erik H. Andrews
    Min-Hsuan Chen
    Wan-Yu Wang
    Jeremy J. W. Chen
    Mark Gerstein
    Chun-Chi Liu
    Chao Cheng
    BMC Genomics, 17