iTAR: a web server for identifying target genes of transcription factors using ChIP-seq or ChIP-chip data

被引:2
作者
Yang, Chia-Chun [1 ,2 ]
Andrews, Erik H. [8 ]
Chen, Min-Hsuan [2 ,3 ]
Wang, Wan-Yu [2 ]
Chen, Jeremy J. W. [1 ,3 ,4 ]
Gerstein, Mark [5 ,6 ]
Liu, Chun-Chi [2 ,3 ,4 ]
Cheng, Chao [7 ,8 ,9 ]
机构
[1] Natl Chung Hsing Univ, Inst Mol Biol, Taichung 402, Taiwan
[2] Natl Chung Hsing Univ, Inst Genom & Bioinformat, Taichung 402, Taiwan
[3] Natl Chung Hsing Univ, Inst Biomed Sci, Taichung 402, Taiwan
[4] Natl Chung Hsing Univ, Agr Biotechnol Ctr, Taichung 402, Taiwan
[5] Yale Univ, Program Computat Biol & Bioinformat, 260 Whitney Ave, New Haven, CT 06520 USA
[6] Yale Univ, Dept Mol Biophys & Biochem, 260 Whitney Ave, New Haven, CT 06520 USA
[7] Geisel Sch Med Dartmouth, Dept Genet, Hanover, NH 03755 USA
[8] Geisel Sch Med Dartmouth, Inst Quantitat Biomed Sci, Lebanon, NH 03766 USA
[9] Geisel Sch Med Dartmouth, Norris Cotton Canc Ctr, Lebanon, NH 03766 USA
关键词
Transcription factor; ChIP-seq; ChIP-chip; Gaussian mixture model; Gene ontology analysis; EMBRYONIC STEM-CELLS; HUMAN GENOME; DNA; BINDING; EXPRESSION; ELEMENTS;
D O I
10.1186/s12864-016-2963-0
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-seq) or microarray hybridization (ChIP-chip) has been widely used to determine the genomic occupation of transcription factors (TFs). We have previously developed a probabilistic method, called TIP (Target Identification from Profiles), to identify TF target genes using ChIP-seq/ChIP-chip data. To achieve high specificity, TIP applies a conservative method to estimate significance of target genes, with the trade-off being a relatively low sensitivity of target gene identification compared to other methods. Additionally, TIP's output does not render binding-peak locations or intensity, information highly useful for visualization and general experimental biological use, while the variability of ChIP-seq/ChIP-chip file formats has made input into TIP more difficult than desired. Description: To improve upon these facets, here we present are fined TIP with key extensions. First, it implements a Gaussian mixture model for p-value estimation, increasing target gene identification sensitivity and more accurately capturing the shape of TF binding profile distributions. Second, it enables the incorporation of TF binding-peak data by identifying their locations in significant target gene promoter regions and quantifies their strengths. Finally, for full ease of implementation we have incorporated it into a web server (http://syslab3.nchu.edu.tw/iTAR/) that enables flexibility of input file format, can be used across multiple species and genome assembly versions, and is freely available for public use. The web server additionally performs GO enrichment analysis for the identified target genes to reveal the potential function of the corresponding TF. Conclusions: The iTAR web server provides a user-friendly interface and supports target gene identification in seven species, ranging from yeast to human. To facilitate investigating the quality of ChIP-seq/ChIP-chip data, the web server generates the chart of the characteristic binding profiles and the density plot of normalized regulatory scores. The iTAR web server is a useful tool in identifying TF target genes from ChIP-seq/ChIP-chip data and discovering biological insights.
引用
收藏
页数:8
相关论文
共 21 条
[1]   The RNA polymerase II core promoter: a key component in the regulation of gene expression [J].
Butler, JEF ;
Kadonaga, JT .
GENES & DEVELOPMENT, 2002, 16 (20) :2583-2592
[2]   Integration of external signaling pathways with the core transcriptional network in embryonic stem cells [J].
Chen, Xi ;
Xu, Han ;
Yuan, Ping ;
Fang, Fang ;
Huss, Mikael ;
Vega, Vinsensius B. ;
Wong, Eleanor ;
Orlov, Yuriy L. ;
Zhang, Weiwei ;
Jiang, Jianming ;
Loh, Yuin-Han ;
Yeo, Hock Chuan ;
Yeo, Zhen Xuan ;
Narang, Vipin ;
Govindarajan, Kunde Ramamoorthy ;
Leong, Bernard ;
Shahab, Atif ;
Ruan, Yijun ;
Bourque, Guillaume ;
Sung, Wing-Kin ;
Clarke, Neil D. ;
Wei, Chia-Lin ;
Ng, Huck-Hui .
CELL, 2008, 133 (06) :1106-1117
[3]   TIP: A probabilistic method for identifying transcription factor target genes from ChIP-seq binding profiles [J].
Cheng, Chao ;
Min, Renqiang ;
Gerstein, Mark .
BIOINFORMATICS, 2011, 27 (23) :3221-3227
[4]   An integrated encyclopedia of DNA elements in the human genome [J].
Dunham, Ian ;
Kundaje, Anshul ;
Aldred, Shelley F. ;
Collins, Patrick J. ;
Davis, CarrieA. ;
Doyle, Francis ;
Epstein, Charles B. ;
Frietze, Seth ;
Harrow, Jennifer ;
Kaul, Rajinder ;
Khatun, Jainab ;
Lajoie, Bryan R. ;
Landt, Stephen G. ;
Lee, Bum-Kyu ;
Pauli, Florencia ;
Rosenbloom, Kate R. ;
Sabo, Peter ;
Safi, Alexias ;
Sanyal, Amartya ;
Shoresh, Noam ;
Simon, Jeremy M. ;
Song, Lingyun ;
Trinklein, Nathan D. ;
Altshuler, Robert C. ;
Birney, Ewan ;
Brown, James B. ;
Cheng, Chao ;
Djebali, Sarah ;
Dong, Xianjun ;
Dunham, Ian ;
Ernst, Jason ;
Furey, Terrence S. ;
Gerstein, Mark ;
Giardine, Belinda ;
Greven, Melissa ;
Hardison, Ross C. ;
Harris, Robert S. ;
Herrero, Javier ;
Hoffman, Michael M. ;
Iyer, Sowmya ;
Kellis, Manolis ;
Khatun, Jainab ;
Kheradpour, Pouya ;
Kundaje, Anshul ;
Lassmann, Timo ;
Li, Qunhua ;
Lin, Xinying ;
Marinov, Georgi K. ;
Merkel, Angelika ;
Mortazavi, Ali .
NATURE, 2012, 489 (7414) :57-74
[5]   The ENCODE (ENCyclopedia of DNA elements) Project [J].
Feingold, EA ;
Good, PJ ;
Guyer, MS ;
Kamholz, S ;
Liefer, L ;
Wetterstrand, K ;
Collins, FS ;
Gingeras, TR ;
Kampa, D ;
Sekinger, EA ;
Cheng, J ;
Hirsch, H ;
Ghosh, S ;
Zhu, Z ;
Pate, S ;
Piccolboni, A ;
Yang, A ;
Tammana, H ;
Bekiranov, S ;
Kapranov, P ;
Harrison, R ;
Church, G ;
Struhl, K ;
Ren, B ;
Kim, TH ;
Barrera, LO ;
Qu, C ;
Van Calcar, S ;
Luna, R ;
Glass, CK ;
Rosenfeld, MG ;
Guigo, R ;
Antonarakis, SE ;
Birney, E ;
Brent, M ;
Pachter, L ;
Reymond, A ;
Dermitzakis, ET ;
Dewey, C ;
Keefe, D ;
Denoeud, F ;
Lagarde, J ;
Ashurst, J ;
Hubbard, T ;
Wesselink, JJ ;
Castelo, R ;
Eyras, E ;
Myers, RM ;
Sidow, A ;
Batzoglou, S .
SCIENCE, 2004, 306 (5696) :636-640
[6]   The human genome browser at UCSC [J].
Kent, WJ ;
Sugnet, CW ;
Furey, TS ;
Roskin, KM ;
Pringle, TH ;
Zahler, AM ;
Haussler, D .
GENOME RESEARCH, 2002, 12 (06) :996-1006
[7]   E2F4 regulatory program predicts patient survival prognosis in breast cancer [J].
Khaleel, Sari S. ;
Andrews, Erik H. ;
Ung, Matthew ;
DiRenzo, James ;
Cheng, Chao .
BREAST CANCER RESEARCH, 2014, 16 (06)
[8]   Design and analysis of ChIP-seq experiments for DNA-binding proteins [J].
Kharchenko, Peter V. ;
Tolstorukov, Michael Y. ;
Park, Peter J. .
NATURE BIOTECHNOLOGY, 2008, 26 (12) :1351-1359
[9]   Ultrafast and memory-efficient alignment of short DNA sequences to the human genome [J].
Langmead, Ben ;
Trapnell, Cole ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (03)
[10]   Orchestrated response: a symphony of transcription factors for gene control [J].
Lemon, B ;
Tjian, R .
GENES & DEVELOPMENT, 2000, 14 (20) :2551-2569