Assessing quality and completeness of human transcriptional regulatory pathways on a genome-wide scale

被引:44
作者
Shmelkov, Evgeny [1 ,2 ]
Tang, Zuojian [1 ]
Aifantis, Iannis [3 ,4 ]
Statnikov, Alexander [1 ,5 ]
机构
[1] NYU, Sch Med, Ctr Hlth Informat & Bioinformat, New York, NY 10012 USA
[2] NYU, Sch Med, Dept Pharmacol, New York, NY USA
[3] NYU, Sch Med, Howard Hughes Med Inst, New York, NY USA
[4] NYU, Sch Med, Dept Pathol, New York, NY USA
[5] NYU, Sch Med, Dept Med, New York, NY USA
关键词
MYC TARGET GENES; T-CELL LEUKEMIA; C-MYC; CHROMATIN IMMUNOPRECIPITATION; PROSTATE-CANCER; FACTOR-BINDING; IDENTIFICATION; DATABASE; NETWORK; GROWTH;
D O I
10.1186/1745-6150-6-15
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Pathway databases are becoming increasingly important and almost omnipresent in most types of biological and translational research. However, little is known about the quality and completeness of pathways stored in these databases. The present study conducts a comprehensive assessment of transcriptional regulatory pathways in humans for seven well-studied transcription factors: MYC, NOTCH1, BCL6, TP53, AR, STAT1, and RELA. The employed benchmarking methodology first involves integrating genome-wide binding with functional gene expression data to derive direct targets of transcription factors. Then the lists of experimentally obtained direct targets are compared with relevant lists of transcriptional targets from 10 commonly used pathway databases. Results: The results of this study show that for the majority of pathway databases, the overlap between experimentally obtained target genes and targets reported in transcriptional regulatory pathway databases is surprisingly small and often is not statistically significant. The only exception is MetaCore pathway database which yields statistically significant intersection with experimental results in 84% cases. Additionally, we suggest that the lists of experimentally derived direct targets obtained in this study can be used to reveal new biological insight in transcriptional regulation and suggest novel putative therapeutic targets in cancer. Conclusions: Our study opens a debate on validity of using many popular pathway databases to obtain transcriptional regulatory targets. We conclude that the choice of pathway databases should be informed by solid scientific evidence and rigorous empirical evaluation. Reviewers: This article was reviewed by Prof. Wing Hung Wong, Dr. Thiago Motta Venancio (nominated by Dr. L Aravind), and Prof. Geoff J McLachlan.
引用
收藏
页数:13
相关论文
共 35 条
[1]   The public road to high-quality curated biological pathways [J].
Adriaens, Michiel E. ;
Jaillard, Magali ;
Waagmeester, Andra ;
Coort, Susan L. M. ;
Pico, Alex R. ;
Evelo, Chris T. A. .
DRUG DISCOVERY TODAY, 2008, 13 (19-20) :856-862
[2]   Molecular pathogenesis of T-cell leukaemia and lymphoma [J].
Aifantis, Iannis ;
Raetz, Elizabeth ;
Buonamici, Silvia .
NATURE REVIEWS IMMUNOLOGY, 2008, 8 (05) :380-390
[3]   Integrated biochemical and computational approach identifies BCL6 direct target genes controlling multiple pathways in normal germinal center B cells [J].
Basso, Katia ;
Saito, Masumichi ;
Sumazin, Pavel ;
Margolin, Adam A. ;
Wang, Kai ;
Lim, Wei-Keat ;
Kitagawa, Yukiko ;
Schneider, Christof ;
Alvarez, Mariano J. ;
Califano, Andrea ;
Dalla-Favera, Riccardo .
BLOOD, 2010, 115 (05) :975-984
[4]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[5]   Oncogenic pathway signatures in human cancers as a guide to targeted therapies [J].
Bild, AH ;
Yao, G ;
Chang, JT ;
Wang, QL ;
Potti, A ;
Chasse, D ;
Joshi, MB ;
Harpole, D ;
Lancaster, JM ;
Berchuck, A ;
Olson, JA ;
Marks, JR ;
Dressman, HK ;
West, M ;
Nevins, JR .
NATURE, 2006, 439 (7074) :353-357
[6]   The HGNC Database in 2008: a resource for the human genome [J].
Bruford, Elspeth A. ;
Lush, Michael J. ;
Wright, Mathew W. ;
Sneddon, Tam P. ;
Povey, Sue ;
Birney, Ewan .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D445-D448
[7]   Novel c-MYC target genes mediate differential effects on cell proliferation and migration [J].
Cappellen, David ;
Schlange, Thomas ;
Bauer, Matthieu ;
Maurer, Francisca ;
Hynes, Nancy E. .
EMBO REPORTS, 2007, 8 (01) :70-76
[8]   Identification of SULF2 as a Novel Transcriptional Target of p53 by Use of Integrated Genomic Analyses [J].
Chau, B. Nelson ;
Diaz, Robert L. ;
Saunders, Matthew A. ;
Cheng, Chun ;
Chang, Aaron N. ;
Warrener, Paul ;
Bradshaw, Jeffrey ;
Linsley, Peter S. ;
Cleary, Michele A. .
CANCER RESEARCH, 2009, 69 (04) :1368-1374
[9]   The BCL6 transcriptional program features repression of multiple oncogenes in primary B cells and is deregulated in DLBCL [J].
Ci, Weimin ;
Polo, Jose M. ;
Cerchietti, Leandro ;
Shaknovich, Rita ;
Wang, Ling ;
Yang, Shao Ning ;
Ye, Kenny ;
Farinha, Pedro ;
Horsman, Douglas E. ;
Gascoyne, Randy D. ;
Elemento, Olivier ;
Melnick, Ari .
BLOOD, 2009, 113 (22) :5536-5548
[10]   The 2010 Nucleic Acids Research Database Issue and online Database Collection: a community of data resources [J].
Cochrane, Guy R. ;
Galperin, Michael Y. .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D1-D4