JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles

被引:749
作者
Mathelier, Anthony [1 ]
Fornes, Oriol [1 ]
Arenillas, David J. [1 ]
Chen, Chih-yu [1 ]
Denay, Gregoire [2 ]
Lee, Jessica [1 ]
Shi, Wenqiang [1 ]
Shyr, Casper [1 ]
Tan, Ge [3 ]
Worsley-Hunt, Rebecca [1 ,6 ]
Zhang, Allen W. [1 ]
Parcy, Francois [2 ]
Lenhard, Boris [3 ]
Sandelin, Albin [4 ,5 ]
Wasserman, Wyeth W. [1 ]
机构
[1] Univ British Columbia, Dept Med Genet, Child & Family Res Inst, Ctr Mol Med & Therapeut, Vancouver, BC V5Z 4H4, Canada
[2] Univ Grenoble Alpes, CNRS, Lab Physiol Cellulaire & Vegetale, CEA,iRTSV,INRA, F-38054 Grenoble, France
[3] Univ London Imperial Coll Sci Technol & Med, MRC, Ctr Clin Sci, Computat Regulatory Genom, Du Cane Rd, London W12 0NN, England
[4] Univ Copenhagen, Bioinformat Ctr, Dept Biol, Ole Maaloes Vej 5, DK-2200 Copenhagen, Denmark
[5] Univ Copenhagen, Biotech Res & Innovat Ctr, Ole Maaloes Vej 5, DK-2200 Copenhagen, Denmark
[6] Max Delbruck Ctr Mol Med, Berlin Inst Med Syst Biol, D-13125 Berlin, Germany
基金
加拿大自然科学与工程研究理事会; 英国医学研究理事会;
关键词
DNA-BINDING; BIOINFORMATICS; SPECIFICITIES; SOFTWARE;
D O I
10.1093/nar/gkv1176
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
JASPAR (http://jaspar.genereg.net) is an open-access database storing curated, non-redundant transcription factor (TF) binding profiles representing transcription factor binding preferences as position frequency matrices for multiple species in six taxonomic groups. For this 2016 release, we expanded the JASPAR CORE collection with 494 new TF binding profiles (315 in vertebrates, 11 in nematodes, 3 in insects, 1 in fungi and 164 in plants) and updated 59 profiles (58 in vertebrates and 1 in fungi). The introduced profiles represent an 83% expansion and 10% update when compared to the previous release. We updated the structural annotation of the TF DNA binding domains (DBDs) following a published hierarchical structural classification. In addition, we introduced 130 transcription factor flexible models trained on ChIP-seq data for vertebrates, which capture dinucleotide dependencies within TF binding sites. This new JASPAR release is accompanied by a new web tool to infer JASPAR TF binding profiles recognized by a given TF protein sequence. Moreover, we provide the users with a Ruby module complementing the JASPAR API to ease programmatic access and use of the JASPAR collection of profiles. Finally, we provide the JASPAR2016 R/Bioconductor data package with the data of this release.
引用
收藏
页码:D110 / D115
页数:6
相关论文
共 28 条
[1]   Diversity and Complexity in DNA Recognition by Transcription Factors [J].
Badis, Gwenael ;
Berger, Michael F. ;
Philippakis, Anthony A. ;
Talukder, Shaheynoor ;
Gehrke, Andrew R. ;
Jaeger, Savina A. ;
Chan, Esther T. ;
Metzler, Genita ;
Vedenko, Anastasia ;
Chen, Xiaoyu ;
Kuznetsov, Hanna ;
Wang, Chi-Fong ;
Coburn, David ;
Newburger, Daniel E. ;
Morris, Quaid ;
Hughes, Timothy R. ;
Bulyk, Martha L. .
SCIENCE, 2009, 324 (5935) :1720-1723
[2]   Inferring direct DNA binding from ChIP-seq [J].
Bailey, Timothy L. ;
Machanick, Philip .
NUCLEIC ACIDS RESEARCH, 2012, 40 (17) :e128
[3]   JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update [J].
Bryne, Jan Christian ;
Valen, Eivind ;
Tang, Man-Hung Eric ;
Marstrand, Troels ;
Winther, Ole ;
da Piedade, Isabelle ;
Krogh, Anders ;
Lenhard, Boris ;
Sandelin, Albin .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D102-D106
[4]   Nucleotides of transcription factor binding sites exert interdependent effects on the binding affinities of transcription factors [J].
Bulyk, ML ;
Johnson, PLF ;
Church, GM .
NUCLEIC ACIDS RESEARCH, 2002, 30 (05) :1255-1261
[5]   bHLH003, bHLH013 and bHLH017 Are New Targets of JAZ Repressors Negatively Regulating JA Responses [J].
Fonseca, Sandra ;
Fernandez-Calvo, Patricia ;
Fernandez, Guillermo M. ;
Diez-Diaz, Monica ;
Gimenez-Ibanez, Selena ;
Lopez-Vidriero, Irene ;
Godoy, Marta ;
Fernandez-Barbero, Gemma ;
Van Leene, Jelle ;
De Jaeger, Geert ;
Manuel Franco-Zorrilla, Jose ;
Solano, Roberto .
PLOS ONE, 2014, 9 (01)
[6]   DNA-binding specificities of plant transcription factors and their potential to define target genes [J].
Franco-Zorrilla, Jose M. ;
Lopez-Vidriero, Irene ;
Carrasco, Jose L. ;
Godoy, Marta ;
Vera, Pablo ;
Solano, Roberto .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (06) :2367-2372
[7]   TFCat: the curated catalog of mouse and human transcription factors [J].
Fulton, Debra L. ;
Sundararajan, Saravanan ;
Badis, Gwenael ;
Hughes, Timothy R. ;
Wasserman, Wyeth W. ;
Roach, Jared C. ;
Sladek, Rob .
GENOME BIOLOGY, 2009, 10 (03)
[8]   Bioconductor: open software development for computational biology and bioinformatics [J].
Gentleman, RC ;
Carey, VJ ;
Bates, DM ;
Bolstad, B ;
Dettling, M ;
Dudoit, S ;
Ellis, B ;
Gautier, L ;
Ge, YC ;
Gentry, J ;
Hornik, K ;
Hothorn, T ;
Huber, W ;
Iacus, S ;
Irizarry, R ;
Leisch, F ;
Li, C ;
Maechler, M ;
Rossini, AJ ;
Sawitzki, G ;
Smith, C ;
Smyth, G ;
Tierney, L ;
Yang, JYH ;
Zhang, JH .
GENOME BIOLOGY, 2004, 5 (10)
[9]   BioRuby: bioinformatics software for the Ruby programming language [J].
Goto, Naohisa ;
Prins, Pjotr ;
Nakao, Mitsuteru ;
Bonnal, Raoul ;
Aerts, Jan ;
Katayama, Toshiaki .
BIOINFORMATICS, 2010, 26 (20) :2617-2619
[10]   A Functional and Evolutionary Perspective on Transcription Factor Binding in Arabidopsis thaliana [J].
Heyndrickx, Ken S. ;
Van de Velde, Jan ;
Wang, Congmao ;
Weigei, Detlef ;
Vandepoele, Klaas .
PLANT CELL, 2014, 26 (10) :3894-3910