PhytoTypeDB: a database of plant protein inter-cultivar variability and function

被引:1
作者
Necci, Marco [1 ,2 ,3 ]
Piovesan, Damiano [1 ]
Micheletti, Diego [3 ]
Paladin, Lisanna [1 ]
Cestaro, Alessandro [3 ]
Tosatto, Silvio C. E. [1 ,4 ]
机构
[1] Univ Padua, Dept Biomed Sci, Via U Bassi 58-B, I-35131 Padua, Italy
[2] Univ Udine, Dept Agr Sci, Via Palladio 8, I-33100 Udine, Italy
[3] Fdn Edmund Mach, Via E Mach 1, I-38010 San Michele All Adige, Italy
[4] CNR, Inst Neurosci, Via U Bassi 58-B, I-35131 Padua, Italy
来源
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2018年
关键词
PREDICTION; RESOURCE; PLATFORM; GENOMICS; TOOLS;
D O I
10.1093/database/bay125
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Despite a fast-growing number of available plant genomes, available computational resources are poorly integrated and provide only limited access to the underlying data. Most existing databases focus on DNA/RNA data or specific gene families, with less emphasis on protein structure, function and variability. In particular, despite the economic importance of many plant accessions, there are no straightforward ways to retrieve or visualize information on their differences. To fill this gap, we developed PhytoTypeDB (http://phytotypedb.bio.unipd.it/), a scalable database containing plant protein annotations and genetic variants from resequencing of different accessions. The database content is generated by an integrated pipeline, exploiting state-of-the-art methods for protein characterization requiring only the proteome reference sequence and variant calling files. Protein names for unknown proteins are inferred by homology for over 95% of the entries. Single-nucleotide variants are visualized along with protein annotation in a user-friendly web interface. The server offers an effective querying system, which allows to compare variability among different species and accessions, to generate custom data sets based on shared functional features or to perform sequence searches. A documented set of exposed RESTful endpoints make the data accessible programmatically by third-party clients.
引用
收藏
页数:5
相关论文
共 24 条
[1]   1,135 Genomes Reveal the Global Pattern of Polymorphism in Arabidopsis thaliana [J].
Alonso-Blanco, Carlos ;
Andrade, Jorge ;
Becker, Claude ;
Bemm, Felix ;
Bergelson, Joy ;
Borgwardt, Karsten M. ;
Cao, Jun ;
Chae, Eunyoung ;
Dezwaan, Todd M. ;
Ding, Wei ;
Ecker, Joseph R. ;
Exposito-Alonso, Moises ;
Farlow, Ashley ;
Fitz, Joffrey ;
Gan, Xiangchao ;
Grimm, Dominik G. ;
Hancock, Angela M. ;
Henz, Stefan R. ;
Holm, Svante ;
Horton, Matthew ;
Jarsulic, Mike ;
Kerstetter, Randall A. ;
Korte, Arthur ;
Korte, Pamela ;
Lanz, Christa ;
Lee, Cheng-Ruei ;
Meng, Dazhe ;
Michael, Todd P. ;
Mott, Richard ;
Muliyati, Ni Wayan ;
Nagele, Thomas ;
Nagler, Matthias ;
Nizhynska, Viktoria ;
Nordborg, Magnus ;
Novikova, Polina Yu. ;
Pico, F. Xavier ;
Platzer, Alexander ;
Rabanal, Fernando A. ;
Rodriguez, Alex ;
Rowan, Beth A. ;
Salome, Patrice A. ;
Schmid, Karl J. ;
Schmitz, Robert J. ;
Seren, Umit ;
Sperone, Felice Gianluca ;
Sudkamp, Mitchell ;
Svardal, Hannes ;
Tanzer, Matt M. ;
Todd, Donald ;
Volchenboum, Samuel L. .
CELL, 2016, 166 (02) :481-491
[2]   MaizeGDB update: new tools, data and interface for the maize model organism database [J].
Andorf, Carson M. ;
Cannon, Ethalinda K. ;
Portwood, John L., II ;
Gardiner, Jack M. ;
Harper, Lisa C. ;
Schaeffer, Mary L. ;
Braun, Bremen L. ;
Campbell, Darwin A. ;
Vinnakota, Abhinav G. ;
Sribalusu, Venktanaga V. ;
Huerta, Miranda ;
Cho, Kyoung Tak ;
Wimalanathan, Kokulapalan ;
Richter, Jacqueline D. ;
Mauch, Emily D. ;
Rao, Bhavani S. ;
Birkett, Scott M. ;
Sen, Taner Z. ;
Lawrence-Dill, Carolyn J. .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D1195-D1201
[3]   UniProt: the universal protein knowledgebase [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Bye-A-Jee, Hema ;
Cowley, Andrew ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Castro, Leyla Garcia ;
Figueira, Luis ;
Garmiri, Penelope ;
Georghiou, George ;
Gonzalez, Daniel ;
Hatton-Ellis, Emma ;
Li, Weizhong ;
Liu, Wudong ;
Lopez, Rodrigo ;
Luo, Jie ;
Lussi, Yvonne ;
MacDougall, Alistair ;
Nightingale, Andrew ;
Palka, Barbara ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Speretta, Elena ;
Turner, Edward ;
Tyagi, Nidhi ;
Volynkin, Vladimir ;
Wardell, Tony ;
Warner, Kate ;
Watkins, Xavier ;
Zaru, Rossana ;
Zellner, Hermann ;
Xenarios, Ioannis .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D158-D169
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]   Development and validation of the Axiom®Apple480K SNP genotyping array [J].
Bianco, Luca ;
Cestaro, Alessandro ;
Linsmith, Gareth ;
Muranty, Helene ;
Denance, Caroline ;
Theron, Anthony ;
Poncet, Charles ;
Micheletti, Diego ;
Kerschbamer, Emanuela ;
Di Pierro, Erica A. ;
Larger, Simone ;
Pindo, Massimo ;
Van de Weg, Eric ;
Davassi, Alessandro ;
Laurens, Francois ;
Velasco, Riccardo ;
Durel, Charles-Eric ;
Troggio, Michela .
PLANT JOURNAL, 2016, 86 (01) :62-74
[6]   COMPARTMENTS: unification and visualization of protein subcellular localization evidence [J].
Binder, Janos X. ;
Pletscher-Frankild, Sune ;
Tsafou, Kalliopi ;
Stolte, Christian ;
O'Donoghue, Sean I. ;
Schneider, Reinhard ;
Jensen, Lars Juhl .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2014,
[7]   Plant genome sequencing - applications for crop improvement [J].
Bolger, Marie E. ;
Weisshaar, Bernd ;
Scholz, Uwe ;
Stein, Nils ;
Usadel, Bjoern ;
Mayer, Klaus F. X. .
CURRENT OPINION IN BIOTECHNOLOGY, 2014, 26 :31-37
[8]   Expansion of the Gene Ontology knowledgebase and resources [J].
Carbon, S. ;
Dietze, H. ;
Lewis, S. E. ;
Mungall, C. J. ;
Munoz-Torres, M. C. ;
Basu, S. ;
Chisholm, R. L. ;
Dodson, R. J. ;
Fey, P. ;
Thomas, P. D. ;
Mi, H. ;
Muruganujan, A. ;
Huang, X. ;
Poudel, S. ;
Hu, J. C. ;
Aleksander, S. A. ;
McIntosh, B. K. ;
Renfro, D. P. ;
Siegele, D. A. ;
Antonazzo, G. ;
Attrill, H. ;
Brown, N. H. ;
Marygold, S. J. ;
McQuilton, P. ;
Ponting, L. ;
Millburn, G. H. ;
Rey, A. J. ;
Stefancsik, R. ;
Tweedie, S. ;
Falls, K. ;
Schroeder, A. J. ;
Courtot, M. ;
Osumi-Sutherland, D. ;
Parkinson, H. ;
Roncaglia, P. ;
Lovering, R. C. ;
Foulger, R. E. ;
Huntley, R. P. ;
Denny, P. ;
Campbell, N. H. ;
Kramarz, B. ;
Patel, S. ;
Buxton, J. L. ;
Umrao, Z. ;
Deng, A. T. ;
Alrohaif, H. ;
Mitchell, K. ;
Ratnaraj, F. ;
Omer, W. ;
Rodriguez-Lopez, M. .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D331-D338
[9]  
De Leo F, 2002, NUCLEIC ACIDS RES, V30, P347, DOI 10.1093/nar/30.1.347
[10]   InterPro in 2017-beyond protein family and domain annotations [J].
Finn, Robert D. ;
Attwood, Teresa K. ;
Babbitt, Patricia C. ;
Bateman, Alex ;
Bork, Peer ;
Bridge, Alan J. ;
Chang, Hsin-Yu ;
Dosztanyi, Zsuzsanna ;
El-Gebali, Sara ;
Fraser, Matthew ;
Gough, Julian ;
Haft, David ;
Holliday, Gemma L. ;
Huang, Hongzhan ;
Huang, Xiaosong ;
Letunic, Ivica ;
Lopez, Rodrigo ;
Lu, Shennan ;
Marchler-Bauer, Aron ;
Mi, Huaiyu ;
Mistry, Jaina ;
Natale, Darren A. ;
Necci, Marco ;
Nuka, Gift ;
Orengo, Christine A. ;
Park, Youngmi ;
Pesseat, Sebastien ;
Piovesan, Damiano ;
Potter, Simon C. ;
Rawlings, Neil D. ;
Redaschi, Nicole ;
Richardson, Lorna ;
Rivoire, Catherine ;
Sangrador-Vegas, Amaia ;
Sigrist, Christian ;
Sillitoe, Ian ;
Smithers, Ben ;
Squizzato, Silvano ;
Sutton, Granger ;
Thanki, Narmada ;
Thomas, Paul D. ;
Tosatto, Silvio C. E. ;
Wu, Cathy H. ;
Xenarios, Ioannis ;
Yeh, Lai-Su ;
Young, Siew-Yit ;
Mitchell, Alex L. .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D190-D199