Developing integrated crop knowledge networks to advance candidate gene discovery

被引:32
作者
Hassani-Pak, Keywan [1 ]
Castellote, Martin [1 ,2 ]
Esch, Maria [1 ]
Hindle, Matthew [1 ]
Lysenko, Artem [1 ]
Taubert, Jan [1 ]
Rawlings, Christopher [1 ]
机构
[1] Rothamsted Res, Dept Computat & Syst Biol, Harpenden, Herts, England
[2] INTA EEA Balcarce, Lab Agrobiotechnol, Buenos Aires, DF, Argentina
基金
英国生物技术与生命科学研究理事会;
关键词
Bioinformatics; Knowledge network; Data integration; Gene discovery; Knowledge discovery; crop genomics;
D O I
10.1016/j.atg.2016.10.003
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The chances of raising crop productivity to enhance global food security would be greatly improved if we had a complete understanding of all the biological mechanisms that underpinned traits such as crop yield, disease resistance or nutrient and water use efficiency. With more crop genomes emerging all the time, we are nearer having the basic information, at the gene-level, to begin assembling crop gene catalogues and using data from other plant species to understand how the genes function and how their interactions govern crop development and physiology. Unfortunately, the task of creating such a complete knowledge base of gene functions, interaction networks and trait biology is technically challenging because the relevant data are dispersed in myriad databases in a variety of data formats with variable quality and coverage. In this paper we present a general approach for building genomescale knowledge networks that provide a unified representation of heterogeneous but interconnected datasets to enable effective knowledge mining and gene discovery. We describe the datasets and outline the methods, work flows and tools that we have developed for creating and visualising these networks for the major crop species, wheat and barley. We present the global characteristics of such knowledge networks and with an example linking a seed size phenotype to a barley WRKY transcription factor orthologous to TTG2 from Arabidopsis, we illustrate the value of integrated data in biological knowledge discovery. The software we have developed (www.ondex.org) and the knowledge resources (http://knetminer.rothamsted.ac.uk) we have created are all open-source and provide a first step towards systematic and evidence-based gene discovery in order to facilitate crop improvement. (C) 2016 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license.
引用
收藏
页码:18 / 26
页数:9
相关论文
共 37 条
[1]   The OMA orthology database in 2015: function predictions, better plant support, synteny view and other improvements [J].
Altenhoff, Adrian M. ;
Skunca, Nives ;
Glover, Natasha ;
Train, Clement-Marie ;
Sueki, Anna ;
Pilizota, Ivana ;
Gori, Kevin ;
Tomiczek, Bartlomiej ;
Mueller, Steven ;
Redestig, Henning ;
Gonnet, Gaston H. ;
Dessimoz, Christophe .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D240-D249
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]  
[Anonymous], 2016, KNETMINER WHEAT RELE
[4]   UniProt: a hub for protein information [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Apweiler, Rolf ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Cas-tro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightin-gale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Cowley, Andrew ;
Figueira, Luis ;
Li, Weizhong ;
McWilliam, Hamish .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D204-D212
[5]   Gene Ontology Consortium: going forward [J].
Blake, J. A. ;
Christie, K. R. ;
Dolan, M. E. ;
Drabkin, H. J. ;
Hill, D. P. ;
Ni, L. ;
Sitnikov, D. ;
Burgess, S. ;
Buza, T. ;
Gresham, C. ;
McCarthy, F. ;
Pillai, L. ;
Wang, H. ;
Carbon, S. ;
Dietze, H. ;
Lewis, S. E. ;
Mungall, C. J. ;
Munoz-Torres, M. C. ;
Feuermann, M. ;
Gaudet, P. ;
Basu, S. ;
Chisholm, R. L. ;
Dodson, R. J. ;
Fey, P. ;
Mi, H. ;
Thomas, P. D. ;
Muruganujan, A. ;
Poudel, S. ;
Hu, J. C. ;
Aleksander, S. A. ;
McIntosh, B. K. ;
Renfro, D. P. ;
Siegele, D. A. ;
Attrill, H. ;
Brown, N. H. ;
Tweedie, S. ;
Lomax, J. ;
Osumi-Sutherland, D. ;
Parkinson, H. ;
Roncaglia, P. ;
Lovering, R. C. ;
Talmud, P. J. ;
Humphries, S. E. ;
Denny, P. ;
Campbell, N. H. ;
Foulger, R. E. ;
Chibucos, M. C. ;
Giglio, M. Gwinn ;
Chang, H. Y. ;
Finn, R. .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D1049-D1056
[6]   The Triticeae Toolbox: Combining Phenotype and Genotype Data to Advance Small-Grains Breeding [J].
Blake, Victoria C. ;
Birkett, Clay ;
Matthews, David E. ;
Hane, David L. ;
Bradbury, Peter ;
Jannink, Jean-Luc .
PLANT GENOME, 2016, 9 (02)
[7]  
Canevet C., 2010, NAT PRECEDINGS
[8]   A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome [J].
Chapman, Jarrod A. ;
Mascher, Martin ;
Buluc, Aydin ;
Barry, Kerrie ;
Georganas, Evangelos ;
Session, Adam ;
Strnadova, Veronika ;
Jenkins, Jerry ;
Sehgal, Sunish ;
Oliker, Leonid ;
Schmutz, Jeremy ;
Yelick, Katherine A. ;
Scholz, Uwe ;
Waugh, Robbie ;
Poland, Jesse A. ;
Muehlbauer, Gary J. ;
Stein, Nils ;
Rokhsar, Daniel S. .
GENOME BIOLOGY, 2015, 16
[9]   Ensembl variation resources [J].
Chen, Yuan ;
Cunningham, Fiona ;
Rios, Daniel ;
McLaren, William M. ;
Smith, James ;
Pritchard, Bethan ;
Spudich, Giulietta M. ;
Brent, Simon ;
Kulesha, Eugene ;
Marin-Garcia, Pablo ;
Smedley, Damian ;
Birney, Ewan ;
Flicek, Paul .
BMC GENOMICS, 2010, 11
[10]  
Defoin-Platel Michael, 2011, Asp Appl Biol, V107, P79