The Rice Genome Knowledgebase (RGKbase): an annotation database for rice comparative genomics and evolutionary biology

被引:17
作者
Wang, Dapeng [1 ]
Xia, Yan [1 ,2 ]
Li, Xinna [1 ]
Hou, Lixia [1 ]
Yu, Jun [1 ]
机构
[1] Chinese Acad Sci, Beijing Inst Genom, CAS Key Lab Genome Sci & Informat, Beijing 100029, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
MULTIPLE SEQUENCE ALIGNMENT; TRANSPOSABLE ELEMENTS; EXPRESSED SEQUENCE; DRAFT SEQUENCE; PREDICTION; PROGRAM; TOOL; EFFICIENT; FEATURES; SEARCH;
D O I
10.1093/nar/gks1225
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Over the past 10 years, genomes of cultivated rice cultivars and their wild counterparts have been sequenced although most efforts are focused on genome assembly and annotation of two major cultivated rice (Oryza sativa L.) subspecies, 93-11 (indica) and Nipponbare (japonica). To integrate information from genome assemblies and annotations for better analysis and application, we now introduce a comparative rice genome database, the Rice Genome Knowledgebase (RGKbase, http://rgkbase.big.ac.cn/RGKbase/). RGKbase is built to have three major components: (i) integrated data curation for rice genomics and molecular biology, which includes genome sequence assemblies, transcriptomic and epigenomic data, genetic variations, quantitative trait loci (QTLs) and the relevant literature; (ii) User-friendly viewers, such as Gbrowse, GeneBrowse and Circos, for genome annotations and evolutionary dynamics and (iii) Bioinformatic tools for compositional and synteny analyses, gene family classifications, gene ontology terms and pathways and gene co-expression networks. RGKbase current includes data from five rice cultivars and species: Nipponbare (japonica), 93-11 (indica), PA64s (indica), the African rice (Oryza glaberrima) and a wild rice species (Oryza brachyantha). We are also constantly introducing new datasets from variety of public efforts, such as two recent releases-sequence data from similar to 1000 rice varieties, which are mapped into the reference genome, yielding ample high-quality single-nucleotide polymorphisms and insertions-deletions.
引用
收藏
页码:D1199 / D1205
页数:7
相关论文
共 69 条
  • [1] Generic eukaryotic core promoter prediction using structural features of DNA
    Abeel, Thomas
    Saeys, Yvan
    Bonnet, Eric
    Rouze, Pierre
    Van de Peer, Yves
    [J]. GENOME RESEARCH, 2008, 18 (02) : 310 - 323
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] [Anonymous], THESIS PENNSYLVANIA
  • [4] Reorganizing the protein space at the Universal Protein Resource (UniProt)
    Apweiler, Rolf
    Martin, Maria Jesus
    O'Donovan, Claire
    Magrane, Michele
    Alam-Faruque, Yasmin
    Antunes, Ricardo
    Casanova, Elisabet Barrera
    Bely, Benoit
    Bingley, Mark
    Bower, Lawrence
    Bursteinas, Borisas
    Chan, Wei Mun
    Chavali, Gayatri
    Da Silva, Alan
    Dimmer, Emily
    Eberhardt, Ruth
    Fazzini, Francesco
    Fedotov, Alexander
    Garavelli, John
    Castro, Leyla Garcia
    Gardner, Michael
    Hieta, Reija
    Huntley, Rachael
    Jacobsen, Julius
    Legge, Duncan
    Liu, Wudong
    Luo, Jie
    Orchard, Sandra
    Patient, Samuel
    Pichler, Klemens
    Poggioli, Diego
    Pontikos, Nikolas
    Pundir, Sangya
    Rosanoff, Steven
    Sawford, Tony
    Sehra, Harminder
    Turner, Edward
    Wardell, Tony
    Watkins, Xavier
    Corbett, Matt
    Donnelly, Mike
    van Rensburg, Pieter
    Goujon, Mickael
    McWilliam, Hamish
    Lopez, Rodrigo
    Xenarios, Ioannis
    Bougueleret, Lydie
    Bridge, Alan
    Poux, Sylvain
    Redaschi, Nicole
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D71 - D75
  • [5] Tandem repeats finder: a program to analyze DNA sequences
    Benson, G
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (02) : 573 - 580
  • [6] The Gene Ontology: enhancements for 2011
    Blake, J. A.
    Dolan, M.
    Drabkin, H.
    Hill, D. P.
    Ni, L.
    Sitnikov, D.
    Burgess, S.
    Buza, T.
    Gresham, C.
    McCarthy, F.
    Pillai, L.
    Wang, H.
    Carbon, S.
    Lewis, S. E.
    Mungall, C. J.
    Gaudet, P.
    Chisholm, R. L.
    Fey, P.
    Kibbe, W. A.
    Basu, S.
    Siegele, D. A.
    McIntosh, B. K.
    Renfro, D. P.
    Zweifel, A. E.
    Hu, J. C.
    Brown, N. H.
    Tweedie, S.
    Alam-Faruque, Y.
    Apweiler, R.
    Auchinchloss, A.
    Axelsen, K.
    Argoud-Puy, G.
    Bely, B.
    Blatter, M. -C.
    Bougueleret, L.
    Boutet, E.
    Branconi-Quintaje, S.
    Breuza, L.
    Bridge, A.
    Browne, P.
    Chan, W. M.
    Coudert, E.
    Cusin, I.
    Dimmer, E.
    Duek-Roggli, P.
    Eberhardt, R.
    Estreicher, A.
    Famiglietti, L.
    Ferro-Rojas, S.
    Feuermann, M.
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D559 - D564
  • [7] DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS
    BOGUSKI, MS
    LOWE, TMJ
    TOLSTOSHEV, CM
    [J]. NATURE GENETICS, 1993, 4 (04) : 332 - 333
  • [8] RetrOryza:: a database of the rice LTR-retrotransposons
    Chaparro, Cristian
    Guyot, Romain
    Zuccolo, Andrea
    Piegu, Benoit
    Panaud, Olivier
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 : D66 - D70
  • [9] Donlin Maureen J, 2007, Curr Protoc Bioinformatics, VChapter 9, DOI 10.1002/0471250953.bi0909s17
  • [10] MUSCLE: multiple sequence alignment with high accuracy and high throughput
    Edgar, RC
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (05) : 1792 - 1797