The Rice Genome Knowledgebase (RGKbase): an annotation database for rice comparative genomics and evolutionary biology

被引:17
作者
Wang, Dapeng [1 ]
Xia, Yan [1 ,2 ]
Li, Xinna [1 ]
Hou, Lixia [1 ]
Yu, Jun [1 ]
机构
[1] Chinese Acad Sci, Beijing Inst Genom, CAS Key Lab Genome Sci & Informat, Beijing 100029, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
MULTIPLE SEQUENCE ALIGNMENT; TRANSPOSABLE ELEMENTS; EXPRESSED SEQUENCE; DRAFT SEQUENCE; PREDICTION; PROGRAM; TOOL; EFFICIENT; FEATURES; SEARCH;
D O I
10.1093/nar/gks1225
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Over the past 10 years, genomes of cultivated rice cultivars and their wild counterparts have been sequenced although most efforts are focused on genome assembly and annotation of two major cultivated rice (Oryza sativa L.) subspecies, 93-11 (indica) and Nipponbare (japonica). To integrate information from genome assemblies and annotations for better analysis and application, we now introduce a comparative rice genome database, the Rice Genome Knowledgebase (RGKbase, http://rgkbase.big.ac.cn/RGKbase/). RGKbase is built to have three major components: (i) integrated data curation for rice genomics and molecular biology, which includes genome sequence assemblies, transcriptomic and epigenomic data, genetic variations, quantitative trait loci (QTLs) and the relevant literature; (ii) User-friendly viewers, such as Gbrowse, GeneBrowse and Circos, for genome annotations and evolutionary dynamics and (iii) Bioinformatic tools for compositional and synteny analyses, gene family classifications, gene ontology terms and pathways and gene co-expression networks. RGKbase current includes data from five rice cultivars and species: Nipponbare (japonica), 93-11 (indica), PA64s (indica), the African rice (Oryza glaberrima) and a wild rice species (Oryza brachyantha). We are also constantly introducing new datasets from variety of public efforts, such as two recent releases-sequence data from similar to 1000 rice varieties, which are mapped into the reference genome, yielding ample high-quality single-nucleotide polymorphisms and insertions-deletions.
引用
收藏
页码:D1199 / D1205
页数:7
相关论文
共 69 条
[1]   Generic eukaryotic core promoter prediction using structural features of DNA [J].
Abeel, Thomas ;
Saeys, Yvan ;
Bonnet, Eric ;
Rouze, Pierre ;
Van de Peer, Yves .
GENOME RESEARCH, 2008, 18 (02) :310-323
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
[Anonymous], THESIS PENNSYLVANIA
[4]   Reorganizing the protein space at the Universal Protein Resource (UniProt) [J].
Apweiler, Rolf ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Antunes, Ricardo ;
Casanova, Elisabet Barrera ;
Bely, Benoit ;
Bingley, Mark ;
Bower, Lawrence ;
Bursteinas, Borisas ;
Chan, Wei Mun ;
Chavali, Gayatri ;
Da Silva, Alan ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Fazzini, Francesco ;
Fedotov, Alexander ;
Garavelli, John ;
Castro, Leyla Garcia ;
Gardner, Michael ;
Hieta, Reija ;
Huntley, Rachael ;
Jacobsen, Julius ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
Orchard, Sandra ;
Patient, Samuel ;
Pichler, Klemens ;
Poggioli, Diego ;
Pontikos, Nikolas ;
Pundir, Sangya ;
Rosanoff, Steven ;
Sawford, Tony ;
Sehra, Harminder ;
Turner, Edward ;
Wardell, Tony ;
Watkins, Xavier ;
Corbett, Matt ;
Donnelly, Mike ;
van Rensburg, Pieter ;
Goujon, Mickael ;
McWilliam, Hamish ;
Lopez, Rodrigo ;
Xenarios, Ioannis ;
Bougueleret, Lydie ;
Bridge, Alan ;
Poux, Sylvain ;
Redaschi, Nicole .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D71-D75
[5]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[6]   The Gene Ontology: enhancements for 2011 [J].
Blake, J. A. ;
Dolan, M. ;
Drabkin, H. ;
Hill, D. P. ;
Ni, L. ;
Sitnikov, D. ;
Burgess, S. ;
Buza, T. ;
Gresham, C. ;
McCarthy, F. ;
Pillai, L. ;
Wang, H. ;
Carbon, S. ;
Lewis, S. E. ;
Mungall, C. J. ;
Gaudet, P. ;
Chisholm, R. L. ;
Fey, P. ;
Kibbe, W. A. ;
Basu, S. ;
Siegele, D. A. ;
McIntosh, B. K. ;
Renfro, D. P. ;
Zweifel, A. E. ;
Hu, J. C. ;
Brown, N. H. ;
Tweedie, S. ;
Alam-Faruque, Y. ;
Apweiler, R. ;
Auchinchloss, A. ;
Axelsen, K. ;
Argoud-Puy, G. ;
Bely, B. ;
Blatter, M. -C. ;
Bougueleret, L. ;
Boutet, E. ;
Branconi-Quintaje, S. ;
Breuza, L. ;
Bridge, A. ;
Browne, P. ;
Chan, W. M. ;
Coudert, E. ;
Cusin, I. ;
Dimmer, E. ;
Duek-Roggli, P. ;
Eberhardt, R. ;
Estreicher, A. ;
Famiglietti, L. ;
Ferro-Rojas, S. ;
Feuermann, M. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D559-D564
[7]   DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS [J].
BOGUSKI, MS ;
LOWE, TMJ ;
TOLSTOSHEV, CM .
NATURE GENETICS, 1993, 4 (04) :332-333
[8]   RetrOryza:: a database of the rice LTR-retrotransposons [J].
Chaparro, Cristian ;
Guyot, Romain ;
Zuccolo, Andrea ;
Piegu, Benoit ;
Panaud, Olivier .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D66-D70
[9]  
Donlin Maureen J, 2007, Curr Protoc Bioinformatics, VChapter 9, DOI 10.1002/0471250953.bi0909s17
[10]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797