Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data

被引:1531
作者
Kawahara, Yoshihiro [1 ]
de la Bastide, Melissa [2 ]
Hamilton, John P. [3 ]
Kanamori, Hiroyuki [1 ]
McCombie, W. Richard [2 ]
Ouyang, Shu [4 ]
Schwartz, David C. [5 ]
Tanaka, Tsuyoshi [1 ]
Wu, Jianzhong [1 ]
Zhou, Shiguo [5 ]
Childs, Kevin L. [3 ]
Davidson, Rebecca M. [3 ,6 ]
Lin, Haining [3 ,7 ]
Quesada-Ocampo, Lina [3 ]
Vaillancourt, Brieanne [3 ]
Sakai, Hiroaki [1 ]
Lee, Sung Shin [1 ]
Kim, Jungsok [1 ]
Numa, Hisataka [1 ]
Itoh, Takeshi [1 ]
Buell, C. Robin [3 ]
Matsumoto, Takashi [1 ]
机构
[1] Natl Inst Agrobiol Sci, Agrogen Res Ctr, Tsukuba, Ibaraki 3058602, Japan
[2] CSHL, Cold Spring Harbor, NY 11723 USA
[3] Michigan State Univ, Dept Plant Biol, Plant Biol Labs, E Lansing, MI 48824 USA
[4] Perkin Elmer, Frederick, MD 21701 USA
[5] Univ Wisconsin, Lab Mol & Computat Genom, UW Biotechnol Ctr, Madison, WI 53706 USA
[6] Natl Jewish Hlth, Integrated Ctr Genes Environm & Hlth, Denver, CO USA
[7] Dupont Pioneer, Johnston, IA 50131 USA
基金
美国国家科学基金会;
关键词
Oryza sativa; Nipponbare; Unified rice reference genome; Pseudomolecules; Minimum tiling path; Optical mapping; Genome re-sequencing; Next-generation sequencing; SINGLE-NUCLEOTIDE POLYMORPHISMS; RICE GENOME; ANNOTATION; DNA; ASSOCIATION; RESOLUTION; ALIGNMENT; RESOURCE; JAPONICA; INDICA;
D O I
10.1186/1939-8433-6-4
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
Rice research has been enabled by access to the high quality reference genome sequence generated in 2005 by the International Rice Genome Sequencing Project (IRGSP). To further facilitate genomic-enabled research, we have updated and validated the genome assembly and sequence for the Nipponbare cultivar of Oryza sativa (japonica group). The Nipponbare genome assembly was updated by revising and validating the minimal tiling path of clones with the optical map for rice. Sequencing errors in the revised genome assembly were identified by re-sequencing the genome of two different Nipponbare individuals using the Illumina Genome Analyzer II/IIx platform. A total of 4,886 sequencing errors were identified in 321 Mb of the assembled genome indicating an error rate in the original IRGSP assembly of only 0.15 per 10,000 nucleotides. A small number (five) of insertions/deletions were identified using longer reads generated using the Roche 454 pyrosequencing platform. As the re-sequencing data were generated from two different individuals, we were able to identify a number of allelic differences between the original individual used in the IRGSP effort and the two individuals used in the re-sequencing effort. The revised assembly, termed Os-Nipponbare-Reference-IRGSP-1.0, is now being used in updated releases of the Rice Annotation Project and the Michigan State University Rice Genome Annotation Project, thereby providing a unified set of pseudomolecules for the rice community. A revised, error-corrected, and validated assembly of the Nipponbare cultivar of rice was generated using optical map data, re-sequencing data, and manual curation that will facilitate on-going and future research in rice. Detection of polymorphisms between three different Nipponbare individuals highlights that allelic differences between individuals should be considered in diversity studies.
引用
收藏
页码:3 / 10
页数:10
相关论文
共 27 条
  • [1] A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3
    Cingolani, Pablo
    Platts, Adrian
    Wang, Le Lily
    Coon, Melissa
    Tung Nguyen
    Wang, Luan
    Land, Susan J.
    Lu, Xiangyi
    Ruden, Douglas M.
    [J]. FLY, 2012, 6 (02) : 80 - 92
  • [2] B73-Mo17 Near-Isogenic Lines Demonstrate Dispersed Structural Variation in Maize
    Eichten, Steven R.
    Foerster, Jillian M.
    de Leon, Natalia
    Kai, Ying
    Yeh, Cheng-Ting
    Liu, Sanzhen
    Jeddeloh, Jeffrey A.
    Schnable, Patrick S.
    Kaeppler, Shawn M.
    Springer, Nathan M.
    [J]. PLANT PHYSIOLOGY, 2011, 156 (04) : 1679 - 1690
  • [3] A draft sequence of the rice genome (Oryza sativa L. ssp japonica)
    Goff, SA
    Ricke, D
    Lan, TH
    Presting, G
    Wang, RL
    Dunn, M
    Glazebrook, J
    Sessions, A
    Oeller, P
    Varma, H
    Hadley, D
    Hutchinson, D
    Martin, C
    Katagiri, F
    Lange, BM
    Moughamer, T
    Xia, Y
    Budworth, P
    Zhong, JP
    Miguel, T
    Paszkowski, U
    Zhang, SP
    Colbert, M
    Sun, WL
    Chen, LL
    Cooper, B
    Park, S
    Wood, TC
    Mao, L
    Quail, P
    Wing, R
    Dean, R
    Yu, YS
    Zharkikh, A
    Shen, R
    Sahasrabudhe, S
    Thomas, A
    Cannings, R
    Gutin, A
    Pruss, D
    Reid, J
    Tavtigian, S
    Mitchell, J
    Eldredge, G
    Scholl, T
    Miller, RM
    Bhatnagar, S
    Adey, N
    Rubano, T
    Tusneem, N
    [J]. SCIENCE, 2002, 296 (5565) : 92 - 100
  • [4] Genome-wide association study of flowering time and grain yield traits in a worldwide collection of rice germplasm
    Huang, Xuehui
    Zhao, Yan
    Wei, Xinghua
    Li, Canyang
    Wang, Ahong
    Zhao, Qiang
    Li, Wenjun
    Guo, Yunli
    Deng, Liuwei
    Zhu, Chuanrang
    Fan, Danlin
    Lu, Yiqi
    Weng, Qijun
    Liu, Kunyan
    Zhou, Taoying
    Jing, Yufeng
    Si, Lizhen
    Dong, Guojun
    Huang, Tao
    Lu, Tingting
    Feng, Qi
    Qian, Qian
    Li, Jiayang
    Han, Bin
    [J]. NATURE GENETICS, 2012, 44 (01) : 32 - U53
  • [5] Genome-wide association studies of 14 agronomic traits in rice landraces
    Huang, Xuehui
    Wei, Xinghua
    Sang, Tao
    Zhao, Qiang
    Feng, Qi
    Zhao, Yan
    Li, Canyang
    Zhu, Chuanrang
    Lu, Tingting
    Zhang, Zhiwu
    Li, Meng
    Fan, Danlin
    Guo, Yunli
    Wang, Ahong
    Wang, Lu
    Deng, Liuwei
    Li, Wenjun
    Lu, Yiqi
    Weng, Qijun
    Liu, Kunyan
    Huang, Tao
    Zhou, Taoying
    Jing, Yufeng
    Li, Wei
    Lin, Zhang
    Buckler, Edward S.
    Qian, Qian
    Zhang, Qi-Fa
    Li, Jiayang
    Han, Bin
    [J]. NATURE GENETICS, 2010, 42 (11) : 961 - U76
  • [6] The Sequence Alignment/Map format and SAMtools
    Li, Heng
    Handsaker, Bob
    Wysoker, Alec
    Fennell, Tim
    Ruan, Jue
    Homer, Nils
    Marth, Gabor
    Abecasis, Goncalo
    Durbin, Richard
    [J]. BIOINFORMATICS, 2009, 25 (16) : 2078 - 2079
  • [7] Fast and accurate short read alignment with Burrows-Wheeler transform
    Li, Heng
    Durbin, Richard
    [J]. BIOINFORMATICS, 2009, 25 (14) : 1754 - 1760
  • [8] Function annotation of the rice transcriptome at single-nucleotide resolution by RNA-seq
    Lu, Tingting
    Lu, Guojun
    Fan, Danlin
    Zhu, Chuanrang
    Li, Wei
    Zhao, Qiang
    Feng, Qi
    Zhao, Yan
    Guo, Yunli
    Li, Wenjun
    Huang, Xuehui
    Han, Bin
    [J]. GENOME RESEARCH, 2010, 20 (09) : 1238 - 1249
  • [9] The map-based sequence of the rice genome
    Matsumoto, T
    Wu, JZ
    Kanamori, H
    Katayose, Y
    Fujisawa, M
    Namiki, N
    Mizuno, H
    Yamamoto, K
    Antonio, BA
    Baba, T
    Sakata, K
    Nagamura, Y
    Aoki, H
    Arikawa, K
    Arita, K
    Bito, T
    Chiden, Y
    Fujitsuka, N
    Fukunaka, R
    Hamada, M
    Harada, C
    Hayashi, A
    Hijishita, S
    Honda, M
    Hosokawa, S
    Ichikawa, Y
    Idonuma, A
    Iijima, M
    Ikeda, M
    Ikeno, M
    Ito, K
    Ito, S
    Ito, T
    Ito, Y
    Ito, Y
    Iwabuchi, A
    Kamiya, K
    Karasawa, W
    Kurita, K
    Katagiri, S
    Kikuta, A
    Kobayashi, H
    Kobayashi, N
    Machita, K
    Maehara, T
    Masukawa, M
    Mizubayashi, T
    Mukai, Y
    Nagasaki, H
    Nagata, Y
    [J]. NATURE, 2005, 436 (7052) : 793 - 800
  • [10] Genetic Properties of the Maize Nested Association Mapping Population
    McMullen, Michael D.
    Kresovich, Stephen
    Villeda, Hector Sanchez
    Bradbury, Peter
    Li, Huihui
    Sun, Qi
    Flint-Garcia, Sherry
    Thornsberry, Jeffry
    Acharya, Charlotte
    Bottoms, Christopher
    Brown, Patrick
    Browne, Chris
    Eller, Magen
    Guill, Kate
    Harjes, Carlos
    Kroon, Dallas
    Lepak, Nick
    Mitchell, Sharon E.
    Peterson, Brooke
    Pressoir, Gael
    Romero, Susan
    Rosas, Marco Oropeza
    Salvo, Stella
    Yates, Heather
    Hanson, Mark
    Jones, Elizabeth
    Smith, Stephen
    Glaubitz, Jeffrey C.
    Goodman, Major
    Ware, Doreen
    Holland, James B.
    Buckler, Edward S.
    [J]. SCIENCE, 2009, 325 (5941) : 737 - 740