De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. 'Catigan Green Dwarf') Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species

Cited by: 33
Authors
Lantican, Darlon V. [1, 2]
Strickler, Susan R. [3]
Canama, Alma O. [1]
Gardoce, Roanne R. [1]
Mueller, Lukas A. [3]
Galvez, Hayde F. [1, 4]
Affiliations
[1] Univ Philippines Los Banos, Coll Agr & Food Sci, Inst Plant Breeding, Genet Lab, Laguna 4031, Philippines
[2] Univ Philippines Syst, Philippine Genome Ctr, Quezon City, Philippines
[3] Boyce Thompson Inst Plant Res, Ithaca, NY 14853 USA
[4] Univ Philippines Los Banos, Coll Agr & Food Sci, Inst Crop Sci, Laguna 4031, Philippines
Source
G3-GENES GENOMES GENETICS | 2019 / Vol. 9 / Issue 08
Keywords
Cocos nucifera L.; dwarf coconut; genome assembly; Illumina MiSeq sequencing; PacBio SMRT sequencing; Dovetail Chicago sequencing; hybrid assembly; SSR and SNP markers; ZINC-FINGER PROTEIN; DISEASE RESISTANCE; TRANSCRIPTION FACTOR; EXPRESSION PROFILES; UNSATURATED FAT; WATER RELATIONS; SALT TOLERANCE; SSR MARKERS; RNA-SEQ; DNA
DOI
10.1534/g3.119.400215
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
We report the first whole genome sequence (WGS) assembly and annotation of a dwarf coconut variety, 'Catigan Green Dwarf' (CATD). The genome sequence was generated using the PacBio SMRT sequencing platform at 15X coverage of the expected genome size of 2.15 Gbp, and was corrected with assembled 50X Illumina paired-end MiSeq reads of the same genome. The draft genome was improved through Chicago sequencing to generate a scaffold assembly with a total size of 2.1 Gbp, consisting of 7,998 scaffolds with an N50 of 570,487 bp. The final assembly covers around 97.6% of the estimated genome size of coconut 'CATD' based on homozygous k-mer peak analysis. A total of 34,958 high-confidence gene models were predicted and functionally associated with various economically important traits, such as pest/disease resistance, drought tolerance, and coconut oil biosynthesis, as well as with putative transcription factors. The assembled genome was used to infer the evolutionary relationships within the palm family based on genomic variations and synteny of coding gene sequences. The data show that at least three rounds of whole genome duplication occurred and are shared by these members of the Arecaceae family. A total of 7,139 unique SSR markers were designed as a resource for marker-based breeding. In addition, we discovered 58,503 variants in coconut by aligning the Hainan Tall (HAT) WGS reads to the non-repetitive regions of the assembled CATD genome. The gene markers and genome-wide SSR markers established here will facilitate the development of varieties with resilience to climate change, resistance to pests and diseases, and improved oil yield and quality.
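For readers unfamiliar with the scaffold N50 statistic quoted in the abstract, the following is a minimal Python sketch of how N50 is conventionally computed from a list of scaffold lengths. It is an illustration only, not the authors' pipeline: the function name `n50` and the toy lengths are hypothetical and are not taken from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together contain at least half of the total assembled bases.
    """
    if not lengths:
        raise ValueError("need at least one length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical toy scaffold lengths (not data from the paper).
toy_scaffolds = [100, 80, 60, 40, 20]
print(n50(toy_scaffolds))
```

Applied to the real assembly, the input would be the lengths of all 7,998 scaffolds; the abstract reports the resulting value as 570,487 bp.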
Pages: 2377-2393
Page count: 17