A manually annotated Actinidia chinensis var. chinensis (kiwifruit) genome highlights the challenges associated with draft genomes and gene prediction in plants

Cited by: 174
Authors
Pilkington, Sarah M. [1]
Crowhurst, Ross [1]
Hilario, Elena [1]
Nardozza, Simona [1]
Fraser, Lena [1]
Peng, Yongyan [1,2]
Gunaseelan, Kularajathevan [1]
Simpson, Robert [3]
Tahir, Jibran [3]
Deroles, Simon C. [3]
Templeton, Kerry [1]
Luo, Zhiwei [1]
Davy, Marcus [4]
Cheng, Canhong [1]
McNeilage, Mark [1]
Scaglione, Davide [5]
Liu, Yifei [6]
Zhang, Qiong [7]
Datson, Paul [1]
De Silva, Nihal [1]
Gardiner, Susan E. [3]
Bassett, Heather [3]
Chagne, David [3]
McCallum, John [8]
Dzierzon, Helge [3]
Deng, Cecilia [1]
Wang, Yen-Yi [1]
Barron, Lorna [1]
Manako, Kelvina [1]
Bowen, Judith [1]
Foster, Toshi M. [3]
Erridge, Zoe A. [3]
Tiffin, Heather [3]
Waite, Chethi N. [3]
Davies, Kevin M. [3]
Grierson, Ella P. [3]
Laing, William A. [3]
Kirk, Rebecca [1]
Chen, Xiuyin [1]
Wood, Marion [1]
Montefiori, Mirco [1]
Brummell, David A. [3]
Schwinn, Kathy E. [3]
Catanach, Andrew [8]
Fullerton, Christina [1]
Li, Dawei [7]
Meiyalaghan, Sathiyamoorthy [8]
Nieuwenhuizen, Niels [1]
Read, Nicola [2]
Prakash, Roneel [1]
Affiliations
[1] New Zealand Inst Plant & Food Res Ltd PFR, Private Bag 92169, Auckland 1142, New Zealand
[2] Univ Auckland, Sch Biol Sci, Private Bag 92019, Auckland 1142, New Zealand
[3] PFR, Private Bag 11600, Palmerston North 4442, New Zealand
[4] PFR, 412 1 Rd, Te Puke 3182, Bay Of Plenty, New Zealand
[5] IGA Technol Serv, Parco Sci & Tecnol, Udine, Italy
[6] Chinese Acad Sci, South China Bot Gardens, Guangzhou 510650, Guangdong, Peoples R China
[7] Chinese Acad Sci, Bot Garden, Key Lab Plant Germplasm Enhancement & Specialty A, Wuhan 430074, Peoples R China
[8] PFR, Private Bag 4704, Christchurch 8140, New Zealand
[9] Univ Udine, Dept Agr & Environm Sci, Via Sci 208, I-33100 Udine, Italy
[10] Queensland Univ Technol, Inst Future Environm, Brisbane, Qld 4001, Australia
Keywords
Manual annotation; Genome sequencing; Actinidia chinensis; ALIGNMENT; EVOLUTION; RESOURCE; PROGRAM; APPLE; RNA
DOI
10.1186/s12864-018-4656-3
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongyang' draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models.
Results: A second A. chinensis genome (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed that a considerable number of translocation events have occurred following a whole genome duplication (WGD) event, some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these, 3,114 (9.4%) were identical to a protein in the 'Hongyang' annotation from The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation, this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned 'Hort16A' cDNAs and comparing them with the predicted protein models for Red5 and with both the original 'Hongyang' assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised 'Hongyang' annotation, respectively, compared with 90.9% to the Red5 models.
Conclusions: Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models, enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families, such as the EXPANSIN-like genes. Finally, this high-quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.
Pages: 19