Massive gene presence-absence variation shapes an open pan-genome in the Mediterranean mussel

Citations: 133
Authors
Gerdol, Marco [1]
Moreira, Rebeca [2]
Cruz, Fernando [3]
Gomez-Garrido, Jessica [3]
Vlasova, Anna [4]
Rosani, Umberto [5]
Venier, Paola [5]
Naranjo-Ortiz, Miguel A. [6]
Murgarella, Maria [7]
Greco, Samuele [1]
Balseiro, Pablo [8]
Corvelo, Andre [3, 9]
Frias, Leonor [3]
Gut, Marta [3]
Gabaldon, Toni [10, 11, 12]
Pallavicini, Alberto [13]
Canchaya, Carlos [7, 14, 15]
Novoa, Beatriz [2]
Alioto, Tyler S. [3, 6]
Posada, David [7, 14, 15]
Figueras, Antonio [2]
Affiliations
[1] Univ Trieste, Dept Life Sci, Via Licio Giorgieri 5, I-34127 Trieste, Italy
[2] CSIC, Inst Invest Marinas IIM, Eduardo Cabello 6, Vigo 36208, Spain
[3] Barcelona Inst Sci & Technol BIST, Ctr Genom Regulat CRG, CNAG CRG, Baldiri & Reixac 4, Barcelona 08028, Spain
[4] CRG Ctr Genom Regulat, Doctor Aiguader 88, Barcelona 08003, Spain
[5] Univ Padua, Dept Biol, Via Ugo Bassi 58-B, I-35131 Padua, Italy
[6] Univ Pompeu Fabra UPF, Barcelona 08003, Spain
[7] Univ Vigo, Dept Biochem Genet & Immunol, Vigo 36310, Spain
[8] Norce Norwegian Res Ctr AS, Bergen, Norway
[9] New York Genome Ctr, New York, NY 10013 USA
[10] ICREA, Pg Lluis Companys 23, Barcelona 08010, Spain
[11] Barcelona Supercomp Ctr BSC CNS, Barcelona 08034, Spain
[12] Inst Res Biomed IRB, Barcelona 08034, Spain
[13] Anton Dohrn Zool Stn, I-80121 Naples, Italy
[14] Univ Vigo, Biomed Res Ctr CINBIO, Vigo 36310, Spain
[15] Galicia Hlth Res Inst, Vigo 36310, Spain
Funding
European Research Council; EU Horizon 2020
Keywords
Mussel; Bivalve; Pan-genome; Presence-absence variation; Structural variants; Hemizygosity; Dispensable gene; Phylome; Innate immunity; Antimicrobial peptides; INTRON-LENGTH POLYMORPHISM; MOSAIC HYBRID ZONE; MYTILUS-GALLOPROVINCIALIS; STRUCTURAL VARIATION; HIGH-ACCURACY; EDULIS; SEQUENCE; MAP; DIFFERENTIATION; INTROGRESSION;
DOI
10.1186/s13059-020-02180-3
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: The Mediterranean mussel Mytilus galloprovincialis is an ecologically and economically relevant edible marine bivalve, highly invasive and resilient to biotic and abiotic stressors causing recurrent massive mortalities in other bivalves. Although these traits have been recently linked with the maintenance of a high genetic variation within natural populations, the factors underlying the evolutionary success of this species remain unclear.

Results: Here, after the assembly of a 1.28-Gb reference genome and the resequencing of 14 individuals from two independent populations, we reveal a complex pan-genomic architecture in M. galloprovincialis, with a core set of 45,000 genes plus a strikingly high number of dispensable genes (20,000) subject to presence-absence variation, which may be entirely missing in several individuals. We show that dispensable genes are associated with hemizygous genomic regions affected by structural variants, which overall account for nearly 580 Mb of DNA sequence not included in the reference genome assembly. As such, this is the first study to report the widespread occurrence of gene presence-absence variation at a whole-genome scale in the animal kingdom.

Conclusions: Dispensable genes usually belong to young and recently expanded gene families enriched in survival functions, which might be the key to explain the resilience and invasiveness of this species. This unique pan-genome architecture is characterized by dispensable genes in accessory genomic regions that exceed by orders of magnitude those observed in other metazoans, including humans, and closely mirror the open pan-genomes found in prokaryotes and in a few non-metazoan eukaryotes.
Pages: 21