Using Runs of Homozygosity and Machine Learning to Disentangle Sources of Inbreeding and Infer Self-Fertilization Rates

Times cited: 3
Authors
Zeitler, Leo [1]
Gilbert, Kimberly J. [1]
Affiliations
[1] Univ Fribourg, Dept Biol, Chemin Musee 10, CH-1700 Fribourg, Switzerland
Funding
Swiss National Science Foundation
Keywords
runs of homozygosity; inbreeding; self-fertilization; outcrossing rate; demographic history; mating system; random forest; MATING SYSTEM; NUCLEOTIDE POLYMORPHISM; POPULATION-STRUCTURE; BREEDING-SYSTEM; GENOMES REVEAL; ARABIS-ALPINA; ARABIDOPSIS; SELECTION; GENETICS; MODELS;
DOI
10.1093/gbe/evae139
Chinese Library Classification number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Runs of homozygosity (ROHs) are indicative of elevated homozygosity and inbreeding due to mating of closely related individuals. Self-fertilization can be a major source of inbreeding which elevates genome-wide homozygosity and thus should also create long ROHs. While ROHs are frequently used to understand inbreeding in the context of conservation and selective breeding, as well as for consanguinity of populations and their demographic history, it remains unclear how ROH characteristics are altered by selfing and if this confounds expected signatures of inbreeding due to demographic change. Using simulations, we study the impact of the mode of reproduction and demographic history on ROHs. We apply random forests to identify unique characteristics of ROHs, indicative of different sources of inbreeding. We pinpoint distinct features of ROHs that can be used to better characterize the type of inbreeding the population was subjected to and to predict outcrossing rates and complex demographic histories. Using additional simulations and four empirical datasets, two from highly selfing species and two from mixed-maters, we predict the selfing rate and validate our estimations. We find that self-fertilization rates are successfully identified even with complex demography. Population genetic summary statistics improve algorithm accuracy particularly in the presence of additional inbreeding, e.g. from population bottlenecks. Our findings highlight the importance of ROHs in disentangling confounding factors related to various sources of inbreeding and demonstrate situations where such sources cannot be differentiated. Additionally, our random forest models provide a novel tool to the community for inferring selfing rates using genomic data.
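To make the central quantity of the abstract concrete, the following is a minimal sketch of how ROHs might be located along one chromosome and summarized as the fraction of the genome they cover (often written F_ROH). It is not the authors' pipeline: the function names `find_rohs` and `f_roh`, the input layout, and the zero-heterozygote rule are assumptions made for illustration. Practical ROH callers usually tolerate some heterozygous or missing sites, and the ROH features fed to the random forests are not specified in this record.

```python
def find_rohs(positions, het_flags, min_length_bp):
    """Return (start, end) pairs for stretches of consecutive homozygous sites.

    positions     -- sorted site coordinates in base pairs (hypothetical input)
    het_flags     -- 1 if the individual is heterozygous at that site, else 0
    min_length_bp -- shortest stretch that is reported as a ROH
    Assumption: a single heterozygous site always ends a run.
    """
    runs = []
    start = None  # coordinate of the first site in the current run
    prev = None   # coordinate of the last homozygous site seen
    for pos, is_het in zip(positions, het_flags):
        if is_het:
            # A heterozygous site closes the current run, if one is open.
            if start is not None and prev - start >= min_length_bp:
                runs.append((start, prev))
            start = None
        else:
            if start is None:
                start = pos
            prev = pos
    # Close a run that extends to the last site.
    if start is not None and prev - start >= min_length_bp:
        runs.append((start, prev))
    return runs


def f_roh(runs, genome_length_bp):
    """Fraction of the genome covered by the reported runs."""
    return sum(end - start for start, end in runs) / genome_length_bp


# Toy data: eight sites, one heterozygous site at position 400.
positions = [100, 200, 300, 400, 500, 600, 700, 800]
het_flags = [0, 0, 0, 1, 0, 0, 0, 0]
runs = find_rohs(positions, het_flags, min_length_bp=150)
coverage = f_roh(runs, genome_length_bp=1000)
```

In this toy case the single heterozygous site splits the chromosome into two runs. Under high selfing one would expect fewer such breaks and therefore longer runs, which is the signal the abstract proposes to exploit.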
Pages: 16