Transposable elements (TEs) contribute to a large fraction of the expansion of many eukaryotic genomes due to the capability of TEs duplicating themselves through transposition. A first step to understanding the roles of TEs in a eukaryotic genome is to characterize the population-wide variation of TE insertions in the species. Here, we present a maximum-likelihood (ML) method for estimating allele frequencies and detecting selection on TE insertions in a diploid population, based on the genotypes at TE insertion sites detected in multiple individuals sampled from the population using paired-end (PE) sequencing reads. Tests of the method on simulated data show that it can accurately estimate the allele frequencies of TE insertions even when the PE sequencing is conducted at a relatively low coverage (= 5X). The method can also detect TE insertions under strong selection, and the detection ability increases with sample size in a population, although a substantial fraction of actual TE insertions under selection may be undetected. Application of the ML method to genomic sequencing data collected from a natural Daphnia pulex population shows that, on the one hand, most (> 90%) TE insertions present in the reference D. pulex genome are either fixed or nearly fixed (with allele frequencies > 0.95); on the other hand, among the nonreference TE insertions (i.e., those detected in some individuals in the population but absent from the reference genome), the majority (>70%) are still at low frequencies (< 0.1). Finally, we detected a substantial fraction (similar to 9%) of nonreference TE insertions under selection.
机构:
INRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
Museum Natl Hist Nat, CNRS, UMR OSEB, F-75231 Paris, France
Inst Biol Computat, Montpellier, FranceINRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
Leblois, Raphael
Pudlo, Pierre
论文数: 0引用数: 0
h-index: 0
机构:
INRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
Inst Biol Computat, Montpellier, France
Univ Montpellier 2, CNRS, UMR I3M, Montpellier, FranceINRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
Pudlo, Pierre
Neron, Joseph
论文数: 0引用数: 0
h-index: 0
机构:
Museum Natl Hist Nat, CNRS, UMR OSEB, F-75231 Paris, FranceINRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
Neron, Joseph
Bertaux, Francois
论文数: 0引用数: 0
h-index: 0
机构:
Museum Natl Hist Nat, CNRS, UMR OSEB, F-75231 Paris, France
INRIA Paris Rocquencourt, BANG Team, Le Chesnay, FranceINRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
Bertaux, Francois
Beeravolu, Champak Reddy
论文数: 0引用数: 0
h-index: 0
机构:
INRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, FranceINRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
Beeravolu, Champak Reddy
Vitalis, Renaud
论文数: 0引用数: 0
h-index: 0
机构:
INRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
Inst Biol Computat, Montpellier, FranceINRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
Vitalis, Renaud
Rousset, Francois
论文数: 0引用数: 0
h-index: 0
机构:
Inst Biol Computat, Montpellier, France
Univ Montpellier 2, CNRS, UMR ISEM, Montpellier, FranceINRA, UMR CBGP INRA IRD CIRAD Montpellier Supagro UMR 1, F-34060 Montpellier, France
机构:
Univ Calif Los Angeles, Bioinformat Interdept Program, Los Angeles, CA 90024 USAUniv Calif Los Angeles, Bioinformat Interdept Program, Los Angeles, CA 90024 USA
Kessner, Darren
Turner, Thomas L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Santa Barbara, Dept Ecol Evolut & Marine Biol, Santa Barbara, CA 93106 USAUniv Calif Los Angeles, Bioinformat Interdept Program, Los Angeles, CA 90024 USA
Turner, Thomas L.
Novembre, John
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Bioinformat Interdept Program, Los Angeles, CA 90024 USA
Univ Calif Los Angeles, Dept Ecol & Evolutionary Biol, Los Angeles, CA USAUniv Calif Los Angeles, Bioinformat Interdept Program, Los Angeles, CA 90024 USA