Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations
被引:42
|
作者:
Bansal, Vikas
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Pediat, La Jolla, CA 92093 USA
Scripps Translat Sci Inst, La Jolla, CA 92037 USAUniv Calif San Diego, Dept Pediat, La Jolla, CA 92093 USA
Bansal, Vikas
[1
,2
]
Libiger, Ondrej
论文数: 0引用数: 0
h-index: 0
机构:
Scripps Translat Sci Inst, La Jolla, CA 92037 USAUniv Calif San Diego, Dept Pediat, La Jolla, CA 92093 USA
Libiger, Ondrej
[2
]
机构:
[1] Univ Calif San Diego, Dept Pediat, La Jolla, CA 92093 USA
[2] Scripps Translat Sci Inst, La Jolla, CA 92037 USA
Background: Estimation of individual ancestry from genetic data is useful for the analysis of disease association studies, understanding human population history and interpreting personal genomic variation. New, computationally efficient methods are needed for ancestry inference that can effectively utilize existing information about allele frequencies associated with different human populations and can work directly with DNA sequence reads. Results: We describe a fast method for estimating the relative contribution of known reference populations to an individual's genetic ancestry. Our method utilizes allele frequencies from the reference populations and individual genotype or sequence data to obtain a maximum likelihood estimate of the global admixture proportions using the BFGS optimization algorithm. It accounts for the uncertainty in genotypes present in sequence data by using genotype likelihoods and does not require individual genotype data from external reference panels. Simulation studies and application of the method to real datasets demonstrate that our method is significantly times faster than previous methods and has comparable accuracy. Using data from the 1000 Genomes project, we show that estimates of the genome-wide average ancestry for admixed individuals are consistent between exome sequence data and whole-genome low-coverage sequence data. Finally, we demonstrate that our method can be used to estimate admixture proportions using pooled sequence data making it a valuable tool for controlling for population stratification in sequencing based association studies that utilize DNA pooling. Conclusions: Our method is an efficient and versatile tool for estimating ancestry from DNA sequence data and is available from https://sites.google.com/site/vibansal/software/iAdmix.
机构:
Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
Univ Calif Los Angeles, Interdept Program Bioinformat, Los Angeles, CA 90095 USAUniv Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
Yang, Wen-Yun
Hormozdiari, Farhad
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USAUniv Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
Hormozdiari, Farhad
Wang, Zhanyong
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USAUniv Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
Wang, Zhanyong
He, Dan
论文数: 0引用数: 0
h-index: 0
机构:
IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USAUniv Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
He, Dan
Pasaniuc, Bogdan
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Interdept Program Bioinformat, Los Angeles, CA 90095 USA
Univ Calif Los Angeles, Dept Pathol & Lab Med, Los Angeles, CA 90095 USA
Univ Calif Los Angeles, Jonsson Comprehens Canc Ctr, Los Angeles, CA 90095 USAUniv Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
Pasaniuc, Bogdan
Eskin, Eleazar
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
Univ Calif Los Angeles, Interdept Program Bioinformat, Los Angeles, CA 90095 USA
Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USAUniv Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
机构:
Univ Chicago, Dept Human Genet, Chicago, IL 60637 USAUniv Chicago, Dept Human Genet, Chicago, IL 60637 USA
Pique-Regi, Roger
Degner, Jacob F.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Chicago, Dept Human Genet, Chicago, IL 60637 USA
Univ Chicago, Comm Genet Genom & Syst Biol, Chicago, IL 60637 USAUniv Chicago, Dept Human Genet, Chicago, IL 60637 USA
Degner, Jacob F.
Pai, Athma A.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Chicago, Dept Human Genet, Chicago, IL 60637 USAUniv Chicago, Dept Human Genet, Chicago, IL 60637 USA
Pai, Athma A.
Gaffney, Daniel J.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Chicago, Dept Human Genet, Chicago, IL 60637 USA
Univ Chicago, Howard Hughes Med Inst, Chicago, IL 60637 USAUniv Chicago, Dept Human Genet, Chicago, IL 60637 USA
Gaffney, Daniel J.
Gilad, Yoav
论文数: 0引用数: 0
h-index: 0
机构:
Univ Chicago, Dept Human Genet, Chicago, IL 60637 USAUniv Chicago, Dept Human Genet, Chicago, IL 60637 USA
Gilad, Yoav
Pritchard, Jonathan K.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Chicago, Dept Human Genet, Chicago, IL 60637 USA
Univ Chicago, Howard Hughes Med Inst, Chicago, IL 60637 USAUniv Chicago, Dept Human Genet, Chicago, IL 60637 USA
机构:
Seoul Natl Univ, Boramae Hosp, Dept Lab Med, Seoul 156707, South Korea
Seoul Natl Univ, Coll Med, Dept Lab Med, Seoul 156707, South Korea
Seoul Metropolitan Publ Cord Blood Bank, Seoul, South KoreaSeoul Natl Univ, Boramae Hosp, Dept Lab Med, Seoul 156707, South Korea
Yoon, J. H.
Shin, S.
论文数: 0引用数: 0
h-index: 0
机构:
Seoul Natl Univ, Boramae Hosp, Dept Lab Med, Seoul 156707, South Korea
Seoul Natl Univ, Coll Med, Dept Lab Med, Seoul 156707, South Korea
Seoul Metropolitan Publ Cord Blood Bank, Seoul, South KoreaSeoul Natl Univ, Boramae Hosp, Dept Lab Med, Seoul 156707, South Korea
Shin, S.
Park, M. H.
论文数: 0引用数: 0
h-index: 0
机构:
Seoul Natl Univ, Coll Med, Dept Lab Med, Seoul 156707, South KoreaSeoul Natl Univ, Boramae Hosp, Dept Lab Med, Seoul 156707, South Korea
Park, M. H.
Song, E. Y.
论文数: 0引用数: 0
h-index: 0
机构:
Seoul Natl Univ, Coll Med, Dept Lab Med, Seoul 156707, South KoreaSeoul Natl Univ, Boramae Hosp, Dept Lab Med, Seoul 156707, South Korea
Song, E. Y.
Roh, E. Y.
论文数: 0引用数: 0
h-index: 0
机构:
Seoul Natl Univ, Boramae Hosp, Dept Lab Med, Seoul 156707, South Korea
Seoul Natl Univ, Coll Med, Dept Lab Med, Seoul 156707, South Korea
Seoul Metropolitan Publ Cord Blood Bank, Seoul, South KoreaSeoul Natl Univ, Boramae Hosp, Dept Lab Med, Seoul 156707, South Korea