BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data

Cited by: 442
Authors
Narasimhan, Vagheesh [1]
Danecek, Petr [1]
Scally, Aylwyn [2]
Xue, Yali [1]
Tyler-Smith, Chris [1]
Durbin, Richard [1]
Affiliations
[1] Wellcome Trust Sanger Inst, Hinxton, England
[2] Univ Cambridge, Dept Genet, Downing St, Cambridge CB2 3EH, England
Funding
Wellcome Trust (UK);
Keywords
HOMOZYGOSITY; RUNS;
DOI
10.1093/bioinformatics/btw044
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Summary: Runs of homozygosity (RoHs) are genomic stretches of a diploid genome that show identical alleles on both chromosomes. Longer RoHs are unlikely to have arisen by chance but are likely to denote autozygosity, whereby both copies of the genome descend from the same recent ancestor. Early tools to detect RoH used genotype array data, but substantially more information is available from sequencing data. Here, we present and evaluate BCFtools/RoH, an extension to the BCFtools software package, that detects regions of autozygosity in sequencing data, in particular exome data, using a hidden Markov model. By applying it to simulated data and real data from the 1000 Genomes Project we estimate its accuracy and show that it has higher sensitivity and specificity than existing methods under a range of sequencing error rates and levels of autozygosity.
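To make the idea of detecting autozygosity with a hidden Markov model concrete, here is a minimal sketch of a generic two-state HMM decoded with the Viterbi algorithm. It is not the BCFtools/RoH implementation. The function name `viterbi_roh`, the state labels, the use of hard homozygous/heterozygous calls as observations, the constant per-site switch probability `p_switch`, and the error rate `err` are all assumptions made for illustration.

```python
import math


def viterbi_roh(is_het, alt_freq, err=0.01, p_switch=0.001):
    """Label each site 'AZ' (autozygous) or 'HW' (not autozygous).

    is_het   -- list of bools, True where the called genotype is heterozygous
    alt_freq -- list of alternate-allele frequencies, one per site
    err      -- assumed chance of seeing a heterozygous call inside an AZ region
    p_switch -- assumed constant per-site probability of changing state
    """
    states = ("AZ", "HW")

    def log_emit(state, het, p):
        if state == "AZ":
            p_het = err  # a het call in an autozygous region is treated as an error
        else:
            # Hardy-Weinberg heterozygosity, clamped away from 0 and 1
            p_het = min(max(2.0 * p * (1.0 - p), err), 1.0 - err)
        return math.log(p_het if het else 1.0 - p_het)

    def log_trans(prev, cur):
        return math.log(1.0 - p_switch) if prev == cur else math.log(p_switch)

    n = len(is_het)
    if n == 0:
        return []

    # Initialise with a uniform prior over the two states.
    score = {s: math.log(0.5) + log_emit(s, is_het[0], alt_freq[0]) for s in states}
    back = []  # back[i - 1][s] is the best predecessor of state s at site i

    for i in range(1, n):
        new_score, pointers = {}, {}
        for s in states:
            best_prev = max(states, key=lambda r: score[r] + log_trans(r, s))
            pointers[s] = best_prev
            new_score[s] = (score[best_prev] + log_trans(best_prev, s)
                            + log_emit(s, is_het[i], alt_freq[i]))
        score = new_score
        back.append(pointers)

    # Trace back from the best final state.
    path = [max(states, key=lambda s: score[s])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


# Toy input: a long homozygous stretch flanked by regions with frequent het calls.
flank = [True, False] * 25
calls = flank + [False] * 200 + flank
labels = viterbi_roh(calls, [0.5] * len(calls))
```

In this toy input the long homozygous stretch is labelled `AZ` and the flanking regions `HW`. A tool working from sequencing data would more plausibly use genotype likelihoods and a position-dependent transition probability; both are deliberately simplified here.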
Pages: 1749-1751
Number of pages: 3