H3M2: detection of runs of homozygosity from whole-exome sequencing data

被引:91
作者
Magi, Alberto [1 ]
Tattini, Lorenzo [1 ]
Palombo, Flavia [2 ]
Benelli, Matteo [3 ]
Gialluisi, Alessandro [4 ]
Giusti, Betti [1 ]
Abbate, Rosanna [1 ]
Seri, Marco [2 ]
Gensini, Gian Franco [1 ]
Romeo, Giovanni [2 ]
Pippucci, Tommaso [1 ,2 ]
机构
[1] Univ Florence, Dept Expt & Clin Med, I-50019 Florence, Italy
[2] Univ Bologna, Dept Med & Surg Sci, Med Genet Unit, Polyclin St Orsola Malpighi, I-40138 Bologna, Italy
[3] Careggi Hosp, Diagnost Genet Unit, I-50019 Florence, Italy
[4] Max Planck Inst Psycholinguist, Language & Genet Dept, NL-6525 EN Nijmegen, Netherlands
关键词
DNA;
D O I
10.1093/bioinformatics/btu401
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Runs of homozygosity (ROH) are sizable chromosomal stretches of homozygous genotypes, ranging in length from tens of kilobases to megabases. ROHs can be relevant for population and medical genetics, playing a role in predisposition to both rare and common disorders. ROHs are commonly detected by single nucleotide polymorphism (SNP) microarrays, but attempts have been made to use whole-exome sequencing (WES) data. Currently available methods developed for the analysis of uniformly spaced SNP-array maps do not fit easily to the analysis of the sparse and non-uniform distribution of the WES target design. Results: To meet the need of an approach specifically tailored to WES data, we developed (HM2)-M-3, an original algorithm based on heterogeneous hidden Markov model that incorporates inter-marker distances to detect ROH from WES data. We evaluated the performance of H-3 M-2 to correctly identify ROHs on synthetic chromosomes and examined its accuracy in detecting ROHs of different length (short, medium and long) from real 1000 genomes project data. H3M2 turned out to be more accurate than GERMLINE and PLINK, two state-of-the-art algorithms, especially in the detection of short and medium ROHs.
引用
收藏
页码:2852 / 2859
页数:8
相关论文
共 23 条
[1]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[2]  
[Anonymous], 2012, Nature
[3]   Global distribution of genomic diversity underscores rich complex history of continental human populations [J].
Auton, Adam ;
Bryc, Katarzyna ;
Boyko, Adam R. ;
Lohmueller, Kirk E. ;
Novembre, John ;
Reynolds, Andy ;
Indap, Amit ;
Wright, Mark H. ;
Degenhardt, Jeremiah D. ;
Gutenkunst, Ryan N. ;
King, Karen S. ;
Nelson, Matthew R. ;
Bustamante, Carlos D. .
GENOME RESEARCH, 2009, 19 (05) :795-803
[4]   Detecting Identity by Descent and Estimating Genotype Error Rates in Sequence Data [J].
Browning, Brian L. ;
Browning, Sharon R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2013, 93 (05) :840-851
[5]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]   The International HapMap Project [J].
Gibbs, RA ;
Belmont, JW ;
Hardenbol, P ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Ch'ang, LY ;
Huang, W ;
Liu, B ;
Shen, Y ;
Tam, PKH ;
Tsui, LC ;
Waye, MMY ;
Wong, JTF ;
Zeng, CQ ;
Zhang, QR ;
Chee, MS ;
Galver, LM ;
Kruglyak, S ;
Murray, SS ;
Oliphant, AR ;
Montpetit, A ;
Hudson, TJ ;
Chagnon, F ;
Ferretti, V ;
Leboeuf, M ;
Phillips, MS ;
Verner, A ;
Kwok, PY ;
Duan, SH ;
Lind, DL ;
Miller, RD ;
Rice, JP ;
Saccone, NL ;
Taillon-Miller, P ;
Xiao, M ;
Nakamura, Y ;
Sekine, A ;
Sorimachi, K ;
Tanaka, T ;
Tanaka, Y ;
Tsunoda, T ;
Yoshino, E ;
Bentley, DR ;
Deloukas, P ;
Hunt, S ;
Powell, D ;
Altshuler, D ;
Gabriel, SB ;
Qiu, RZ .
NATURE, 2003, 426 (6968) :789-796
[7]   Whole population, genome-wide mapping of hidden relatedness [J].
Gusev, Alexander ;
Lowe, Jennifer K. ;
Stoffel, Markus ;
Daly, Mark J. ;
Altshuler, David ;
Breslow, Jan L. ;
Friedman, Jeffrey M. ;
Pe'er, Itsik .
GENOME RESEARCH, 2009, 19 (02) :318-326
[8]   A Systematic Approach to Mapping Recessive Disease Genes in Individuals from Outbred Populations [J].
Hildebrandt, Friedhelm ;
Heeringa, Saskia F. ;
Rueschendorf, Franz ;
Attanasio, Massimo ;
Nuernberg, Gudrun ;
Becker, Christian ;
Seelow, Dominik ;
Huebner, Norbert ;
Chernin, Gil ;
Vlangos, Christopher N. ;
Zhou, Weibin ;
O'Toole, John F. ;
Hoskins, Bethan E. ;
Wolf, Matthias T. F. ;
Hinkes, Bernward G. ;
Chaib, Hassan ;
Ashraf, Shazia ;
Schoeb, Dominik S. ;
Ovunc, Bugsu ;
Allen, Susan J. ;
Vega-Warner, Virginia ;
Wise, Eric ;
Harville, Heather M. ;
Lyons, Robert H. ;
Washburn, Joseph ;
MacDonald, James ;
Nuernberg, Peter ;
Otto, Edgar A. .
PLOS GENETICS, 2009, 5 (01)
[9]   Detecting autozygosity through runs of homozygosity: A comparison of three autozygosity detection algorithms [J].
Howrigan, Daniel P. ;
Simonson, Matthew A. ;
Keller, Matthew C. .
BMC GENOMICS, 2011, 12
[10]   Genomic Runs of Homozygosity Record Population History and Consanguinity [J].
Kirin, Mirna ;
McQuillan, Ruth ;
Franklin, Christopher S. ;
Campbell, Harry ;
McKeigue, Paul M. ;
Wilson, James F. .
PLOS ONE, 2010, 5 (11)