REHH 2.0: a reimplementation of the R package REHH to detect positive selection from haplotype structure

被引:243
作者
Gautier, Mathieu [1 ,2 ]
Klassmann, Alexander [3 ]
Vitalis, Renaud [1 ,2 ]
机构
[1] INRA, UMR CBGP, F-34988 Montferrier Sur Lez, France
[2] Inst Biol Computat, F-34095 Montpellier, France
[3] Univ Cologne, D-50674 Cologne, Germany
关键词
EHH; footprints of selection; iHS; Rsb; XP-EHH; HUMAN-POPULATIONS; HUMAN GENOME; SCANS; FOOTPRINTS; ADMIXTURE; PROGRAM; SIGNALS;
D O I
10.1111/1755-0998.12634
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Identifying genomic regions with unusually high local haplotype homozygosity represents a powerful strategy to characterize candidate genes responding to natural or artificial positive selection. To that end, statistics measuring the extent of haplotype homozygosity within (e.g. EHH, iHS) and between (Rsb or XP-EHH) populations have been proposed in the literature. The REHH package for R was previously developed to facilitate genome-wide scans of selection, based on the analysis of long-range haplotypes. However, its performance was not sufficient to cope with the growing size of available data sets. Here, we propose a major upgrade of the REHH package, which includes an improved processing of the input files, a faster algorithm to enumerate haplotypes, as well as multithreading. As illustrated with the analysis of large human haplotype data sets, these improvements decrease the computation time by more than one order of magnitude. This new version of REHH will thus allow performing iHS-, Rsb-or XP-EHH-based scans on large data sets. The package REHH 2.0 is available from the CRAN repository (http://cran.r-project.org/web/packages/rehh/index.html) together with help files and a detailed manual.
引用
收藏
页码:78 / 90
页数:13
相关论文
共 24 条
[1]   Integrating common and rare genetic variation in diverse human populations [J].
Altshuler, David M. ;
Gibbs, Richard A. ;
Peltonen, Leena ;
Dermitzakis, Emmanouil ;
Schaffner, Stephen F. ;
Yu, Fuli ;
Bonnen, Penelope E. ;
de Bakker, Paul I. W. ;
Deloukas, Panos ;
Gabriel, Stacey B. ;
Gwilliam, Rhian ;
Hunt, Sarah ;
Inouye, Michael ;
Jia, Xiaoming ;
Palotie, Aarno ;
Parkin, Melissa ;
Whittaker, Pamela ;
Chang, Kyle ;
Hawes, Alicia ;
Lewis, Lora R. ;
Ren, Yanru ;
Wheeler, David ;
Muzny, Donna Marie ;
Barnes, Chris ;
Darvishi, Katayoon ;
Hurles, Matthew ;
Korn, Joshua M. ;
Kristiansson, Kati ;
Lee, Charles ;
McCarroll, Steven A. ;
Nemesh, James ;
Keinan, Alon ;
Montgomery, Stephen B. ;
Pollack, Samuela ;
Price, Alkes L. ;
Soranzo, Nicole ;
Gonzaga-Jauregui, Claudia ;
Anttila, Verneri ;
Brodeur, Wendy ;
Daly, Mark J. ;
Leslie, Stephen ;
McVean, Gil ;
Moutsianas, Loukas ;
Nguyen, Huy ;
Zhang, Qingrun ;
Ghori, Mohammed J. R. ;
McGinnis, Ralph ;
McLaren, William ;
Takeuchi, Fumihiko ;
Grossman, Sharon R. .
NATURE, 2010, 467 (7311) :52-58
[2]  
[Anonymous], 2016, R LANGUAGE ENV STAT
[3]   Sex-dependent dominance at a single locus maintains variation in age at maturity in salmon [J].
Barson, Nicola J. ;
Aykanat, Tutku ;
Hindar, Kjetil ;
Baranski, Matthew ;
Bolstad, Geir H. ;
Fiske, Peder ;
Jacq, Celeste ;
Jensen, Arne J. ;
Johnston, Susan E. ;
Karlsson, Sten ;
Kent, Matthew ;
Oen, Thomas M. ;
Niemelae, Eero ;
Nome, Torfinn ;
Naesje, Tor F. ;
Orell, Panu ;
Romakkaniemi, Atso ;
Saegrov, Harald ;
Urdal, Kurt ;
Erkinaro, Jaakko ;
Lien, Sigbjorn ;
Primmer, Craig R. .
NATURE, 2015, 528 (7582) :405-+
[4]   Using genome-wide measures of coancestry to maintain diversity and fitness in endangered and domestic pig populations [J].
Bosse, Mirte ;
Megens, Hendrik-Jan ;
Madsen, Ole ;
Crooijmans, Richard P. M. A. ;
Ryder, Oliver A. ;
Austerlitz, Frederic ;
Groenen, Martien A. M. ;
de Cara, M. Angeles R. .
GENOME RESEARCH, 2015, 25 (07) :970-981
[5]   Genome-wide genetic marker discovery and genotyping using next-generation sequencing [J].
Davey, John W. ;
Hohenlohe, Paul A. ;
Etter, Paul D. ;
Boone, Jason Q. ;
Catchen, Julian M. ;
Blaxter, Mark L. .
NATURE REVIEWS GENETICS, 2011, 12 (07) :499-510
[6]   On Detecting Incomplete Soft or Hard Selective Sweeps Using Haplotype Structure [J].
Ferrer-Admetlla, Anna ;
Liang, Mason ;
Korneliussen, Thorfinn ;
Nielsen, Rasmus .
MOLECULAR BIOLOGY AND EVOLUTION, 2014, 31 (05) :1275-1291
[7]   Adaptive admixture in the West African bovine hybrid zone: insight from the Borgou population [J].
Flori, Laurence ;
Thevenon, Sophie ;
Dayo, Guiguigbaza-Kossigan ;
Senou, Marcel ;
Sylla, Souleymane ;
Berthier, David ;
Moazami-Goudarzi, Katayoun ;
Gautier, Mathieu .
MOLECULAR ECOLOGY, 2014, 23 (13) :3241-3257
[8]   rehh: an R package to detect footprints of selection in genome-wide SNP data from haplotype structure [J].
Gautier, Mathieu ;
Vitalis, Renaud .
BIOINFORMATICS, 2012, 28 (08) :1176-1177
[9]   Footprints of selection in the ancestral admixture of a New World Creole cattle breed [J].
Gautier, Mathieu ;
Naves, Michel .
MOLECULAR ECOLOGY, 2011, 20 (15) :3128-3143
[10]   Draft genome sequence of an elite Dura palm and whole-genome patterns of DNA variation in oil palm [J].
Jin, Jingjing ;
Lee, May ;
Bai, Bin ;
Sun, Yanwei ;
Qu, Jing ;
Rahmadsyah ;
Alfiko, Yuzer ;
Lim, Chin Huat ;
Suwanto, Antonius ;
Sugiharti, Maria ;
Wong, Limsoon ;
Ye, Jian ;
Chua, Nam-Hai ;
Yue, Gen Hua .
DNA RESEARCH, 2016, 23 (06) :527-533