Detection and validation of structural variations in bovine whole-genome sequence data

被引:28
作者
Chen, Long [1 ,2 ]
Chamberlain, Amanda J. [1 ]
Reich, Coralie M. [1 ]
Daetwyler, Hans D. [1 ,2 ]
Hayes, Ben J. [1 ,2 ]
机构
[1] AgriBio, Ctr AgriBiosci, Biosci Res, Dept Econ Dev Jobs Transport & Resources, Bundoora, Vic, Australia
[2] La Trobe Univ, Sch Appl Syst Biol, Bundoora, Vic, Australia
关键词
COPY NUMBER VARIATION; GENE FAMILY; CATTLE; RETROTRANSPOSITION; EXPRESSION; ALIGNMENT; CNVS;
D O I
10.1186/s12711-017-0286-5
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Background: Several examples of structural variation (SV) affecting phenotypic traits have been reported in cattle. Currently the identification of SV from whole-genome sequence data (WGS) suffers from a high false positive rate. Our aim was to construct a high quality set of SV calls in cattle using WGS data. First, we tested two SV detection programs, Breakdancer and Pindel, and the overlap of these methods, on simulated sequence data to determine their precision and sensitivity. We then identified population SV from WGS of 252 Holstein and 64 Jersey bulls based on the overlapping calls from the two programs. In addition, we validated an overlapped SV set in 28 twice-sequenced Holstein individuals, and in another two validated sets (one for each breed) that were transmitted from sire to son. We also tested whether highly conserved gene sets across eukaryotes and recently expanded gene families in bovine were depleted and enriched, respectively, for SV. Results: In empirical WGS data, 17,518 SV covering 27.36 Mb were found in the Holstein population and 4285 SV covering 8.74 Mb in the Jersey population, of which 4.62 Mb of SV overlapped between Holsteins and Jerseys. A total of 11,534 candidate SV covering 5.64 Mb were validated in the 28 twice-sequenced individuals, while 3.49 and 0.67 Mb of SV were validated from Holstein and Jersey sire-son transmission, respectively. Only eight of 237 core eukaryotic genes had at least a 50-bp overlap with an SV from our validated sets, suggesting that conserved genes are depleted for SV (p < 0.05). In addition, we observed that recently expanded gene families were significantly more associated with SV than other genes. Long interspersed nuclear elements-1 were enriched for deletions when compared to the rest of the genome (p = 0.0035). Conclusions: We reported SV from 252 Holstein and 64 Jersey individuals. A considerable proportion of Jersey population SV (53.5%) were also found in Holstein. In contrast, about 76.90% sire-son transmission validated SV were present in Jerseys and Holsteins. The enrichment of SV in expanding gene families suggests that SV can be a source of genetic variation for evolution.
引用
收藏
页数:13
相关论文
共 50 条
[1]   CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing [J].
Abyzov, Alexej ;
Urban, Alexander E. ;
Snyder, Michael ;
Gerstein, Mark .
GENOME RESEARCH, 2011, 21 (06) :974-984
[2]   APPLICATIONS OF NEXT-GENERATION SEQUENCING Genome structural variation discovery and genotyping [J].
Alkan, Can ;
Coe, Bradley P. ;
Eichler, Evan E. .
NATURE REVIEWS GENETICS, 2011, 12 (05) :363-375
[3]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[4]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[5]  
[Anonymous], 2012, Nature
[6]  
[Anonymous], P 23 PLANT AN GEN M
[7]   RSVSim: an R/Bioconductor package for the simulation of structural variations [J].
Bartenhagen, Christoph ;
Dugas, Martin .
BIOINFORMATICS, 2013, 29 (13) :1679-1681
[8]   LINE-1 Elements in Structural Variation and Disease [J].
Beck, Christine R. ;
Luis Garcia-Perez, Jose ;
Badge, Richard M. ;
Moran, John V. .
ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 12, 2011, 12 :187-215
[9]   LINE-1 Retrotransposition Activity in Human Genomes [J].
Beck, Christine R. ;
Collier, Pamela ;
Macfarlane, Catriona ;
Malig, Maika ;
Kidd, Jeffrey M. ;
Eichler, Evan E. ;
Badge, Richard M. ;
Moran, John V. .
CELL, 2010, 141 (07) :1159-U110
[10]   The challenges and importance of structural variation detection in livestock [J].
Bickhart, Derek M. ;
Liu, George E. .
FRONTIERS IN GENETICS, 2014, 5