Discovering ancestors and connecting relatives in large genomic databases

被引:10
|
作者
Nani, J. P. [1 ,2 ]
Bacheller, L. R. [3 ]
Cole, J. B. [1 ]
VanRaden, P. M. [1 ]
机构
[1] ARS, USDA, Anim Genom & Improvement Lab, Beltsville, MD 20705 USA
[2] Inst Nacl Tecnol Agr, Estn Expt Agr Rafaela, RA-222300 Rafaela, SF, Argentina
[3] Council Dairy Cattle Breeding, Bowie, MD 20716 USA
关键词
ancestry discovery; pedigree; genomics; genotype; SELECTION; IDENTIFICATION; PEDIGREE; MARKERS;
D O I
10.3168/jds.2019-17580
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Genomic evaluation has improved both plant and animal breeding by allowing more accurate estimation of an individual's genetic potential. Because often only a small proportion of the population to be evaluated has been genotyped, genomic estimations rely heavily on complete pedigree information. Confirmation, discovery, and correction of parentage and connected relatives allow the creation of more complete pedigrees, which in turn increase the number of usable phenotypic records and prediction accuracy. Previous methods accounted for parent-progeny conflicts using SNP. More recently haplotype methods allowed discovery of distant relationships such as maternal grandsire (MGS) and maternal great-grandsire (MGGS) with improved accuracy. However, discovered MGS and MGGS often were not used, because no dam information was available to link them to the cAlf. An automated procedure to discover and fill missing maternal identification information was developed, allowing discovered MGS and MGGS to be used in imputation as well as in calculating breeding values for animals in the US dairy cattle database. An MGS was discovered for 295,136 animals with unknown dam, and the MGGS was discovered for 153,909 of these animals. A virtual maternal identification was added for animals with missing information. The effect of pedigree completion on progeny inbreeding, breeding values, and reliabilities was examined. Mean inbreeding of animals with missing maternal pedigree information was 6.69% before and 6.87% after pedigree assignment; expected future inbreeding was 7.24% before and 7.20% after assignment. Reliabilities for traditional breeding values increased from 26.6 to 32.6% for milk yield, 25.9 to 32.0% for fat yield, and 26.9 to 32.9% for protein yield; genomic reliabilities also increased slightly from 76.2 to 77.1% for milk, 76.0 to 76.9% for fat, and 76.3 to 77.3% for protein. The procedure developed for pedigree completion is a useful tool for improving accuracy of national and international evaluations and aiding producers in making better mating decisions.
引用
收藏
页码:1729 / 1734
页数:6
相关论文
共 50 条
  • [1] Discovering causality in large databases
    Zhang, SC
    Zhang, ZG
    APPLIED ARTIFICIAL INTELLIGENCE, 2002, 16 (05) : 333 - 358
  • [2] Discovering Association Rules in Large, Dense Databases
    Teusan, Tudor
    Nachouki, Gilles
    Briand, Henri
    Philippe, Jacques
    LECTURE NOTES IN COMPUTER SCIENCE <D>, 2000, 1910 : 638 - 645
  • [3] Hierarchical analysis for discovering knowledge in large databases
    Pai, WC
    INFORMATION SYSTEMS MANAGEMENT, 2003, 21 (01) : 81 - 88
  • [4] Discovering Human Ancestors
    Wayman, Erin
    SMITHSONIAN, 2012, 42 (09) : 38 - 39
  • [5] Clarifying relatives and ancestors
    Walter Shearer
    Nature, 2011, 470 : 465 - 465
  • [6] Clarifying relatives and ancestors
    Shearer, Walter
    NATURE, 2011, 470 (7335) : 465 - 465
  • [7] An Efficient Approach to Discovering Sequential Patterns in Large Databases
    Yen, Show-Jane
    Cho, Chung-Wen
    LECTURE NOTES IN COMPUTER SCIENCE <D>, 2000, 1910 : 685 - 690
  • [8] An efficient approach to discovering knowledge from large databases
    Yen, SJ
    Chen, ALP
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED INFORMATION SYSTEMS, 1996, : 8 - 18
  • [9] Discovering Association Rules Change from Large Databases
    Ye, Feiyue
    Liu, Jixue
    Qian, Jin
    Shi, Yuxi
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2011, 7002 : 388 - +
  • [10] Discovering representative models in large time series databases
    Rombo, S
    Terracina, G
    FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2004, 3055 : 84 - 97