Improved reference genome for the domestic horse increases assembly contiguity and composition

Cited by: 160
Authors
Kalbfleisch, Theodore S. [1]
Rice, Edward S. [2]
DePriest, Michael S., Jr. [1]
Walenz, Brian P. [3]
Hestand, Matthew S. [4]
Vermeesch, Joris R. [4]
O'Connell, Brendan L. [2, 16]
Fiddes, Ian T. [2, 5]
Vershinina, Alisa O. [6]
Saremi, Nedda F. [2]
Petersen, Jessica L. [7]
Finno, Carrie J. [8]
Bellone, Rebecca R. [8, 9]
McCue, Molly E. [10]
Brooks, Samantha A. [11]
Bailey, Ernest [12]
Orlando, Ludovic [13, 14]
Greene, Richard E. [2]
Miller, Donald C. [15]
Antczak, Douglas F. [15]
MacLeod, James N. [12]
Affiliations
[1] Univ Louisville, Sch Med, Dept Biochem & Mol Genet, Louisville, KY 40292 USA
[2] UC Santa Cruz, Dept Biomol Engn, Santa Cruz, CA 95064 USA
[3] NHGRI, Genome Informat Sect, Computat & Stat Genom Branch, NIH, Bethesda, MD 20892 USA
[4] Katholieke Univ Leuven, Ctr Human Genet, B-3000 Leuven, Belgium
[5] 10x Genomics Inc, Pleasanton, CA 94566 USA
[6] UC Santa Cruz, Dept Ecol & Evolutionary Biol, Santa Cruz, CA 95064 USA
[7] Univ Nebraska, Dept Anim Sci, Lincoln, NE 68583 USA
[8] Univ Calif Davis, Dept Populat Hlth & Reprod, Davis, CA 95616 USA
[9] Univ Calif Davis, Vet Genet Lab, Davis, CA 95616 USA
[10] Univ Minnesota, Dept Vet Populat Med, St Paul, MN 55108 USA
[11] Univ Florida, UF Genet Inst, Dept Anim Sci, Gainesville, FL 32611 USA
[12] Univ Kentucky, Gluck Equine Res Ctr, Dept Vet Sci, Lexington, KY 40546 USA
[13] Nat Hist Museum Denmark, Ctr GeoGenet, DK-1350 Copenhagen, Denmark
[14] Univ Toulouse, Univ Paul Sabatier, Lab Anthropobiol Mol & Imagerie Synth, CNRS,UMR 5288, Toulouse, France
[15] Cornell Univ, Coll Vet Med, Baker Inst Anim Hlth, Ithaca, NY 14853 USA
[16] Oregon Hlth & Sci Univ, Med & Mol Genet, Portland, OR 97239 USA
Funding
National Institutes of Health (USA)
Keywords
READ ALIGNMENT; MESSENGER-RNA; SEQUENCE; ANNOTATION; GENES
DOI
10.1038/s42003-018-0199-z
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Recent advances in genomic sequencing technology and computational assembly methods have allowed scientists to improve reference genome assemblies in terms of contiguity and composition. EquCab2, a reference genome for the domestic horse, was released in 2007. Although of equal or better quality compared to other first-generation Sanger assemblies, it had many of the shortcomings common to them. In 2014, the equine genomics research community began a project to improve the reference sequence for the horse, building upon the solid foundation of EquCab2 and incorporating new short-read data, long-read data, and proximity ligation data. Here, we present EquCab3. The count of non-N bases in the incorporated chromosomes has increased from 2.33 Gb in EquCab2 to 2.41 Gb in EquCab3. Contiguity has also improved nearly 40-fold, with a contig N50 of 4.5 Mb, and scaffold contiguity has been enhanced to the point where all but one of the 32 chromosomes consists of a single scaffold.
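The abstract reports two summary statistics, the count of non-N bases and the contig N50. As a reading aid, the following is a minimal Python sketch of how such statistics are commonly computed. It is not the authors' pipeline: the function names and the toy scaffold string are hypothetical, and contigs are assumed here to be the stretches of sequence between runs of N within a scaffold.

```python
import re


def n50(lengths):
    """Return the N50 of a list of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    return 0  # empty input


def split_contigs(scaffold):
    """Split a scaffold into contigs at runs of N (gap placeholders).

    Assumption for this sketch: any run of one or more N/n is a gap.
    """
    return [piece for piece in re.split(r"[Nn]+", scaffold) if piece]


def count_non_n(sequence):
    """Count bases that are not gap placeholders."""
    return sum(1 for base in sequence if base not in "Nn")


if __name__ == "__main__":
    # Toy scaffold (hypothetical data, not from EquCab3).
    toy_scaffold = "ACGTNNNACNGGGTTT"
    contigs = split_contigs(toy_scaffold)
    print("contigs:", contigs)
    print("non-N bases:", count_non_n(toy_scaffold))
    print("contig N50:", n50([len(c) for c in contigs]))
```

On a real assembly the same functions would be applied to each chromosome or scaffold read from a FASTA file. A higher contig N50 with a similar total length indicates that the same sequence is represented in fewer, longer gap-free pieces.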
Pages: 8