Discovery of Novel Sequences in 1,000 Swedish Genomes

被引:19
作者
Eisfeldt, Jesper [1 ,2 ,3 ]
Martensson, Gustaf [4 ]
Ameur, Adam [5 ]
Nilsson, Daniel [1 ,2 ,3 ]
Lindstrand, Anna [1 ,3 ]
机构
[1] Karolinska Inst, Ctr Mol Med, Dept Mol Med & Surg, Stockholm, Sweden
[2] Karolinska Inst, Sci Life Lab, Sci Pk, Solna, Sweden
[3] Karolinska Univ Hosp, Dept Clin Genet, Stockholm, Sweden
[4] KTH Royal Inst Technol, Sch Engn Sci Chem Biotechnol & Hlth, Sci Life Lab, Div Nanobiotechnol,Dept Prot Sci, Stockholm, Sweden
[5] Uppsala Univ, Sci Life Lab, Dept Immunol Genet & Pathol, Uppsala, Sweden
基金
瑞典研究理事会;
关键词
population genomics; novel sequences; de novo assembly; ancestral deletion; GENETIC-VARIATION; PROTEIN; DIVERSITY; ALIGNMENT; RESOURCE;
D O I
10.1093/molbev/msz176
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Novel sequences (NSs), not present in the human reference genome, are abundant and remain largely unexplored. Here, we utilize de novo assembly to study NS in 1,000 Swedish individuals first sequenced as part of the SweGen project revealing a total of 46 Mb in 61,044 distinct contigs of sequences not present in GRCh38. The contigs were aligned to recently published catalogs of Icelandic and Pan-African NSs, as well as the chimpanzee genome, revealing a great diversity of shared sequences. Analyzing the positioning of NS across the chimpanzee genome, we find that 2,807 NS align confidently within 143 chimpanzee orthologs of human genes. Aligning the whole genome sequencing data to the chimpanzee genome, we discover ancestral NS common throughout the Swedish population. The NSs were searched for repeats and repeat elements: revealing a majority of repetitive sequence (56%), and enrichment of simple repeats (28%) and satellites (15%). Lastly, we align the unmappable reads of a subset of the thousand genomes data to our collection of NS, as well as the previously published Pan-African NS: revealing that both the Swedish and Pan-African NS are widespread, and that the Swedish NSs are largely a subset of the Pan-African NS. Overall, these results highlight the importance of creating a more diverse reference genome and illustrate that significant amounts of the NS may be of ancestral origin.
引用
收藏
页码:18 / 30
页数:13
相关论文
共 50 条
  • [21] Mapping Retrotransposon LINE-1 Sequences into Two Cebidae Species and Homo sapiens Genomes and a Short Review on Primates
    Milioto, Vanessa
    Perelman, Polina L.
    La Paglia, Laura
    Biltueva, Larisa
    Roelke, Melody
    Dumas, Francesca
    GENES, 2022, 13 (10)
  • [22] Stage at breast cancer diagnosis and distance from diagnostic hospital in a periurban setting: A South African public hospital case series of over 1,000 women
    Dickens, Caroline
    Joffe, Maureen
    Jacobson, Judith
    Venter, Francois
    Schuez, Joachim
    Cubasch, Herbert
    McCormack, Valerie
    INTERNATIONAL JOURNAL OF CANCER, 2014, 135 (09) : 2173 - 2182
  • [23] Discovery and molecular characterization of a novel enamovirus, Grapevine enamovirus-1
    Fagundes Silva, Joao Marcos
    Al Rwahnih, Maher
    Blawid, Rosana
    Nagata, Tatsuya
    Martins Fajardo, Thor Vinicius
    VIRUS GENES, 2017, 53 (04) : 667 - 671
  • [24] Discovery and Binding Studies on a Series of Novel Pin1 Ligands
    Wu, Bainan
    Rega, Michele F.
    Wei, Jun
    Yuan, Hongbin
    Dahl, Russell
    Zhang, Ziming
    Pellecchia, Maurizio
    CHEMICAL BIOLOGY & DRUG DESIGN, 2009, 73 (04) : 369 - 379
  • [25] Discovery of Novel Inhibitors of Amyloid β-Peptide 1-42 Aggregation
    Lopez, Laura C.
    Dos-Reis, Suzana
    Espargaro, Alba
    Carrodeguas, Jose A.
    Maddelein, Marie-Lise
    Ventura, Salvador
    Sancho, Javier
    JOURNAL OF MEDICINAL CHEMISTRY, 2012, 55 (22) : 9521 - 9530
  • [26] Discovery of two novel potyvirus genome sequences by high-throughput RNA sequencing in Aconitum carmichaelii tissue samples
    Choi, Dongjin
    Rai, Megha
    Rai, Amit
    Yamazaki, Mami
    Hahn, Yoonsoo
    ACTA VIROLOGICA, 2023, 67
  • [27] Discovery of novel 7-azaindoles as PDK1 inhibitors
    Wucherer-Plietker, Margarita
    Merkul, Eugen
    Mueller, Thomas J. J.
    Esdar, Christina
    Knoechel, Thorsten
    Heinrich, Timo
    Buchstaller, Hans-Peter
    Greiner, Hartmut
    Dorsch, Dieter
    Finsinger, Dirk
    Calderini, Michel
    Bruge, David
    Graedler, Ulrich
    BIOORGANIC & MEDICINAL CHEMISTRY LETTERS, 2016, 26 (13) : 3073 - 3080
  • [28] Discovery of Western European R1b1a2 Y Chromosome Variants in 1000 Genomes Project Data: An Online Community Approach
    Rocca, Richard A.
    Magoon, Gregory
    Reynolds, David F.
    Krahn, Thomas
    Tilroe, Vincent O.
    Boots, Peter M. Op den Velde
    Grierson, Andrew J.
    PLOS ONE, 2012, 7 (07):
  • [29] The Discovery of Novel PGK1 Activators as Apoptotic Inhibiting and Neuroprotective Agents
    Qiang, Shao-Jia
    Shi, Yu-Qi
    Wu, Tong-Yu
    Wang, Jing-Quan
    Chen, Xue-Lian
    Su, Jie
    Chen, Xin-Ping
    Li, Jia-Zhong
    Chen, Zhe-Sheng
    FRONTIERS IN PHARMACOLOGY, 2022, 13
  • [30] Eight Metagenome-Assembled Genomes Provide Evidence for Microbial Adaptation in 20,000-to 1,000,000-Year-Old Siberian Permafrost
    Sipes, Katie
    Almatari, Abraham
    Eddie, Alexander
    Williams, Daniel
    Spirina, Elena
    Rivkina, Elizaveta
    Liang, Renxing
    Onstott, Tullis C.
    Vishnivetskaya, Tatiana A.
    Lloyd, Karen G.
    APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2021, 87 (19) : 1 - 17