Comprehensive Characterization of Human Genome Variation by High Coverage Whole-Genome Sequencing of Forty Four Caucasians

被引:39
|
作者
Shen, Hui [1 ,2 ]
Li, Jian [1 ,2 ]
Zhang, Jigang [1 ,2 ]
Xu, Chao [1 ,2 ,3 ]
Jiang, Yan [1 ,2 ,3 ]
Wu, Zikai [1 ,2 ,3 ]
Zhao, Fuping [1 ,2 ]
Liao, Li [1 ,2 ]
Chen, Jun [1 ]
Lin, Yong [3 ]
Tian, Qing [1 ,2 ]
Papasian, Christopher J. [2 ]
Deng, Hong-Wen [1 ,2 ,3 ]
机构
[1] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Biostat & Bioinformat, Ctr Bioinformat & Genom, New Orleans, LA 70118 USA
[2] Univ Missouri, Sch Med, Kansas City, MO 64108 USA
[3] Shanghai Univ Sci & Technol, Ctr Syst Biomed Sci, Shanghai 201800, Peoples R China
来源
PLOS ONE | 2013年 / 8卷 / 04期
基金
美国国家卫生研究院;
关键词
MASSIVELY-PARALLEL DNA; COPY NUMBER VARIATION; OF-FUNCTION VARIANTS; MISSING HERITABILITY;
D O I
10.1371/journal.pone.0059494
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Whole genome sequencing studies are essential to obtain a comprehensive understanding of the vast pattern of human genomic variations. Here we report the results of a high-coverage whole genome sequencing study for 44 unrelated healthy Caucasian adults, each sequenced to over 50-fold coverage (averaging 65.8x). We identified approximately 11 million single nucleotide polymorphisms (SNPs), 2.8 million short insertions and deletions, and over 500,000 block substitutions. We showed that, although previous studies, including the 1000 Genomes Project Phase 1 study, have catalogued the vast majority of common SNPs, many of the low-frequency and rare variants remain undiscovered. For instance, approximately 1.4 million SNPs and 1.3 million short indels that we found were novel to both the dbSNP and the 1000 Genomes Project Phase 1 data sets, and the majority of which (similar to 96%) have a minor allele frequency less than 5%. On average, each individual genome carried similar to 3.3 million SNPs and similar to 492,000 indels/block substitutions, including approximately 179 variants that were predicted to cause loss of function of the gene products. Moreover, each individual genome carried an average of 44 such loss-of-function variants in a homozygous state, which would completely "knock out" the corresponding genes. Across all the 44 genomes, a total of 182 genes were "knocked-out" in at least one individual genome, among which 46 genes were "knocked out" in over 30% of our samples, suggesting that a number of genes are commonly "knocked-out" in general populations. Gene ontology analysis suggested that these commonly "knocked-out" genes are enriched in biological process related to antigen processing and immune response. Our results contribute towards a comprehensive characterization of human genomic variation, especially for less-common and rare variants, and provide an invaluable resource for future genetic studies of human variation and diseases.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Comprehensive characterization of horse genome variation by whole-genome sequencing of 88 horses
    Jagannathan, V.
    Gerber, V.
    Rieder, S.
    Tetens, J.
    Thaller, G.
    Droegemueller, C.
    Leeb, T.
    ANIMAL GENETICS, 2019, 50 (01) : 74 - 77
  • [2] Coverage Bias and Sensitivity of Variant Calling for Four Whole-genome Sequencing Technologies
    Rieber, Nora
    Zapatka, Marc
    Lasitschka, Baerbel
    Jones, David
    Northcott, Paul
    Hutter, Barbara
    Jaeger, Natalie
    Kool, Marcel
    Taylor, Michael
    Lichter, Peter
    Pfister, Stefan
    Wolf, Stephan
    Brors, Benedikt
    Eils, Roland
    PLOS ONE, 2013, 8 (06):
  • [3] Human whole-genome shotgun sequencing
    Weber, JL
    Myers, EW
    GENOME RESEARCH, 1997, 7 (05) : 401 - 409
  • [4] Whole-genome sequencing
    Morris, Huw R.
    Houlden, Henry
    Polke, James
    PRACTICAL NEUROLOGY, 2021, 21 (04) : 322 - +
  • [5] Whole-genome sequencing: identification of additional pathogenic variation across the genome
    Mills, James Dominic
    Sisodiya, Sanjay M.
    BRAIN COMMUNICATIONS, 2021, 3 (04)
  • [6] A whole-genome shotgun approach to human reference genome sequencing
    Morishita, Shinichi
    NATURE REVIEWS GENETICS, 2024, 25 (04) : 236 - 236
  • [7] A whole-genome shotgun approach to human reference genome sequencing
    Shinichi Morishita
    Nature Reviews Genetics, 2024, 25 : 236 - 236
  • [8] Characterization of Uterine Leiomyomas by Whole-Genome Sequencing
    Mehine, Miika
    Kaasinen, Eevi
    Makinen, Netta
    Katainen, Riku
    Kampjarvi, Kati
    Pitkanen, Esa
    Heinonen, Hanna-Riikka
    Butzow, Ralf
    Kilpivaara, Outi
    Kuosmanen, Anna
    Ristolainen, Heikki
    Gentile, Massimiliano
    Sjoberg, Jari
    Vahteristo, Pia
    Aaltonen, Lauri A.
    NEW ENGLAND JOURNAL OF MEDICINE, 2013, 369 (01): : 43 - 53
  • [9] Construction of high coverage whole-genome sequencing libraries from single colon crypts without DNA extraction or whole-genome amplification
    Manojlovic, Zarko
    Wlodarczyk, Jordan
    Okitsu, Cindy
    Jin, Yuxin
    Van Den Berg, David
    Lieber, Michael R.
    Hsieh, Chih-Lin
    BMC RESEARCH NOTES, 2023, 16 (01)
  • [10] Construction of high coverage whole-genome sequencing libraries from single colon crypts without DNA extraction or whole-genome amplification
    Zarko Manojlovic
    Jordan Wlodarczyk
    Cindy Okitsu
    Yuxin Jin
    David Van Den Berg
    Michael R. Lieber
    Chih-Lin Hsieh
    BMC Research Notes, 16