An ethnically relevant consensus Korean reference genome is a step towards personal reference genomes

被引:48
作者
Cho, Yun Sung [1 ,2 ,3 ]
Kim, Hyunho [4 ]
Kim, Hak-Min [1 ,2 ]
Jho, Sungwoong [3 ]
Jun, JeHoon [3 ,4 ]
Lee, Yong Joo [4 ]
Chae, Kyun Shik [5 ]
Kim, Chang Geun [5 ]
Kim, Sangsoo [6 ]
Eriksson, Anders [7 ]
Edwards, Jeremy S. [8 ]
Lee, Semin [1 ,2 ]
Kim, Byung Chul [1 ,2 ]
Manica, Andrea [7 ]
Oh, Tae-Kwang [9 ]
Church, George M. [10 ]
Bhak, Jong [1 ,2 ,3 ,4 ]
机构
[1] UNIST, TGI, Ulsan 44919, South Korea
[2] UNIST, Sch Life Sci, Dept Biomed Engn, Ulsan 44919, South Korea
[3] Genome Res Fdn, Personal Genom Inst, Cheongju 28160, South Korea
[4] UNIST, Gerom Inc, Ulsan 44919, South Korea
[5] Korea Res Inst Stand & Sci, Natl Standard Reference Ctr, Daejeon 34113, South Korea
[6] Soongsil Univ, Sch Syst Biomed Sci, Seoul 06978, South Korea
[7] Univ Cambridge, Dept Zool, Downing St, Cambridge CB2 3EJ, England
[8] Univ New Mexico, Ctr Comprehens Canc, Chem & Chem Biol, Albuquerque, NM 87131 USA
[9] Korea Res Inst Biosci & Biotechnol, Infect & Immun Res Ctr, Daejeon 34141, South Korea
[10] Harvard Med Sch, Dept Genet, New Res Bldg NRB,77 Ave Louis Pasteur,Room 238, Boston, MA 02115 USA
关键词
STRUCTURAL VARIATION; SEQUENCE; DATABASE; TOOL; POLYMORPHISMS; ARCHITECTURE; NEANDERTHAL; GENERATION; FRAMEWORK; ALIGNMENT;
D O I
10.1038/ncomms13637
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Human genomes are routinely compared against a universal reference. However, this strategy could miss population-specific and personal genomic variations, which may be detected more efficiently using an ethnically relevant or personal reference. Here we report a hybrid assembly of a Korean reference genome (KOREF) for constructing personal and ethnic references by combining sequencing and mapping methods. We also build its consensus variome reference, providing information on millions of variants from 40 additional ethnically homogeneous genomes from the Korean Personal Genome Project. We find that the ethnically relevant consensus reference can be beneficial for efficient variant detection. Systematic comparison of human assemblies shows the importance of assembly quality, suggesting the necessity of new technologies to comprehensively map ethnic and personal genomic structure variations. In the era of large-scale population genome projects, the leveraging of ethnicity-specific genome assemblies as well as the human reference genome will accelerate mapping all human genome diversity.
引用
收藏
页数:12
相关论文
共 69 条
[1]   TEclass-a tool for automated classification of unknown eukaryotic transposable elements [J].
Abrusan, Gyorgy ;
Grundmann, Norbert ;
DeMester, Luc ;
Makalowski, Wojciech .
BIOINFORMATICS, 2009, 25 (10) :1329-1330
[2]   Limitations of next-generation genome sequence assembly [J].
Alkan, Can ;
Sajjadian, Saba ;
Eichler, Evan E. .
NATURE METHODS, 2011, 8 (01) :61-65
[3]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[4]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[5]   Integrating common and rare genetic variation in diverse human populations [J].
Altshuler, David M. ;
Gibbs, Richard A. ;
Peltonen, Leena ;
Dermitzakis, Emmanouil ;
Schaffner, Stephen F. ;
Yu, Fuli ;
Bonnen, Penelope E. ;
de Bakker, Paul I. W. ;
Deloukas, Panos ;
Gabriel, Stacey B. ;
Gwilliam, Rhian ;
Hunt, Sarah ;
Inouye, Michael ;
Jia, Xiaoming ;
Palotie, Aarno ;
Parkin, Melissa ;
Whittaker, Pamela ;
Chang, Kyle ;
Hawes, Alicia ;
Lewis, Lora R. ;
Ren, Yanru ;
Wheeler, David ;
Muzny, Donna Marie ;
Barnes, Chris ;
Darvishi, Katayoon ;
Hurles, Matthew ;
Korn, Joshua M. ;
Kristiansson, Kati ;
Lee, Charles ;
McCarroll, Steven A. ;
Nemesh, James ;
Keinan, Alon ;
Montgomery, Stephen B. ;
Pollack, Samuela ;
Price, Alkes L. ;
Soranzo, Nicole ;
Gonzaga-Jauregui, Claudia ;
Anttila, Verneri ;
Brodeur, Wendy ;
Daly, Mark J. ;
Leslie, Stephen ;
McVean, Gil ;
Moutsianas, Loukas ;
Nguyen, Huy ;
Zhang, Qingrun ;
Ghori, Mohammed J. R. ;
McGinnis, Ralph ;
McLaren, William ;
Takeuchi, Fumihiko ;
Grossman, Sharon R. .
NATURE, 2010, 467 (7311) :52-58
[6]  
[Anonymous], 2013, GENOMICS
[7]  
[Anonymous], 2007, CURR PROTOC IMMUNOL
[8]   The Genome of a Mongolian Individual Reveals the Genetic Imprints of Mongolians on Modern Human Populations [J].
Bai, Haihua ;
Guo, Xiaosen ;
Zhang, Dong ;
Narisu, Narisu ;
Bu, Junjie ;
Jirimutu, Jirimutu ;
Liang, Fan ;
Zhao, Xiang ;
Xing, Yanping ;
Wang, Dingzhu ;
Li, Tongda ;
Zhang, Yanru ;
Guan, Baozhu ;
Yang, Xukui ;
Yang, Zili ;
Shuangshan, Shuangshan ;
Su, Zhe ;
Wu, Huiguang ;
Li, Wenjing ;
Chen, Ming ;
Zhu, Shilin ;
Bayinnamula, Bayinnamula ;
Chang, Yuqi ;
Gao, Ying ;
Lan, Tianming ;
Suyalatu, Suyalatu ;
Huang, Hui ;
Su, Yan ;
Chen, Yujie ;
Li, Wenqi ;
Yang, Xu ;
Feng, Qiang ;
Wang, Jian ;
Yang, Huanming ;
Wang, Jun ;
Wu, Qizhu ;
Yin, Ye ;
Zhou, Huanmin .
GENOME BIOLOGY AND EVOLUTION, 2014, 6 (12) :3122-3136
[9]   MaskerAid:: a performance enhancement to RepeatMasker [J].
Bedell, JA ;
Korf, I ;
Gish, W .
BIOINFORMATICS, 2000, 16 (11) :1040-1041
[10]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580