De novo assembly of a haplotype-resolved human genome

被引:51
作者
Cao, Hongzhi [1 ,2 ]
Wu, Honglong [1 ,2 ]
Luo, Ruibang [1 ,4 ]
Huang, Shujia [1 ,5 ]
Sun, Yuhui [1 ,5 ]
Tong, Xin [1 ]
Xie, Yinlong [1 ,4 ]
Liu, Binghang [1 ,4 ]
Yang, Hailong [1 ]
Zheng, Hancheng [1 ,3 ]
Li, Jian [1 ,3 ]
Li, Bo [1 ]
Wang, Yu [1 ,5 ]
Yang, Fang [1 ]
Sun, Peng [1 ]
Liu, Siyang [1 ,3 ]
Gao, Peng [1 ]
Huang, Haodong [1 ,5 ]
Sun, Jing [1 ]
Chen, Dan [1 ]
He, Guangzhu [1 ]
Huang, Weihua [1 ]
Huang, Zheng [1 ]
Li, Yue [1 ]
Tellier, Laurent C. A. M. [1 ,3 ]
Liu, Xiao [1 ,3 ]
Feng, Qiang [1 ,3 ]
Xu, Xun [1 ]
Zhang, Xiuqing [1 ]
Bolund, Lars [1 ,6 ,7 ]
Krogh, Anders [1 ,3 ]
Kristiansen, Karsten [1 ,3 ]
Drmanac, Radoje [8 ]
Drmanac, Snezana [8 ]
Nielsen, Rasmus [1 ,9 ,10 ]
Li, Songgang [1 ]
Wang, Jian [1 ,11 ]
Yang, Huanming [1 ,11 ,12 ]
Li, Yingrui [1 ,13 ]
Wong, Gane Ka-Shu [1 ,14 ,15 ]
Wang, Jun [1 ,12 ,16 ,17 ,18 ]
机构
[1] BGI Shenzhen, Shenzhen, Peoples R China
[2] BGI Tianjin, Tianjin, Peoples R China
[3] Univ Copenhagen, Dept Biol, Copenhagen, Denmark
[4] HKU BGI Bioinformat Algorithms & Core Technol Res, Hong Kong, Hong Kong, Peoples R China
[5] S China Univ Technol, Sch Biosci & Bioengn, Guangzhou 510641, Guangdong, Peoples R China
[6] Univ Aarhus, Inst Biomed, Aarhus, Denmark
[7] Danish Ctr Translat Breast Canc Res, Copenhagen, Denmark
[8] Complete Genom Inc, Mountain View, CA USA
[9] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[10] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[11] James D Watson Inst Genome Sci, Hangzhou, Zhejiang, Peoples R China
[12] King Abdulaziz Univ, Princess AI Jawhara Albrahim Ctr Excellence Res H, Jeddah 21413, Saudi Arabia
[13] Univ Queensland, Inst Mol Biosci, Brisbane, Qld, Australia
[14] Univ Alberta, Dept Biol Sci, Edmonton, AB, Canada
[15] Univ Alberta, Dept Med, Edmonton, AB, Canada
[16] Macau Univ Sci & Technol, Taipa, Macau, Peoples R China
[17] Univ Hong Kong, Dept Med, Hong Kong, Hong Kong, Peoples R China
[18] Univ Hong Kong, State Key Lab Pharmaceut Biotechnol, Hong Kong, Hong Kong, Peoples R China
关键词
SEQUENCE; SNP; CANCER;
D O I
10.1038/nbt.3200
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.
引用
收藏
页码:617 / +
页数:10
相关论文
共 50 条
[1]   The haplotype-resolved genome and epigenome of the aneuploid HeLa cancer cell line [J].
Adey, Andrew ;
Burton, Joshua N. ;
Kitzman, Jacob O. ;
Hiatt, Joseph B. ;
Lewis, Alexandra P. ;
Martin, Beth K. ;
Qiu, Ruolan ;
Lee, Choli ;
Shendure, Jay .
NATURE, 2013, 500 (7461) :207-+
[2]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[3]   Adenosine deaminase activity in the serum and malignant tumors of breast cancer: The assessment of isoenzyme ADA1 and ADA2 activities [J].
Aghaei, M ;
Karami-Tehrani, F ;
Salami, S ;
Atri, M .
CLINICAL BIOCHEMISTRY, 2005, 38 (10) :887-891
[4]   The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group [J].
Ahn, Sung-Min ;
Kim, Tae-Hyung ;
Lee, Sunghoon ;
Kim, Deokhoon ;
Ghang, Ho ;
Kim, Dae-Soo ;
Kim, Byoung-Chul ;
Kim, Sang-Yoon ;
Kim, Woo-Yeon ;
Kim, Chulhong ;
Park, Daeui ;
Lee, Yong Seok ;
Kim, Sangsoo ;
Reja, Rohit ;
Jho, Sungwoong ;
Kim, Chang Geun ;
Cha, Ji-Young ;
Kim, Kyung-Hee ;
Lee, Bonghee ;
Bhak, Jong ;
Kim, Seong-Jin .
GENOME RESEARCH, 2009, 19 (09) :1622-1629
[5]   APPLICATIONS OF NEXT-GENERATION SEQUENCING Genome structural variation discovery and genotyping [J].
Alkan, Can ;
Coe, Bradley P. ;
Eichler, Evan E. .
NATURE REVIEWS GENETICS, 2011, 12 (05) :363-375
[6]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[7]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[8]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[9]   Integrating common and rare genetic variation in diverse human populations [J].
Altshuler, David M. ;
Gibbs, Richard A. ;
Peltonen, Leena ;
Dermitzakis, Emmanouil ;
Schaffner, Stephen F. ;
Yu, Fuli ;
Bonnen, Penelope E. ;
de Bakker, Paul I. W. ;
Deloukas, Panos ;
Gabriel, Stacey B. ;
Gwilliam, Rhian ;
Hunt, Sarah ;
Inouye, Michael ;
Jia, Xiaoming ;
Palotie, Aarno ;
Parkin, Melissa ;
Whittaker, Pamela ;
Chang, Kyle ;
Hawes, Alicia ;
Lewis, Lora R. ;
Ren, Yanru ;
Wheeler, David ;
Muzny, Donna Marie ;
Barnes, Chris ;
Darvishi, Katayoon ;
Hurles, Matthew ;
Korn, Joshua M. ;
Kristiansson, Kati ;
Lee, Charles ;
McCarroll, Steven A. ;
Nemesh, James ;
Keinan, Alon ;
Montgomery, Stephen B. ;
Pollack, Samuela ;
Price, Alkes L. ;
Soranzo, Nicole ;
Gonzaga-Jauregui, Claudia ;
Anttila, Verneri ;
Brodeur, Wendy ;
Daly, Mark J. ;
Leslie, Stephen ;
McVean, Gil ;
Moutsianas, Loukas ;
Nguyen, Huy ;
Zhang, Qingrun ;
Ghori, Mohammed J. R. ;
McGinnis, Ralph ;
McLaren, William ;
Takeuchi, Fumihiko ;
Grossman, Sharon R. .
NATURE, 2010, 467 (7311) :52-58
[10]   The significance of digital gene expression profiles [J].
Audic, S ;
Claverie, JM .
GENOME RESEARCH, 1997, 7 (10) :986-995