Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals

Cited by: 109
Authors
Ju, Young Seok [1 ,2 ]
Kim, Jong-Il [1 ,3 ,4 ,5 ]
Kim, Sheehyun [1 ,2 ]
Hong, Dongwan [1 ]
Park, Hansoo [1 ,6 ,7 ]
Shin, Jong-Yeon [1 ,5 ]
Lee, Seungbok [1 ,4 ]
Lee, Won-Chul [1 ,4 ]
Kim, Sujung [5 ]
Yu, Saet-Byeol [5 ]
Park, Sung-Soo [5 ]
Seo, Seung-Hyun [5 ]
Yun, Ji-Young [5 ]
Kim, Hyun-Jin [1 ,4 ]
Lee, Dong-Sung [1 ,4 ]
Yavartanoo, Maryam [1 ,4 ]
Kang, Hyunseok Peter [1 ]
Gokcumen, Omer [6 ,7 ]
Govindaraju, Diddahally R. [6 ,7 ]
Jung, Jung Hee [2 ]
Chong, Hyonyong [2 ,8 ]
Yang, Kap-Seok [2 ]
Kim, Hyungtae [2 ]
Lee, Charles [6 ,7 ]
Seo, Jeong-Sun [1 ,2 ,3 ,4 ,5 ,8 ]
Affiliations
[1] Seoul Natl Univ, Med Res Ctr, GMI, Seoul, South Korea
[2] Macrogen Inc, Seoul, South Korea
[3] Seoul Natl Univ, Coll Med, Dept Biochem, Seoul, South Korea
[4] Seoul Natl Univ, Grad Sch, Dept Biomed Sci, Seoul, South Korea
[5] Psoma Therapeut Inc, Seoul, South Korea
[6] Brigham & Womens Hosp, Dept Pathol, Boston, MA 02115 USA
[7] Harvard Univ, Sch Med, Boston, MA USA
[8] Axeq Technol, Rockville, MD USA
Funding
US National Institutes of Health (NIH);
Keywords
GENE-EXPRESSION; STRUCTURAL VARIANTS; EDITING SITES; FAMILY; COMMON; SNP;
DOI
10.1038/ng.872
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Massively parallel sequencing technologies have identified a broad spectrum of human genome diversity. Here we deep sequenced and correlated 18 genomes and 17 transcriptomes of unrelated Korean individuals. This has allowed us to construct a genome-wide map of common and rare variants and also identify variants formed during DNA-RNA transcription. We identified 9.56 million genomic variants, 23.2% of which appear to be previously unidentified. From transcriptome sequencing, we discovered 4,414 transcripts not previously annotated. Finally, we revealed 1,809 sites of transcriptional base modification, where the transcriptional landscape is different from the corresponding genomic sequences, and 580 sites of allele-specific expression. Our findings suggest that a considerable number of unexplored genomic variants still remain to be identified in the human genome, and that the integrated analysis of genome and transcriptome sequencing is powerful for understanding the diversity and functional aspects of human genomic variants.
Pages: 745 / U47
Page count: 10
Related papers
50 in total
  • [41] Complete Khoisan and Bantu genomes from southern Africa
    Schuster, Stephan C.
    Miller, Webb
    Ratan, Aakrosh
    Tomsho, Lynn P.
    Giardine, Belinda
    Kasson, Lindsay R.
    Harris, Robert S.
    Petersen, Desiree C.
    Zhao, Fangqing
    Qi, Ji
    Alkan, Can
    Kidd, Jeffrey M.
    Sun, Yazhou
    Drautz, Daniela I.
    Bouffard, Pascal
    Muzny, Donna M.
    Reid, Jeffrey G.
    Nazareth, Lynne V.
    Wang, Qingyu
    Burhans, Richard
    Riemer, Cathy
    Wittekindt, Nicola E.
    Moorjani, Priya
    Tindall, Elizabeth A.
    Danko, Charles G.
    Teo, Wee Siang
    Buboltz, Anne M.
    Zhang, Zhenhai
    Ma, Qianyi
    Oosthuysen, Arno
    Steenkamp, Abraham W.
    Oostuisen, Hermann
    Venter, Philippus
    Gajewski, John
    Zhang, Yu
    Pugh, B. Franklin
    Makova, Kateryna D.
    Nekrutenko, Anton
    Mardis, Elaine R.
    Patterson, Nick
    Pringle, Tom H.
    Chiaromonte, Francesca
    Mullikin, James C.
    Eichler, Evan E.
    Hardison, Ross C.
    Gibbs, Richard A.
    Harkins, Timothy T.
    Hayes, Vanessa M.
    [J]. NATURE, 2010, 463 (7283) : 943 - 947
  • [42] ABySS: A parallel assembler for short read sequence data
    Simpson, Jared T.
    Wong, Kim
    Jackman, Shaun D.
    Schein, Jacqueline E.
    Jones, Steven J. M.
    Birol, Inanc
    [J]. GENOME RESEARCH, 2009, 19 (06) : 1117 - 1123
  • [43] RNA-sequence analysis of human B-cells
    Toung, Jonathan M.
    Morley, Michael
    Li, Mingyao
    Cheung, Vivian G.
    [J]. GENOME RESEARCH, 2011, 21 (06) : 991 - 998
  • [44] The sequence of the human genome
    Venter, JC
    Adams, MD
    Myers, EW
    Li, PW
    Mural, RJ
    Sutton, GG
    Smith, HO
    Yandell, M
    Evans, CA
    Holt, RA
    Gocayne, JD
    Amanatides, P
    Ballew, RM
    Huson, DH
    Wortman, JR
    Zhang, Q
    Kodira, CD
    Zheng, XQH
    Chen, L
    Skupski, M
    Subramanian, G
    Thomas, PD
    Zhang, JH
    Miklos, GLG
    Nelson, C
    Broder, S
    Clark, AG
    Nadeau, C
    McKusick, VA
    Zinder, N
    Levine, AJ
    Roberts, RJ
    Simon, M
    Slayman, C
    Hunkapiller, M
    Bolanos, R
    Delcher, A
    Dew, I
    Fasulo, D
    Flanigan, M
    Florea, L
    Halpern, A
    Hannenhalli, S
    Kravitz, S
    Levy, S
    Mobarry, C
    Reinert, K
    Remington, K
    Abu-Threideh, J
    Beasley, E
    [J]. SCIENCE, 2001, 291 (5507) : 1304 - +
  • [45] The diploid genome sequence of an Asian individual
    Wang, Jun
    Wang, Wei
    Li, Ruiqiang
    Li, Yingrui
    Tian, Geng
    Goodman, Laurie
    Fan, Wei
    Zhang, Junqing
    Li, Jun
    Zhang, Juanbin
    Guo, Yiran
    Feng, Binxiao
    Li, Heng
    Lu, Yao
    Fang, Xiaodong
    Liang, Huiqing
    Du, Zhenglin
    Li, Dong
    Zhao, Yiqing
    Hu, Yujie
    Yang, Zhenzhen
    Zheng, Hancheng
    Hellmann, Ines
    Inouye, Michael
    Pool, John
    Yi, Xin
    Zhao, Jing
    Duan, Jinjie
    Zhou, Yan
    Qin, Junjie
    Ma, Lijia
    Li, Guoqing
    Yang, Zhentao
    Zhang, Guojie
    Yang, Bin
    Yu, Chang
    Liang, Fang
    Li, Wenjie
    Li, Shaochuan
    Li, Dawei
    Ni, Peixiang
    Ruan, Jue
    Li, Qibin
    Zhu, Hongmei
    Liu, Dongyuan
    Lu, Zhike
    Li, Ning
    Guo, Guangwu
    Zhang, Jianguo
    Ye, Jia
    [J]. NATURE, 2008, 456 (7218) : 60 - U1
  • [46] The complete genome of an individual by massively parallel DNA sequencing
    Wheeler, David A.
    Srinivasan, Maithreyan
    Egholm, Michael
    Shen, Yufeng
    Chen, Lei
    McGuire, Amy
    He, Wen
    Chen, Yi-Ju
    Makhijani, Vinod
    Roth, G. Thomas
    Gomes, Xavier
    Tartaro, Karrie
    Niazi, Faheem
    Turcotte, Cynthia L.
    Irzyk, Gerard P.
    Lupski, James R.
    Chinault, Craig
    Song, Xing-zhi
    Liu, Yue
    Yuan, Ye
    Nazareth, Lynne
    Qin, Xiang
    Muzny, Donna M.
    Margulies, Marcel
    Weinstock, George M.
    Gibbs, Richard A.
    Rothberg, Jonathan M.
    [J]. NATURE, 2008, 452 (7189) : 872 - U5
  • [47] Fast and SNP-tolerant detection of complex variants and splicing in short reads
    Wu, Thomas D.
    Nacu, Serban
    [J]. BIOINFORMATICS, 2010, 26 (07) : 873 - 881
  • [48] Elucidating the inosinome: global approaches to adenosine-to-inosine RNA editing
    Wulff, Bjorn-Erik
    Sakurai, Masayuki
    Nishikura, Kazuko
    [J]. NATURE REVIEWS GENETICS, 2011, 12 (02) : 81 - 85
  • [49] A SNP in the ABCC11 gene is the determinant of human earwax type
    Yoshiura, K
    Kinoshita, A
    Ishida, T
    Ninokata, A
    Ishikawa, T
    Kaname, T
    Bannai, M
    Tokunaga, K
    Sonoda, S
    Komaki, R
    Ihara, M
    Saenko, VA
    Alipov, GK
    Sekine, I
    Komatsu, K
    Takahashi, H
    Nakashima, M
    Sosonkina, N
    Mapendano, CK
    Ghadami, M
    Nomura, M
    Liang, DS
    Miwa, N
    Kim, DK
    Garidkhuu, A
    Natsume, N
    Ohta, T
    Tomita, H
    Kaneko, A
    Kikuchi, M
    Russomando, G
    Hirayama, K
    Ishibashi, M
    Takahashi, A
    Saitou, N
    Murray, JC
    Saito, S
    Nakamura, Y
    Niikawa, N
    [J]. NATURE GENETICS, 2006, 38 (03) : 324 - 330
  • [50] Cancer resistance in transgenic mice expressing the SAC module of par-4
    Zhao, Yanming
    Burikhanov, Ravshan
    Qiu, Shirley
    Lele, Subodh M.
    Jennings, C. Darrell
    Bondada, Subbarao
    Spear, Brett
    Rangnekar, Vivek M.
    [J]. CANCER RESEARCH, 2007, 67 (19) : 9276 - 9285