Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals

Cited by: 109
Authors
Ju, Young Seok [1, 2]
Kim, Jong-Il [1, 3, 4, 5]
Kim, Sheehyun [1, 2]
Hong, Dongwan [1]
Park, Hansoo [1, 6, 7]
Shin, Jong-Yeon [1, 5]
Lee, Seungbok [1, 4]
Lee, Won-Chul [1, 4]
Kim, Sujung [5]
Yu, Saet-Byeol [5]
Park, Sung-Soo [5]
Seo, Seung-Hyun [5]
Yun, Ji-Young [5]
Kim, Hyun-Jin [1, 4]
Lee, Dong-Sung [1, 4]
Yavartanoo, Maryam [1, 4]
Kang, Hyunseok Peter [1]
Gokcumen, Omer [6, 7]
Govindaraju, Diddahally R. [6, 7]
Jung, Jung Hee [2]
Chong, Hyonyong [2, 8]
Yang, Kap-Seok [2]
Kim, Hyungtae [2]
Lee, Charles [6, 7]
Seo, Jeong-Sun [1, 2, 3, 4, 5, 8]
Affiliations
[1] Seoul Natl Univ, Med Res Ctr, GMI, Seoul, South Korea
[2] Macrogen Inc, Seoul, South Korea
[3] Seoul Natl Univ, Coll Med, Dept Biochem, Seoul, South Korea
[4] Seoul Natl Univ, Grad Sch, Dept Biomed Sci, Seoul, South Korea
[5] Psoma Therapeut Inc, Seoul, South Korea
[6] Brigham & Womens Hosp, Dept Pathol, Boston, MA 02115 USA
[7] Harvard Univ, Sch Med, Boston, MA USA
[8] Axeq Technol, Rockville, MD USA
Funding
US National Institutes of Health;
Keywords
GENE-EXPRESSION; STRUCTURAL VARIANTS; EDITING SITES; FAMILY; COMMON; SNP;
DOI
10.1038/ng.872
Chinese Library Classification
Q3 [Genetics];
Discipline classification codes
071007; 090102;
Abstract
Massively parallel sequencing technologies have identified a broad spectrum of human genome diversity. Here we deep sequenced and correlated 18 genomes and 17 transcriptomes of unrelated Korean individuals. This has allowed us to construct a genome-wide map of common and rare variants and also identify variants formed during DNA-RNA transcription. We identified 9.56 million genomic variants, 23.2% of which appear to be previously unidentified. From transcriptome sequencing, we discovered 4,414 transcripts not previously annotated. Finally, we revealed 1,809 sites of transcriptional base modification, where the transcriptional landscape is different from the corresponding genomic sequences, and 580 sites of allele-specific expression. Our findings suggest that a considerable number of unexplored genomic variants still remain to be identified in the human genome, and that the integrated analysis of genome and transcriptome sequencing is powerful for understanding the diversity and functional aspects of human genomic variants.
Pages: 745 - U47
Page count: 10