Whole genome variant analysis in three ethnically diverse Indians

被引:2
作者
Malhotra, Seema [1 ]
Singh, Sayar [1 ]
Sarkar, Soma [1 ]
机构
[1] Govt India, DIPAS, Def Res & Dev Org, Minist Def, Lucknow Rd, Delhi 110054, India
关键词
Indian genome; Ethnic; Genetic diversity; Whole genome sequencing; HIGH-ALTITUDE; POPULATION; MTDNA; SEQUENCE; MUTATION; ROLES; DNA;
D O I
10.1007/s13258-018-0650-z
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
India represents an amazing confluence of geographically, linguistically and socially disparate ethnic populations (Indian Genome Variation Consortium, J Genet 87:3-20, 2008). Understanding the genetic diversity of Indian population remains a daunting task. In this paper we present detailed analysis of genomic variations (high-depth coverage ( 30x) using Illumina Hiseq 2000 platform) from three healthy Indian male individuals each belonging to three geographically delineated regions and linguistic phylum viz. high altitude region of Ladakh (Tibeto-Burman linguistic phylum), sub mountainous region of Kumaun (Indo-European linguistic phylum) and sea level region of Telangana (Dravidian linguistic phylum) for probing the extent of genetic diversity in our population. The sequencing analysis provided high quality data ( 95% of the total reads aligned to the human reference genome for each sample) and very good alignment quality (> 80% of the filtered mapped reads had a quality score of 60). A total of 4.3, 3.7 and 4.3 million single nucleotide variations were identified in the genome of high altitude, sub mountainous and sea level respectively by comparing with human reference genome. Approximately 17.3, 18.2, 17.4% of the variants were unique in the three genomes. The study identified many novel variations in the three diverse genomes (132,970 in Ladakh, 112,317 in Kumaun and 128,881 in Telangana individual) and is an important resource for creating a baseline and a comprehensive catalogue of human genomic variation across the Indian as well as the Asian continent.
引用
收藏
页码:497 / 510
页数:14
相关论文
共 43 条
  • [1] The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group
    Ahn, Sung-Min
    Kim, Tae-Hyung
    Lee, Sunghoon
    Kim, Deokhoon
    Ghang, Ho
    Kim, Dae-Soo
    Kim, Byoung-Chul
    Kim, Sang-Yoon
    Kim, Woo-Yeon
    Kim, Chulhong
    Park, Daeui
    Lee, Yong Seok
    Kim, Sangsoo
    Reja, Rohit
    Jho, Sungwoong
    Kim, Chang Geun
    Cha, Ji-Young
    Kim, Kyung-Hee
    Lee, Bonghee
    Bhak, Jong
    Kim, Seong-Jin
    [J]. GENOME RESEARCH, 2009, 19 (09) : 1622 - 1629
  • [2] A map of human genome variation from population-scale sequencing
    Altshuler, David
    Durbin, Richard M.
    Abecasis, Goncalo R.
    Bentley, David R.
    Chakravarti, Aravinda
    Clark, Andrew G.
    Collins, Francis S.
    De la Vega, Francisco M.
    Donnelly, Peter
    Egholm, Michael
    Flicek, Paul
    Gabriel, Stacey B.
    Gibbs, Richard A.
    Knoppers, Bartha M.
    Lander, Eric S.
    Lehrach, Hans
    Mardis, Elaine R.
    McVean, Gil A.
    Nickerson, DebbieA.
    Peltonen, Leena
    Schafer, Alan J.
    Sherry, Stephen T.
    Wang, Jun
    Wilson, Richard K.
    Gibbs, Richard A.
    Deiros, David
    Metzker, Mike
    Muzny, Donna
    Reid, Jeff
    Wheeler, David
    Wang, Jun
    Li, Jingxiang
    Jian, Min
    Li, Guoqing
    Li, Ruiqiang
    Liang, Huiqing
    Tian, Geng
    Wang, Bo
    Wang, Jian
    Wang, Wei
    Yang, Huanming
    Zhang, Xiuqing
    Zheng, Huisong
    Lander, Eric S.
    Altshuler, David L.
    Ambrogio, Lauren
    Bloom, Toby
    Cibulskis, Kristian
    Fennell, Tim J.
    Gabriel, Stacey B.
    [J]. NATURE, 2010, 467 (7319) : 1061 - 1073
  • [3] [Anonymous], J RENIN ANGIOTENSIN
  • [4] [Anonymous], 2012, Nature
  • [5] Brahmachari SK, 2008, J GENET, V87, P3
  • [6] The South Asian Genome
    Chambers, John C.
    Abbott, James
    Zhang, Weihua
    Turro, Ernest
    Scott, William R.
    Tan, Sian-Tsung
    Afzal, Uzma
    Afaq, Saima
    Loh, Marie
    Lehne, Benjamin
    O'Reilly, Paul
    Gaulton, Kyle J.
    Pearson, Richard D.
    Li, Xinzhong
    Lavery, Anita
    Vandrovcova, Jana
    Wass, Mark N.
    Miller, Kathryn
    Sehmi, Joban
    Oozageer, Laticia
    Kooner, Ishminder K.
    Al-Hussaini, Abtehale
    Mills, Rebecca
    Grewal, Jagvir
    Panoulas, Vasileios
    Lewin, Alexandra M.
    Northwood, Korrinne
    Wander, Gurpreet S.
    Geoghegan, Frank
    Li, Yingrui
    Wang, Jun
    Aitman, Timothy J.
    McCarthy, Mark I.
    Scott, James
    Butcher, Sarah
    Elliott, Paul
    Kooner, Jaspal S.
    [J]. PLOS ONE, 2014, 9 (08):
  • [7] Updating Phylogeny of Mitochondrial DNA Macrohaplogroup M in India: Dispersal of Modern Human in South Asian Corridor
    Chandrasekar, Adimoolam
    Kumar, Satish
    Sreenath, Jwalapuram
    Sarkar, Bishwa Nath
    Urade, Bhaskar Pralhad
    Mallick, Sujit
    Bandopadhyay, Syam Sundar
    Barua, Pinuma
    Barik, Subihra Sankar
    Basu, Debasish
    Kiran, Uttaravalli
    Gangopadhyay, Prodyot
    Sahani, Ramesh
    Prasad, Bhagavatula Venkata Ravi
    Gangopadhyay, Shampa
    Lakshmi, Gandikota Rama
    Ravuri, Rajasekhara Reddy
    Padmaja, Koneru
    Venugopal, Pulamaghatta N.
    Sharma, Madhu Bala
    Rao, Vadlamudi Raghavendra
    [J]. PLOS ONE, 2009, 4 (10):
  • [8] Uncovering the roles of rare variants in common disease through whole-genome sequencing
    Cirulli, Elizabeth T.
    Goldstein, David B.
    [J]. NATURE REVIEWS GENETICS, 2010, 11 (06) : 415 - 425
  • [9] Origin and Post-Glacial Dispersal of Mitochondrial DNA Haplogroups C and D in Northern Asia
    Derenko, Miroslava
    Malyarchuk, Boris
    Grzybowski, Tomasz
    Denisova, Galina
    Rogalla, Urszula
    Perkova, Maria
    Dambueva, Irina
    Zakharov, Ilia
    [J]. PLOS ONE, 2010, 5 (12):
  • [10] Whole Genome Sequence of a Turkish Individual
    Dogan, Haluk
    Can, Handan
    Otu, Hasan H.
    [J]. PLOS ONE, 2014, 9 (01):