Whole genome variant analysis in three ethnically diverse Indians

被引:2
作者
Malhotra, Seema [1 ]
Singh, Sayar [1 ]
Sarkar, Soma [1 ]
机构
[1] Govt India, DIPAS, Def Res & Dev Org, Minist Def, Lucknow Rd, Delhi 110054, India
关键词
Indian genome; Ethnic; Genetic diversity; Whole genome sequencing; HIGH-ALTITUDE; POPULATION; MTDNA; SEQUENCE; MUTATION; ROLES; DNA;
D O I
10.1007/s13258-018-0650-z
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
India represents an amazing confluence of geographically, linguistically and socially disparate ethnic populations (Indian Genome Variation Consortium, J Genet 87:3-20, 2008). Understanding the genetic diversity of Indian population remains a daunting task. In this paper we present detailed analysis of genomic variations (high-depth coverage ( 30x) using Illumina Hiseq 2000 platform) from three healthy Indian male individuals each belonging to three geographically delineated regions and linguistic phylum viz. high altitude region of Ladakh (Tibeto-Burman linguistic phylum), sub mountainous region of Kumaun (Indo-European linguistic phylum) and sea level region of Telangana (Dravidian linguistic phylum) for probing the extent of genetic diversity in our population. The sequencing analysis provided high quality data ( 95% of the total reads aligned to the human reference genome for each sample) and very good alignment quality (> 80% of the filtered mapped reads had a quality score of 60). A total of 4.3, 3.7 and 4.3 million single nucleotide variations were identified in the genome of high altitude, sub mountainous and sea level respectively by comparing with human reference genome. Approximately 17.3, 18.2, 17.4% of the variants were unique in the three genomes. The study identified many novel variations in the three diverse genomes (132,970 in Ladakh, 112,317 in Kumaun and 128,881 in Telangana individual) and is an important resource for creating a baseline and a comprehensive catalogue of human genomic variation across the Indian as well as the Asian continent.
引用
收藏
页码:497 / 510
页数:14
相关论文
共 43 条
  • [21] The Population Genetics of dN/dS
    Kryazhimskiy, Sergey
    Plotkin, Joshua B.
    [J]. PLOS GENETICS, 2008, 4 (12):
  • [22] Fast and accurate short read alignment with Burrows-Wheeler transform
    Li, Heng
    Durbin, Richard
    [J]. BIOINFORMATICS, 2009, 25 (14) : 1754 - 1760
  • [23] A genetic mechanism for Tibetan high-altitude adaptation
    Lorenzo, Felipe R.
    Huff, Chad
    Myllymaki, Mikko
    Olenchock, Benjamin
    Swierczek, Sabina
    Tashi, Tsewang
    Gordeuk, Victor
    Wuren, Tana
    Ri-Li, Ge
    McClain, Donald A.
    Khan, Tahsin M.
    Koul, Parvaiz A.
    Guchhait, Prasenjit
    Salama, Mohamed E.
    Xing, Jinchuan
    Semenza, Gregg L.
    Liberzon, Ella
    Wilson, Andrew
    Simonson, Tatum S.
    Jorde, Lynn B.
    Kaelin, William G., Jr.
    Koivunen, Peppi
    Prchal, Josef T.
    [J]. NATURE GENETICS, 2014, 46 (09) : 951 - +
  • [24] The impact of next-generation sequencing technology on genetics
    Mardis, Elaine R.
    [J]. TRENDS IN GENETICS, 2008, 24 (03) : 133 - 141
  • [25] Carriers of human mitochondrial DNA macrohaplogroup M colonized India from southeastern Asia
    Marrero, Patricia
    Abu-Amero, Khaled K.
    Larruga, Jose M.
    Cabrera, Vicente M.
    [J]. BMC EVOLUTIONARY BIOLOGY, 2016, 16 : 1 - 13
  • [26] Most of the extant mtDNA boundaries in South and Southwest Asia were likely shaped during the initial settlement of Eurasia by anatomically modern humans
    Metspalu, M
    Kivisild, T
    Metspalu, E
    Parik, J
    Hudjashov, G
    Kaldma, K
    Serk, P
    Karmin, M
    Behar, DM
    Gilbert, MTP
    Endicott, P
    Mastana, S
    Papiha, SS
    Skorecki, K
    Torroni, A
    Villems, R
    [J]. BMC GENETICS, 2004, 5 (1)
  • [27] Natural genetic variation caused by small insertions and deletions in the human genome
    Mills, Ryan E.
    Pittard, W. Stephen
    Mullaney, Julienne M.
    Farooq, Umar
    Creasy, Todd H.
    Mahurkar, Anup A.
    Kemeza, David M.
    Strassler, Daniel S.
    Ponting, Chris P.
    Webber, Caleb
    Devine, Scott E.
    [J]. GENOME RESEARCH, 2011, 21 (06) : 830 - 839
  • [28] GENETIC DISTANCE BETWEEN POPULATIONS
    NEI, M
    [J]. AMERICAN NATURALIST, 1972, 106 (949) : 283 - +
  • [29] A large-scale analysis of human mitochondrial DNA sequences with special reference to the population history of East Eurasian
    Oota, H
    Saitou, N
    Ueda, S
    [J]. ANTHROPOLOGICAL SCIENCE, 2002, 110 (03) : 293 - 312
  • [30] Passarino G, 1996, AM J HUM GENET, V59, P927