A Population-Specific Major Allele Reference Genome From The United Arab Emirates Population

Times cited: 14
Authors
Daw Elbait, Gihan [1]
Henschel, Andreas [1, 2]
Tay, Guan K. [1, 3, 4, 5]
Al Safar, Habiba S. [1, 3, 6]
Affiliations
[1] Khalifa Univ Sci & Technol, Ctr Biotechnol, Abu Dhabi, U Arab Emirates
[2] Khalifa Univ Sci & Technol, Dept Elect Engn & Comp Sci, Abu Dhabi, U Arab Emirates
[3] Khalifa Univ Sci & Technol, Dept Biomed Engn, Abu Dhabi, U Arab Emirates
[4] Univ Western Australia, Fac Hlth & Med Sci, Div Psychiat, Crawley, WA, Australia
[5] Edith Cowan Univ, Sch Med & Hlth Sci, Joondalup, WA, Australia
[6] Khalifa Univ Sci & Technol, Coll Med & Hlth Sci, Dept Genet & Mol Biol, Abu Dhabi, U Arab Emirates
Keywords
UAE reference genome; next generation sequencing; structural variants; population representative sampling; population genetics; Arab genome; reference genome; SEQUENCE; FRAMEWORK; ALIGNMENT; VARIANTS; DATABASE;
DOI
10.3389/fgene.2021.660428
Chinese Library Classification number
Q3 [Genetics];
Subject classification codes
071007 ; 090102 ;
Abstract
The ethnic composition of a country's population makes each national DNA sequencing project unique and, ideally, individual reference genomes are required to reduce the confounding effect of ethnic bias. This work presents a representative whole genome sequencing effort for an understudied population. Specifically, high-coverage consensus sequences from 120 whole genomes and 33 whole exomes were used to construct the first population-specific major allele reference genome for the United Arab Emirates (UAE). When this reference was applied and compared to the archetype hg19 reference, the number of variant calls in local Emirati genomes was reduced by approximately 19% (i.e., some 1 million fewer calls). In compiling the United Arab Emirates Reference Genome (UAERG), 23,038,090 annotated short variants (novel: 1,790,171) and 137,713 structural variants (novel: 8,462) were identified, together with their allele frequencies (AFs) and distribution across the genome. Population-specific genetic characteristics, including loss-of-function variants, admixture, and ancestral haplogroup distribution, were identified and are reported here. We also detect a strong correlation between F_ST and admixture components in the UAE. This baseline study was conceived to establish a high-quality reference genome and a genetic variation resource to enable the development of regional population-specific initiatives and thus inform the application of population studies and precision medicine in the UAE.
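The idea of a major allele reference can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the `Variant` tuple layout, the function names, the 0.5 frequency threshold, and the restriction to single-nucleotide variants are all assumptions made for illustration, and the sequences and counts are invented toy values rather than data from the study.

```python
from typing import List, Tuple

# Hypothetical variant record (assumed layout, not from the paper):
# (0-based position, reference allele, alternate allele, alternate allele frequency)
Variant = Tuple[int, str, str, float]


def major_allele_reference(reference: str,
                           variants: List[Variant],
                           threshold: float = 0.5) -> str:
    """Return a copy of `reference` in which every single-nucleotide site
    whose alternate allele frequency exceeds `threshold` carries the
    alternate (major) allele instead of the original reference allele."""
    seq = list(reference)
    for pos, ref, alt, af in variants:
        # This sketch handles SNVs only; indels and structural variants are skipped.
        if len(ref) != 1 or len(alt) != 1:
            continue
        if reference[pos] != ref:
            raise ValueError(f"reference mismatch at position {pos}")
        if af > threshold:
            seq[pos] = alt
    return "".join(seq)


def percent_reduction(calls_before: int, calls_after: int) -> float:
    """Percentage drop in variant calls when switching references."""
    return 100.0 * (calls_before - calls_after) / calls_before


# Toy data (hypothetical values, chosen only to exercise the functions).
toy_reference = "ACGTACGT"
toy_variants = [
    (1, "C", "T", 0.80),  # alternate is the major allele: swapped in
    (4, "A", "G", 0.30),  # alternate is the minor allele: reference kept
    (6, "G", "A", 0.55),  # alternate is the major allele: swapped in
]
toy_major = major_allele_reference(toy_reference, toy_variants)
toy_drop = percent_reduction(5_000_000, 4_050_000)
```

Sites where a sample carries the population's major allele no longer differ from such a reference, which is the intuition behind the lower call count reported in the abstract.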
Pages: 15