A rapid and reference-free imputation method for low-cost genotyping platforms

被引:1
作者
Duong, Vinh Chi [1 ,2 ]
Vu, Giang Minh [1 ,2 ]
Nguyen, Thien Khac [2 ]
Nguyen, Hung Tran The [1 ,3 ]
Pham, Thang Luong [2 ]
Vo, Nam S. [1 ,2 ]
Hoang, Tham Hong [1 ,2 ]
机构
[1] Vingroup Big Data Inst, Ctr Biomed Informat, Hanoi, Vietnam
[2] GeneStory Joint Stock Co, Hanoi, Vietnam
[3] Nanyang Technol Univ, Singapore, Singapore
关键词
D O I
10.1038/s41598-023-50086-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Most current genotype imputation methods are reference-based, which posed several challenges to users, such as high computational costs and reference panel inaccessibility. Thus, deep learning models are expected to create reference-free imputation methods performing with higher accuracy and shortening the running time. We proposed a imputation method using recurrent neural networks integrating with an additional discriminator network, namely GRUD. This method was applied to datasets from genotyping chips and Low-Pass Whole Genome Sequencing (LP-WGS) with the reference panels from The 1000 Genomes Project (1KGP) phase 3, the dataset of 4810 Singaporeans (SG10K), and The 1000 Vietnamese Genome Project (VN1K). Our model performed more accurately than other existing methods on multiple datasets, especially with common variants with large minor allele frequency, and shrank running time and memory usage. In summary, these results indicated that GRUD can be implemented in genomic analyses to improve the accuracy and running-time of genotype imputation.
引用
收藏
页数:10
相关论文
共 32 条
  • [1] Ahn J., 2021, arXiv
  • [2] A global reference for human genetic variation
    Altshuler, David M.
    Durbin, Richard M.
    Abecasis, Goncalo R.
    Bentley, David R.
    Chakravarti, Aravinda
    Clark, Andrew G.
    Donnelly, Peter
    Eichler, Evan E.
    Flicek, Paul
    Gabriel, Stacey B.
    Gibbs, Richard A.
    Green, Eric D.
    Hurles, Matthew E.
    Knoppers, Bartha M.
    Korbel, Jan O.
    Lander, Eric S.
    Lee, Charles
    Lehrach, Hans
    Mardis, Elaine R.
    Marth, Gabor T.
    McVean, Gil A.
    Nickerson, Deborah A.
    Wang, Jun
    Wilson, Richard K.
    Boerwinkle, Eric
    Doddapaneni, Harsha
    Han, Yi
    Korchina, Viktoriya
    Kovar, Christie
    Lee, Sandra
    Muzny, Donna
    Reid, Jeffrey G.
    Zhu, Yiming
    Chang, Yuqi
    Feng, Qiang
    Fang, Xiaodong
    Guo, Xiaosen
    Jian, Min
    Jiang, Hui
    Jin, Xin
    Lan, Tianming
    Li, Guoqing
    Li, Jingxiang
    Li, Yingrui
    Liu, Shengmao
    Liu, Xiao
    Lu, Yao
    Ma, Xuedi
    Tang, Meifang
    Wang, Bo
    [J]. NATURE, 2015, 526 (7571) : 68 - +
  • [3] Deep Text Summarization using Generative Adversarial Networks in Indian Languages
    Bhargava, Rupal
    Sharma, Gargi
    Sharma, Yashvardhan
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 147 - 153
  • [4] A One-Penny Imputed Genome from Next-Generation Reference Panels
    Browning, Brian L.
    Zhou, Ying
    Browning, Sharon R.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2018, 103 (03) : 338 - 348
  • [5] Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, 10.48550/arXiv.1406.1078.]
  • [6] Chung JY, 2014, Arxiv, DOI [arXiv:1412.3555, 10.48550/arXiv.1412.3555]
  • [7] Genotype Imputation from Large Reference Panels
    Das, Sayantan
    Abecasis, Goncalo R.
    Browning, Brian L.
    [J]. ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 19, 2018, 19 : 73 - 96
  • [8] Next-generation genotype imputation service and methods
    Das, Sayantan
    Forer, Lukas
    Schoenherr, Sebastian
    Sidore, Carlo
    Locke, Adam E.
    Kwong, Alan
    Vrieze, Scott I.
    Chew, Emily Y.
    Levy, Shawn
    McGue, Matt
    Schlessinger, David
    Stambolian, Dwight
    Loh, Po-Ru
    Iacono, William G.
    Swaroop, Anand
    Scott, Laura J.
    Cucca, Francesco
    Kronenberg, Florian
    Boehnke, Michael
    Abecasis, Goncalo R.
    Fuchsberger, Christian
    [J]. NATURE GENETICS, 2016, 48 (10) : 1284 - 1287
  • [9] Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
  • [10] Rapid, Reference-Free human genotype imputation with denoising autoencoders
    Dias, Raquel
    Evans, Doug
    Chen, Shang-Fu
    Chen, Kai-Yu
    Loguercio, Salvatore
    Chan, Leslie
    Torkamani, Ali
    Stephens, Matthew
    [J]. ELIFE, 2022, 11