Prioritizing Disease-Linked Variants, Genes, and Pathways with an Interactive Whole-Genome Analysis Pipeline

Cited by: 19
Authors
Lee, In-Hee [1]
Lee, Kyungjoon [2]
Hsing, Michael [1]
Choe, Yongjoon [1]
Park, Jin-Ho [1,3]
Kim, Shu Hee [4]
Bohn, Justin M. [1]
Neu, Matthew B. [1]
Hwang, Kyu-Baek [5]
Green, Robert C. [6]
Kohane, Isaac S. [1,2]
Kong, Sek Won [1]
Affiliations
[1] Childrens Hosp, Harvard Div Hlth Sci & Technol, Childrens Hosp Informat Program, Dept Med, Boston, MA 02115 USA
[2] Harvard Univ, Ctr Biomed Informat, Sch Med, Boston, MA 02115 USA
[3] Seoul Natl Univ Hosp, Dept Family Med, Seoul 110744, South Korea
[4] Stanford Univ, Palo Alto, CA 94305 USA
[5] Soongsil Univ, Sch Comp Sci & Engn, Seoul 156743, South Korea
[6] Brigham & Womens Hosp, Dept Med, Div Genet, Boston, MA 02115 USA
Keywords
whole-genome sequences; variant annotation; disease gene discovery; analysis pipeline; RARE-VARIANT; PERSONAL GENOMES; UVEAL MELANOMA; MUTATIONS; SEQUENCE; EXOME; DATABASE; TOOL; FRAMEWORK; COMMON
DOI
10.1002/humu.22520
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
Whole-genome sequencing (WGS) studies are uncovering disease-associated variants in both rare and nonrare diseases. Using next-generation sequencing for WGS requires a series of computational methods for alignment, variant detection, and annotation, and the accuracy and reproducibility of annotation results are essential for clinical implementation. However, annotating WGS data with up-to-date genomic information is still challenging for biomedical researchers. Here, we present one of the fastest, highly scalable annotation, filtering, and analysis pipelines, gNOME, to prioritize phenotype-associated variants while minimizing false-positive findings. The intuitive graphical user interface of gNOME facilitates the selection of phenotype-associated variants, and result summaries are provided at the variant, gene, and genome levels. Moreover, enrichment results for specific variants, genes, and gene sets, computed either between two groups or against the population-scale WGS datasets that are already integrated in the pipeline, can aid interpretation. We found a small number of discordant results between annotation software tools, in part due to different reporting strategies for variants with complex impacts. Using two published whole-exome datasets of uveal melanoma and bladder cancer, we demonstrated gNOME's accuracy of variant annotation and the enrichment of loss-of-function variants in known cancer pathways. The gNOME Web server and source code are freely available to the academic community ().
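The abstract mentions testing whether variants in a gene are enriched in one group relative to another. As a rough illustration only, the sketch below computes a one-sided Fisher's exact test on a 2x2 carrier table. This record does not state which statistical test gNOME uses, so the choice of Fisher's exact test, the function name `fisher_exact_greater`, and the counts in the usage note are all assumptions made for the example.

```python
from math import comb


def fisher_exact_greater(a: int, b: int, c: int, d: int) -> float:
    """One-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]].

    Hypothetical layout (an assumption for this sketch, not taken from gNOME):
      a = cases carrying a qualifying variant in the gene
      b = cases not carrying one
      c = controls carrying a qualifying variant in the gene
      d = controls not carrying one
    Returns P(X >= a) under the hypergeometric null of no enrichment in cases.
    """
    cases = a + b            # row total for the case group
    carriers = a + c         # column total for carriers
    total = a + b + c + d
    denom = comb(total, carriers)
    upper = min(cases, carriers)
    # Sum hypergeometric probabilities for tables at least as extreme as observed.
    numer = sum(
        comb(cases, k) * comb(total - cases, carriers - k)
        for k in range(a, upper + 1)
    )
    return numer / denom
```

For example, with made-up counts of 3 of 3 cases and 0 of 3 controls carrying a variant, `fisher_exact_greater(3, 0, 0, 3)` evaluates to 1/20. In practice such a p-value would be computed per gene or gene set and then corrected for multiple testing.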
Pages: 537-547
Number of pages: 11