Prioritizing Disease-Linked Variants, Genes, and Pathways with an Interactive Whole-Genome Analysis Pipeline

被引:19
|
作者
Lee, In-Hee [1 ]
Lee, Kyungjoon [2 ]
Hsing, Michael [1 ]
Choe, Yongjoon [1 ]
Park, Jin-Ho [1 ,3 ]
Kim, Shu Hee [4 ]
Bohn, Justin M. [1 ]
Neu, Matthew B. [1 ]
Hwang, Kyu-Baek [5 ]
Green, Robert C. [6 ]
Kohane, Isaac S. [1 ,2 ]
Kong, Sek Won [1 ]
机构
[1] Childrens Hosp, Harvard Div Hlth Sci & Technol, Childrens Hosp Informat Program, Dept Med, Boston, MA 02115 USA
[2] Harvard Univ, Ctr Biomed Informat, Sch Med, Boston, MA 02115 USA
[3] Seoul Natl Univ Hosp, Dept Family Med, Seoul 110744, South Korea
[4] Stanford Univ, Palo Alto, CA 94305 USA
[5] Soongsil Univ, Sch Comp Sci & Engn, Seoul 156743, South Korea
[6] Brigham & Womens Hosp, Dept Med, Div Genet, Boston, MA 02115 USA
关键词
whole-genome sequences; variant annotation; disease gene discovery; analysis pipeline; RARE-VARIANT; PERSONAL GENOMES; UVEAL MELANOMA; MUTATIONS; SEQUENCE; EXOME; DATABASE; TOOL; FRAMEWORK; COMMON;
D O I
10.1002/humu.22520
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Whole-genome sequencing (WGS) studies are uncovering disease-associated variants in both rare and nonrare diseases. Utilizing the next-generation sequencing for WGS requires a series of computational methods for alignment, variant detection, and annotation, and the accuracy and reproducibility of annotation results are essential for clinical implementation. However, annotating WGS with up to date genomic information is still challenging for biomedical researchers. Here, we present one of the fastest and highly scalable annotation, filtering, and analysis pipelinegNOMEto prioritize phenotype-associated variants while minimizing false-positive findings. Intuitive graphical user interface of gNOME facilitates the selection of phenotype-associated variants, and the result summaries are provided at variant, gene, and genome levels. Moreover, the enrichment results of specific variants, genes, and gene sets between two groups or compared with population scale WGS datasets that is already integrated in the pipeline can help the interpretation. We found a small number of discordant results between annotation software tools in part due to different reporting strategies for the variants with complex impacts. Using two published whole-exome datasets of uveal melanoma and bladder cancer, we demonstrated gNOME's accuracy of variant annotation and the enrichment of loss-of-function variants in known cancer pathways. gNOME Web server and source codes are freely available to the academic community ().
引用
收藏
页码:537 / 547
页数:11
相关论文
共 43 条
  • [21] Whole-Genome Sequencing of a Single Proband Together with Linkage Analysis Identifies a Mendelian Disease Gene
    Sobreira, Nara L. M.
    Cirulli, Elizabeth T.
    Avramopoulos, Dimitrios
    Wohler, Elizabeth
    Oswald, Gretchen L.
    Stevens, Eric L.
    Ge, Dongliang
    Shianna, Kevin V.
    Smith, Jason P.
    Maia, Jessica M.
    Gumbs, Curtis E.
    Pevsner, Jonathan
    Thomas, George
    Valle, David
    Hoover-Fong, Julie E.
    Goldstein, David B.
    PLOS GENETICS, 2010, 6 (06): : 1 - 6
  • [22] Use of whole genome analysis to identify shared genomic variants across breeds in canine mitral valve disease
    Williams, Brian
    Friedenberg, Steven G.
    Keene, Bruce W.
    Tou, Sandy P.
    DeFrancesco, Teresa C.
    Meurs, Kathryn M.
    HUMAN GENETICS, 2021, 140 (11) : 1563 - 1568
  • [23] High-depth whole-genome sequencing identifies structure variants, copy number variants and short tandem repeats associated with Parkinson's disease
    Wang, Chaodong
    Liu, Hankui
    Li, Xu-Ying
    Ma, Jinghong
    Gu, Zhuqin
    Feng, Xiuli
    Xie, Shu
    Tang, Bei-Sha
    Chen, Shengdi
    Wang, Wei
    Wang, Jian
    Zhang, Jianguo
    Chan, Piu
    NPJ PARKINSONS DISEASE, 2024, 10 (01)
  • [24] Whole-genome sequencing and RNA sequencing analysis reveals novel risk genes and differential expression patterns in hepatoblastoma
    Wang, Wuqian
    Zhang, Na
    Chen, Luan
    Zhao, Xianglong
    Shan, Yuhua
    Yang, Fan
    Wang, Bo
    Gao, Hongxiang
    Xu, Min
    Tang, Ping
    Qin, Shengying
    Gu, Song
    GENE, 2024, 897
  • [25] Whole-genome comparative analysis of virulence genes unveils similarities and differences between endophytes and other symbiotic bacteria
    Lopez-Fernandez, Sebastian
    Sonego, Paolo
    Moretto, Marco
    Pancher, Michael
    Engelen, Kristof
    Pertot, Ilaria
    Campisano, Andrea
    FRONTIERS IN MICROBIOLOGY, 2015, 6
  • [26] Genetic Landscape of Rare Autoinflammatory Disease Variants in Qatar and Middle Eastern Populations Through the Integration of Whole-Genome and Exome Datasets
    Sharma, Parul
    Jain, Abhinav
    Scaria, Vinod
    FRONTIERS IN GENETICS, 2021, 12
  • [27] Structure determination and analysis of titin A-band fibronectin type III domains provides insights for disease-linked variants and protein oligomerisation
    Rees, Martin
    Nikoopour, Roksana
    Alexandrovich, Alexander
    Pfuhl, Mark
    Lopes, Luis R.
    Akhtar, Mohammed M.
    Syrris, Petros
    Elliott, Perry
    Carr-White, Gerry
    Gautel, Mathias
    JOURNAL OF STRUCTURAL BIOLOGY, 2023, 215 (03)
  • [28] Whole genome microarray expression analysis in blood identifies pathways linked to signs and symptoms of a patient with hypercalprotectinaemia and hyperzincaemia
    Isaksson, H. S.
    Farkas, S. A.
    Muller, P.
    Gustafsson, D.
    Nilsson, T. K.
    CLINICAL AND EXPERIMENTAL IMMUNOLOGY, 2018, 191 (02) : 240 - 251
  • [29] Global analysis of human duplicated genes reveals the relative importance of whole-genome duplicates originated in the early vertebrate evolution
    Acharya, Debarun
    Ghosh, Tapash C.
    BMC GENOMICS, 2016, 17
  • [30] Bacterial Whole-Genome Sequencing Revisited: Portable, Scalable, and Standardized Analysis for Typing and Detection of Virulence and Antibiotic Resistance Genes
    Leopold, Shana R.
    Goering, Richard V.
    Witten, Anika
    Harmsen, Dag
    Mellmann, Alexander
    JOURNAL OF CLINICAL MICROBIOLOGY, 2014, 52 (07) : 2365 - 2370