PhyreRisk: A Dynamic Web Application to Bridge Genomics, Proteomics and 3D Structural Data to Guide Interpretation of Human Genetic Variants

被引:19
作者
Ofoegbu, Tochukwu C. [1 ]
David, Alessia [1 ]
Kelley, Lawrence A. [1 ]
Mezulis, Stefans [1 ]
Islam, Suhail A. [1 ]
Mersmann, Sophia F. [1 ]
Stromich, Leonie [1 ]
Vakser, Ilya A. [2 ,3 ]
Houlston, Richard S. [4 ]
Sternberg, Michael J. E. [1 ]
机构
[1] Imperial Coll London, Ctr Integrat Syst Biol & Bioinformat, Dept Life Sci, London SW7 2AZ, England
[2] Univ Kansas, Computat Biol Program, Lawrence, KS 66045 USA
[3] Univ Kansas, Dept Mol Biosci, Lawrence, KS 66045 USA
[4] Inst Canc Res, Div Genet & Epidemiol, London SM2 5NG, England
基金
英国惠康基金; 英国生物技术与生命科学研究理事会;
关键词
web resource; sequence-structure mapping; human proteome; genetic variants; PROTEIN; BROWSER; SNPS;
D O I
10.1016/j.jmb.2019.04.043
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
PhyreRisk is an open-access, publicly accessible web application for interactively bridging genomic, proteomic and structural data facilitating the mapping of human variants onto protein structures. A major advance over other tools for sequence-structure variant mapping is that PhyreRisk provides information on 20,214 human canonical proteins and an additional 22,271 alternative protein sequences (isoforms). Specifically, PhyreRisk provides structural coverage (partial or complete) for 70% (14,035 of 20,214 canonical proteins) of the human proteome, by storing 18,874 experimental structures and 84,818 pre-built models of canonical proteins and their isoforms generated using our in house Phyre2. PhyreRisk reports 55,732 experimentally, multi-validated protein interactions from IntAct and 24,260 experimental structures of protein complexes. Another major feature of PhyreRisk is that, rather than presenting a limited set of precomputed variant-structure mapping of known genetic variants, it allows the user to explore novel variants using, as input, genomic coordinates formats (Ensembl, VCF, reference SNP ID and HGVS notations) and Human Build GRCh37 and GRCh38. PhyreRisk also supports mapping variants using amino acid coordinates and searching for genes or proteins of interest. PhyreRisk is designed to empower researchers to translate genetic data into protein structural information, thereby providing a more comprehensive appreciation of the functional impact of variants. (C) 2019 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:2460 / 2466
页数:7
相关论文
共 28 条
[1]  
Adzhubei Ivan, 2013, Curr Protoc Hum Genet, VChapter 7, DOI 10.1002/0471142905.hg0720s76
[2]   UniProt: the universal protein knowledgebase [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Bye-A-Jee, Hema ;
Cowley, Andrew ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Castro, Leyla Garcia ;
Figueira, Luis ;
Garmiri, Penelope ;
Georghiou, George ;
Gonzalez, Daniel ;
Hatton-Ellis, Emma ;
Li, Weizhong ;
Liu, Wudong ;
Lopez, Rodrigo ;
Luo, Jie ;
Lussi, Yvonne ;
MacDougall, Alistair ;
Nightingale, Andrew ;
Palka, Barbara ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Speretta, Elena ;
Turner, Edward ;
Tyagi, Nidhi ;
Volynkin, Vladimir ;
Wardell, Tony ;
Warner, Kate ;
Watkins, Xavier ;
Zaru, Rossana ;
Zellner, Hermann ;
Xenarios, Ioannis .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D158-D169
[3]   RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy [J].
Burley, Stephen K. ;
Berman, Helen M. ;
Bhikadiya, Charmi ;
Bi, Chunxiao ;
Chen, Li ;
Di Costanzo, Luigi ;
Christie, Cole ;
Dalenberg, Ken ;
Duarte, Jose M. ;
Dutta, Shuchismita ;
Feng, Zukang ;
Ghosh, Sutapa ;
Goodsell, David S. ;
Green, Rachel K. ;
Guranovic, Vladimir ;
Guzenko, Dmytro ;
Hudson, Brian P. ;
Kalro, Tara ;
Liang, Yuhe ;
Lowe, Robert ;
Namkoong, Harry ;
Peisach, Ezra ;
Periskova, Irina ;
Prlic, Andreas ;
Randle, Chris ;
Rose, Alexander ;
Rose, Peter ;
Sala, Raul ;
Sekharan, Monica ;
Shao, Chenghua ;
Tan, Lihua ;
Tao, Yi-Ping ;
Valasatava, Yana ;
Voigt, Maria ;
Westbrook, John ;
Woo, Jesse ;
Yang, Huanwang ;
Young, Jasmine ;
Zhuravleva, Marina ;
Zardecki, Christine .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D464-D474
[4]   SIFTS: updated Structure Integration with Function, Taxonomy and Sequences resource allows 40-fold increase in coverage of structure-based annotations for proteins [J].
Dana, Jose M. ;
Gutmanas, Aleksandras ;
Tyagi, Nidhi ;
Qi, Guoying ;
O'Donovan, Claire ;
Martin, Maria ;
Velankar, Sameer .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D482-D489
[5]   Mapping genetic variations to three-dimensional protein structures to enhance variant interpretation: a proposed framework [J].
Glusman, Gustavo ;
Rose, Peter W. ;
Prlic, Andreas ;
Dougherty, Jennifer ;
Duarte, Jose M. ;
Hoffman, Andrew S. ;
Barton, Geoffrey J. ;
Bendixen, Emoke ;
Bergquist, Timothy ;
Bock, Christian ;
Brunk, Elizabeth ;
Buljan, Marija ;
Burley, Stephen K. ;
Cai, Binghuang ;
Carter, Hannah ;
Gao, JianJiong ;
Godzik, Adam ;
Heuer, Michael ;
Hicks, Michael ;
Hrabe, Thomas ;
Karchin, Rachel ;
Leman, Julia Koehler ;
Lane, Lydie ;
Masica, David L. ;
Mooney, Sean D. ;
Moult, John ;
Omenn, Gilbert S. ;
Pearl, Frances ;
Pejaver, Vikas ;
Reynolds, Sheila M. ;
Rokem, Ariel ;
Schwede, Torsten ;
Song, Sicheng ;
Tilgner, Hagen ;
Valasatava, Yana ;
Zhang, Yang ;
Deutsch, Eric W. .
GENOME MEDICINE, 2017, 9
[6]   COSMIC-3D provides structural perspectives on cancer genetics for drug discovery [J].
Jubb, Harry C. ;
Saini, Harpreet K. ;
Verdonk, Marcel L. ;
Forbes, Simon A. .
NATURE GENETICS, 2018, 50 (09) :1200-1202
[7]   The ExAC browser: displaying reference data information from over 60 000 exomes [J].
Karczewski, Konrad J. ;
Weisburd, Ben ;
Thomas, Brett ;
Solomonson, Matthew ;
Ruderfer, Douglas M. ;
Kavanagh, David ;
Hamamsy, Tymor ;
Lek, Monkol ;
Samocha, Kaitlin E. ;
Cummings, Beryl B. ;
Birnbaum, Daniel ;
Daly, Mark J. ;
MacArthur, Daniel G. .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D840-D845
[8]   The Phyre2 web portal for protein modeling, prediction and analysis [J].
Kelley, Lawrence A. ;
Mezulis, Stefans ;
Yates, Christopher M. ;
Wass, Mark N. ;
Sternberg, Michael J. E. .
NATURE PROTOCOLS, 2015, 10 (06) :845-858
[9]   Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm [J].
Kumar, Prateek ;
Henikoff, Steven ;
Ng, Pauline C. .
NATURE PROTOCOLS, 2009, 4 (07) :1073-1082
[10]   GWIDD: a comprehensive resource for genome-wide structural modeling of protein-protein interactions [J].
Kundrotas, Petras J. ;
Zhu, Zhengwei ;
Vakser, Ilya A. .
HUMAN GENOMICS, 2012, 6