Rational confederation of genes and diseases: NGS interpretation via GeneCards, MalaCards and VarElect

被引:85
作者
Rappaport, Noa [1 ,3 ]
Fishilevich, Simon [1 ]
Nudel, Ron [1 ]
Twik, Michal [1 ]
Belinky, Frida [1 ,2 ]
Plaschkes, Inbar [1 ]
Stein, Tsippi Iny [1 ]
Cohen, Dana [1 ]
Oz-Levi, Danit [1 ]
Safran, Marilyn [1 ]
Lancet, Doron [1 ]
机构
[1] Weizmann Inst Sci, Dept Mol Genet, Rehovot, Israel
[2] NIH, Natl Ctr Biotechnol Informat, Bldg 10, Bethesda, MD 20892 USA
[3] Inst Syst Biol, Seattle, WA USA
关键词
COMPENDIUM;
D O I
10.1186/s12938-017-0359-2
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Background: A key challenge in the realm of human disease research is next generation sequencing (NGS) interpretation, whereby identified filtered variant-harboring genes are associated with a patient's disease phenotypes. This necessitates bioinformatics tools linked to comprehensive knowledgebases. The GeneCards suite databases, which include GeneCards (human genes), MalaCards (human diseases) and PathCards (human pathways) together with additional tools, are presented with the focus on MalaCards utility for NGS interpretation as well as for large scale bioinformatic analyses. Results: VarElect, our NGS interpretation tool, leverages the broad information in the GeneCards suite databases. MalaCards algorithms unify disease-related terms and annotations from 69 sources. Further, MalaCards defines hierarchical relatedness- aliases, disease families, a related diseases network, categories and ontological classifications. GeneCards and MalaCards delineate and share a multi-tiered, scored gene-disease network, with stringency levels, including the definition of elite status- high quality gene-disease pairs, coming from manually curated trustworthy sources, that includes 4500 genes for 8000 diseases. This unique resource is key to NGS interpretation by VarElect. VarElect, a comprehensive search tool that helps infer both direct and indirect links between genes and user-supplied disease/phenotype terms, is robustly strengthened by the information found in MalaCards. The indirect mode benefits from GeneCards' diverse gene-to-gene relationships, including SuperPathsintegrated biological pathways from 12 information sources. We are currently adding an important information layer in the form of "disease SuperPaths", generated from the gene-disease matrix by an algorithm similar to that previously employed for biological pathway unification. This allows the discovery of novel gene-disease and disease-disease relationships. The advent of whole genome sequencing necessitates capacities to go beyond protein coding genes. GeneCards is highly useful in this respect, as it also addresses 101,976 non-protein-coding RNA genes. In a more recent development, we are currently adding an inclusive map of regulatory elements and their inferred target genes, generated by integration from 4 resources. Conclusions: MalaCards provides a rich big-data scaffold for in silico biomedical discovery within the gene-disease universe. VarElect, which depends significantly on both GeneCards and MalaCards power, is a potent tool for supporting the interpretation of wet-lab experiments, notably NGS analyses of disease. The GeneCards suite has thus transcended its 2-decade role in biomedical research, maturing into a key player in clinical investigation.
引用
收藏
页数:14
相关论文
共 30 条
[1]   A role for TENM1 mutations in congenital general anosmia [J].
Alkelai, A. ;
Olender, T. ;
Haffner-Krausz, R. ;
Tsoory, M. M. ;
Boyko, V. ;
Tatarskyy, P. ;
Gross-Isseroff, R. ;
Milgrom, R. ;
Shushan, S. ;
Blau, I. ;
Cohn, E. ;
Beeri, R. ;
Levy-Lahad, E. ;
Pras, E. ;
Lancet, D. .
CLINICAL GENETICS, 2016, 90 (03) :211-219
[2]   Pathguide: a Pathway Resource List [J].
Bader, Gary D. ;
Cary, Michael P. ;
Sander, Chris .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D504-D506
[3]  
Belinky Frida, 2015, Database (Oxford), V2015, DOI 10.1093/database/bav006
[4]   Non-redundant compendium of human ncRNA genes in GeneCards [J].
Belinky, Frida ;
Bahir, Iris ;
Stelzer, Gil ;
Zimmerman, Shahar ;
Rosen, Naomi ;
Nativ, Noam ;
Dalah, Irina ;
Stein, Tsippi Iny ;
Rappaport, Noa ;
Mituyama, Toutai ;
Safran, Marilyn ;
Lancet, Doron .
BIOINFORMATICS, 2013, 29 (02) :255-261
[5]   GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data [J].
Ben-Ari Fuchs, Shani ;
Lieder, Iris ;
Stelzer, Gil ;
Mazor, Yaron ;
Buzhor, Ella ;
Kaplan, Sergey ;
Bogoch, Yoel ;
Plaschkes, Inbar ;
Shitrit, Alina ;
Rappaport, Noa ;
Kohn, Asher ;
Edgar, Ron ;
Shenhav, Liraz ;
Safran, Marilyn ;
Lancet, Doron ;
Guan-Golan, Yaron ;
Warshawsky, David ;
Shtrichman, Ronit .
OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2016, 20 (03) :139-151
[6]  
Buzhor E, 2014, REGEN MED, V9, P649, DOI [10.2217/RME.14.35, 10.2217/rme.14.35]
[7]   A Novel Approach to High-Quality Postmortem Tissue Procurement: The GTEx Project [J].
Carithers, Latarsha J. ;
Ardlie, Kristin ;
Barcus, Mary ;
Branton, Philip A. ;
Britton, Angela ;
Buia, Stephen A. ;
Compton, Carolyn C. ;
DeLuca, David S. ;
Peter-Demchok, Joanne ;
Gelfand, Ellen T. ;
Guan, Ping ;
Korzeniewski, Greg E. ;
Lockhart, Nicole C. ;
Rabiner, Chana A. ;
Rao, Abhi K. ;
Robinson, Karna L. ;
Roche, Nancy V. ;
Sawyer, Sherilyn J. ;
Segre, Ayellet V. ;
Shive, Charles E. ;
Smith, Anna M. ;
Sobin, Leslie H. ;
Undale, Anita H. ;
Valentino, Kimberly M. ;
Vaught, Jim ;
Young, Taylor R. ;
Moore, Helen M. .
BIOPRESERVATION AND BIOBANKING, 2015, 13 (05) :311-319
[8]   Single-nucleotide evolutionary constraint scores highlight disease-causing mutations [J].
Cooper, Gregory M. ;
Goode, David L. ;
Ng, Sarah B. ;
Sidow, Arend ;
Bamshad, Michael J. ;
Shendure, Jay ;
Nickerson, Deborah A. .
NATURE METHODS, 2010, 7 (04) :250-251
[9]   GeneHancer: genome-wide integration of enhancers and target genes in GeneCards [J].
Fishilevich, Simon ;
Nudel, Ron ;
Rappaport, Noa ;
Hadar, Rotem ;
Plaschkes, Inbar ;
Stein, Tsippi Iny ;
Rosen, Naomi ;
Kohn, Asher ;
Twik, Michal ;
Safran, Marilyn ;
Lancet, Doron ;
Cohen, Dana .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2017,
[10]   Genic insights from integrated human proteomics in GeneCards [J].
Fishilevich, Simon ;
Zimmerman, Shahar ;
Kohn, Asher ;
Stein, Tsippi Iny ;
Olender, Tsviya ;
Kolker, Eugene ;
Safran, Marilyn ;
Lancet, Doron .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,