DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants

被引:1761
|
作者
Pinero, Janet [1 ]
Bravo, Alex [1 ]
Queralt-Rosinach, Nuria [1 ,2 ]
Gutierrez-Sacristan, Alba [1 ]
Deu-Pons, Jordi [1 ]
Centeno, Emilio [1 ]
Garcia-Garcia, Javier [1 ]
Sanz, Ferran [1 ]
Furlong, Laura I. [1 ]
机构
[1] Univ Pompeu Fabra, Hosp Mar Med Res Inst IMIM, Res Programme Biomed Informat GRIB, Dept Expt & Hlth Sci DCEXS, E-08003 Barcelona, Spain
[2] Scripps Res Inst, Dept Mol & Expt Med, 10666 N Torrey Pines Rd, La Jolla, CA 92037 USA
基金
欧盟地平线“2020”;
关键词
DATABASE; NETWORK; UPDATE; EXTRACTION; PHENOTYPES; KNOWLEDGE; CATALOG; BIOLOGY; SYSTEMS; TEXT;
D O I
10.1093/nar/gkw943
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The information about the genetic basis of human diseases lies at the heart of precision medicine and drug discovery. However, to realize its full potential to support these goals, several problems, such as fragmentation, heterogeneity, availability and different conceptualization of the data must be overcome. To provide the community with a resource free of these hurdles, we have developed DisGeNET (http://www.disgenet.org), one of the largest available collections of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models and the scientific literature. DisGeNET data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype-phenotype relationships. The information is accessible through a web interface, a Cytoscape App, an RDF SPARQL endpoint, scripts in several programming languages and an R package. DisGeNET is a versatile platform that can be used for different research purposes including the investigation of the molecular underpinnings of specific human diseases and their comorbidities, the analysis of the properties of disease genes, the generation of hypothesis on drug therapeutic action and drug adverse effects, the validation of computationally predicted disease genes and the evaluation of text-mining methods performance.
引用
收藏
页码:D833 / D839
页数:7
相关论文
共 50 条
  • [41] LincSNP 2.0: an updated database for linking disease-associated SNPs to human long non-coding RNAs and their TFBSs
    Ning, Shangwei
    Yue, Ming
    Wang, Peng
    Liu, Yue
    Zhi, Hui
    Zhang, Yan
    Zhang, Jizhou
    Gao, Yue
    Guo, Maoni
    Zhou, Dianshuang
    Li, Xin
    Li, Xia
    NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) : D74 - D78
  • [42] Ontological Analysis of Coronavirus Associated Human Genes at the COVID-19 Disease Portal
    Wang, Shur-Jen
    Brodie, Kent C.
    De Pons, Jeffrey L.
    Demos, Wendy M.
    Gibson, Adam C.
    Hayman, G. Thomas
    Hill, Morgan L.
    Kaldunski, Mary L.
    Lamers, Logan
    Laulederkind, Stanley J. F.
    Nalabolu, Harika S.
    Thota, Jyothi
    Thorat, Ketaki
    Tutaj, Marek A.
    Tutaj, Monika
    Vedi, Mahima
    Zacher, Stacy
    Smith, Jennifer R.
    Dwinell, Melinda R.
    Kwitek, Anne E.
    GENES, 2022, 13 (12)
  • [43] Integration of biological data via NMF for identification of human disease-associated gene modules through multi-label classification
    Alberuni, Syed
    Ray, Sumanta
    PLOS ONE, 2024, 19 (12):
  • [44] Detailed computational study of p53 and p16: using evolutionary sequence analysis and disease-associated mutations to predict the functional consequences of allelic variants
    M S Greenblatt
    J G Beaudet
    J R Gump
    K S Godin
    L Trombley
    J Koh
    J P Bond
    Oncogene, 2003, 22 : 1150 - 1163
  • [45] DPP9: Comprehensive In Silico Analyses of Loss of Function Gene Variants and Associated Gene Expression Signatures in Human Hepatocellular Carcinoma
    Huang, Jiali Carrie
    Al Emran, Abdullah
    Endaya, Justine Moreno
    McCaughan, Geoffrey W.
    Gorrell, Mark D.
    Zhang, Hui Emma
    CANCERS, 2021, 13 (07)
  • [46] Detailed computational study of p53 and p16:: using evolutionary sequence analysis and disease-associated mutations to predict the functional consequences of allelic variants
    Greenblatt, MS
    Beaudet, JG
    Gump, JR
    Godin, KS
    Trombley, L
    Koh, J
    Bond, JP
    ONCOGENE, 2003, 22 (08) : 1150 - 1163
  • [47] Integrating 400 million variants from 80,000 human samples with extensive annotations: towards a knowledge base to analyze disease cohorts
    Jörg Hakenberg
    Wei-Yi Cheng
    Philippe Thomas
    Ying-Chih Wang
    Andrew V. Uzilov
    Rong Chen
    BMC Bioinformatics, 17
  • [48] Integrating 400 million variants from 80,000 human samples with extensive annotations: towards a knowledge base to analyze disease cohorts
    Hakenberg, Joerg
    Cheng, Wei-Yi
    Thomas, Philippe
    Wang, Ying-Chih
    Uzilov, Andrew V.
    Chen, Rong
    BMC BIOINFORMATICS, 2016, 17
  • [49] Genetic variants in eleven telomere-associated genes and the risk of incident cardio/cerebrovascular disease: The Women's Genome Health Study
    Zee, Robert Y. L.
    Ridker, Paul M.
    Chasman, Daniel I.
    CLINICA CHIMICA ACTA, 2011, 412 (1-2) : 199 - 202
  • [50] Human disease genes website series: An international, open and dynamic library for up-to-date clinical information
    Dingemans, Alexander J. M.
    Stremmelaar, Diante E.
    Vissers, Lisenka E. L. M.
    Jansen, Sandra
    Nabais Sa, Maria J.
    van Remortele, Angela
    Jonis, Noraly
    Truijen, Kim
    van de Ven, Sam
    Ewals, Jeroen
    Verbruggen, Michel
    Koolen, David A.
    Brunner, Han G.
    Eichler, Evan E.
    Gecz, Jozef
    de Vries, Bert B. A.
    AMERICAN JOURNAL OF MEDICAL GENETICS PART A, 2021, 185 (04) : 1039 - 1046