LitVar: a semantic search engine for linking genomic variant data in PubMed and PMC

Cited by: 86
Authors
Allot, Alexis [1]
Peng, Yifan [1]
Wei, Chih-Hsuan [1]
Lee, Kyubum [1]
Phan, Lon [1]
Lu, Zhiyong [1]
Affiliations
[1] NCBI, NLM, NIH, 8600 Rockville Pike, Bethesda, MD 20894 USA
Funding
National Institutes of Health (USA);
Keywords
INFORMATION; CLINVAR; CANCER; SYSTEM; TMVAR; DBSNP;
DOI
10.1093/nar/gky355
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
The identification and interpretation of genomic variants play a key role in the diagnosis of genetic diseases and related research. These tasks increasingly rely on accessing relevant manually curated information from domain databases (e.g. SwissProt or ClinVar). However, due to the sheer volume of medical literature and the high cost of expert curation, curated variant information in existing databases is often incomplete and out of date. In addition, the same genetic variant can be mentioned in publications under various names (e.g. 'A146T' versus 'c.436G>A' versus 'rs121913527'). A search in PubMed using only one name usually cannot retrieve all relevant articles for the variant of interest. Hence, to help scientists, healthcare professionals, and database curators find the most up-to-date published variant research, we have developed LitVar for the search and retrieval of standardized variant information. In addition, LitVar uses advanced text mining techniques to compute and extract relationships between variants and other associated entities such as diseases and chemicals/drugs. LitVar is publicly available at https://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/LitVar.
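The naming problem described in the abstract can be illustrated with a minimal Python sketch. This is not LitVar's implementation: the synonym table, the choice of the rsID as the canonical key, and the function names `build_index`, `expand_query`, and `to_boolean_query` are all hypothetical and introduced only for illustration. The only data used are the three names quoted in the abstract.

```python
# Hypothetical synonym table keyed by a canonical identifier.
# The three names are the example quoted in the abstract; a real system
# would derive such groupings from text mining and variant databases.
SYNONYMS = {
    "rs121913527": ["A146T", "c.436G>A", "rs121913527"],
}


def build_index(synonyms):
    """Map every known name (case-insensitive) to its canonical identifier."""
    index = {}
    for canonical, names in synonyms.items():
        for name in names:
            index[name.lower()] = canonical
    return index


def expand_query(name, synonyms=SYNONYMS):
    """Return all known names for a variant, or just the input if unknown."""
    index = build_index(synonyms)
    canonical = index.get(name.strip().lower())
    if canonical is None:
        return [name]
    return list(synonyms[canonical])


def to_boolean_query(names):
    """Join names into a quoted OR query string (illustrative format only)."""
    return " OR ".join(f'"{n}"' for n in names)


if __name__ == "__main__":
    print(to_boolean_query(expand_query("A146T")))
```

Searching with the expanded query rather than a single name is one way to reach articles that mention the same variant under a different notation, which is the gap the abstract describes.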
Pages: W530-W536
Page count: 7