Development of Text Mining Tools for Information Retrieval from Patents

被引:3
作者
Alves, Tiago [1 ,2 ]
Rodrigues, Ruben [1 ]
Costa, Hugo [2 ]
Rocha, Miguel [1 ]
机构
[1] Univ Minho, Ctr Biol Engn, P-4710057 Braga, Portugal
[2] Silicolife Lda, P-4715387 Braga, Portugal
来源
11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS | 2017年 / 616卷
关键词
Biomedical text mining; Patents; Information retrieval task; PDF to text conversion; @Note2;
D O I
10.1007/978-3-319-60816-7_9
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Biomedical literature is composed of an ever increasing number of publications in natural language. Patents are a relevant fraction of those, being important sources of information due to all the curated data from the granting process. However, their unstructured data turns the search of information a challenging task. To surpass that, Biomedical text mining (BioTM) creates methodologies to search and structure that data. Several BioTM techniques can be applied to patents. From those, Information Retrieval is the process where relevant data is obtained from collections of documents. In this work, a patent pipeline was developed and integrated into @Note2, an open-source computational framework for BioTM. This integration allows to run further BioTM tools over the patent documents, including Information Extraction processes as Named Entity Recognition or Relation Extraction.
引用
收藏
页码:66 / 73
页数:8
相关论文
共 15 条
[1]  
[Anonymous], 2015, GUID PREP PAT LANDSC
[2]   Getting started in text mining [J].
Cohen, K. Bretonnel ;
Hunter, Lawrence .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (01) :0001-0003
[3]   Combining literature text mining with microarray data: advances for system biology modeling [J].
Faro, Alberto ;
Giordano, Daniela ;
Spampinato, Concetto .
BRIEFINGS IN BIOINFORMATICS, 2012, 13 (01) :61-82
[4]  
Hannan S. A, 2014, INT J ADV RES SCI EN, V3
[5]  
Holley R., 2009, D LIB MAGAZINE MAGAZ, V15
[6]   Detection of IUPAC and IUPAC-like chemical names [J].
Klinger, Roman ;
Kolarik, Corinna ;
Fluck, Juliane ;
Hofmann-Apitius, Martin ;
Friedrich, Christoph M. .
BIOINFORMATICS, 2008, 24 (13) :I268-I276
[7]   Text-mining and information-retrieval services for molecular biology [J].
Krallinger, M ;
Valencia, A .
GENOME BIOLOGY, 2005, 6 (07)
[8]  
Latimer MT, 2005, GENOME BIOL, V6
[9]   PubMed and beyond: a survey of web tools for searching biomedical literature [J].
Lu, Zhiyong .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2011,
[10]  
Miner G, 2012, PRACTICAL TEXT MINING AND STATISTICAL ANALYSIS FOR NON-STRUCTURED TEXT DATA APPLICATIONS, P1