A rule-based named-entity recognition method for knowledge extraction of evidence based dietary recommendations

被引:78
作者
Eftimov, Tome [1 ,2 ]
Seljak, Barbara Korousic [1 ]
Korosec, Peter [1 ,3 ]
机构
[1] Josef Stefan Inst, Comp Syst Dept, Ljubljana, Slovenia
[2] Josef Stefan Int Postgrad Sch, Ljubljana, Slovenia
[3] Nat Sci & Informat Technol, Fac Math, Koper, Slovenia
来源
PLOS ONE | 2017年 / 12卷 / 06期
关键词
ALGORITHMS;
D O I
10.1371/journal.pone.0179488
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Evidence-based dietary information represented as unstructured text is a crucial information that needs to be accessed in order to help dietitians follow the new knowledge arrives daily with newly published scientific reports. Different named-entity recognition (NER) methods have been introduced previously to extract useful information from the biomedical literature. They are focused on, for example extracting gene mentions, proteins mentions, relationships between genes and proteins, chemical concepts and relationships between drugs and diseases. In this paper, we present a novel NER method, called drNER, for knowledge extraction of evidence-based dietary information. To the best of our knowledge this is the first attempt at extracting dietary concepts. DrNER is a rule-based NER that consists of two phases. The first one involves the detection and determination of the entities mention, and the second one involves the selection and extraction of the entities. We evaluate the method by using text corpora from heterogeneous sources, including text from several scientifically validated web sites and text from scientific publications. Evaluation of the method showed that drNER gives good results and can be used for knowledge extraction of evidence-based dietary recommendations.
引用
收藏
页数:32
相关论文
共 80 条
  • [1] Using text mining techniques to extract phenotypic information from the PhenoCHF corpus
    Alnazzawi, Noha
    Thompson, Paul
    Batista-Navarro, Riza
    Ananiadou, Sophia
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2015, 15
  • [2] Ando RK, 2005, J MACH LEARN RES, V6, P1817
  • [3] [Anonymous], 1999, FDN STAT NATURAL LAN
  • [4] [Anonymous], 2015, P 9 ACM C REC SYST N
  • [5] [Anonymous], 2015, OPENNLP APACHE OPENN
  • [6] [Anonymous], FAM PRACT MANAGE
  • [7] [Anonymous], 2016, P 4 BIONLP SHAR TASK
  • [8] [Anonymous], [No title captured]
  • [9] [Anonymous], 2002, HDB BRAIN THEORY NEU
  • [10] [Anonymous], 2016, P 4 BIONLP SHARED TA