Student Research Abstract: Dual Architecture for Name Entity Extraction and Relation Extraction with Applications in Medical Corpora

被引:0
作者
Caballero, Ernesto Quevedo [1 ]
机构
[1] Baylor Univ, Comp Sci, Waco, TX 76798 USA
来源
37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING | 2022年
基金
美国国家科学基金会;
关键词
Deep Learning; Ontology Learning; Information Retrieval;
D O I
10.1145/3477314.3506960
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
There is a growing interest in automatic knowledge discovery in plain text documents. Automation enables the analysis of massive collections of information. Such efforts are relevant in the health domain which has a large volume of available resources to transform areas important for society when addressing various health research challenges. However, knowledge discovery is usually aided by annotated corpora, which are scarce resources in the literature. This work considers as a start point existent health-oriented Spanish dataset. In addition, it also creates an English variant using the same tagging system. Furthermore, we design and analyze two separated architectures for Entity Extraction and Relation Recognition that outperform previous works in the Spanish dataset. We also evaluate their performance in the English version with such promising results. Finally, we perform a use case experiment to evaluate the utility of the output of these two architectures in Information Retrieval systems.
引用
收藏
页码:883 / 886
页数:4
相关论文
共 15 条
  • [1] [Anonymous], 2020, UH MAJA KD EHEALTH K
  • [2] Boteva Vera, 2016, Advances in Information Retrieval. 38th European Conference on IR Research, ECIR 2016. Proceedings
  • [3] LNCS 9626, P716, DOI 10.1007/978-3-319-30671-1_58
  • [4] Dai X, 2020, Arxiv, DOI arXiv:2010.11683
  • [5] Garcia-Pablos Aitor, 2020, P IBERIAN LANGUAGES, V2020
  • [6] A Survey on Deep Learning for Named Entity Recognition
    Li, Jing
    Sun, Aixin
    Han, Jianglei
    Li, Chenliang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (01) : 50 - 70
  • [7] Liu Y, 2015, Arxiv, DOI arXiv:1507.04646
  • [8] Pawar S, 2017, Arxiv, DOI arXiv:1712.05191
  • [9] A computational ecosystem to support eHealth Knowledge Discovery technologies in Spanish
    Piad-Morffis, Alejandro
    Gutierrez, Yoan
    Almeida-Cruz, Yudivian
    Munoz, Rafael
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 109
  • [10] Piad-Morffis Alejandro, 2020, P IBERIAN LANGUAGES