Student Research Abstract: Dual Architecture for Name Entity Extraction and Relation Extraction with Applications in Medical Corpora

被引:0
作者
Caballero, Ernesto Quevedo [1 ]
机构
[1] Baylor Univ, Comp Sci, Waco, TX 76798 USA
来源
37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING | 2022年
基金
美国国家科学基金会;
关键词
Deep Learning; Ontology Learning; Information Retrieval;
D O I
10.1145/3477314.3506960
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
There is a growing interest in automatic knowledge discovery in plain text documents. Automation enables the analysis of massive collections of information. Such efforts are relevant in the health domain which has a large volume of available resources to transform areas important for society when addressing various health research challenges. However, knowledge discovery is usually aided by annotated corpora, which are scarce resources in the literature. This work considers as a start point existent health-oriented Spanish dataset. In addition, it also creates an English variant using the same tagging system. Furthermore, we design and analyze two separated architectures for Entity Extraction and Relation Recognition that outperform previous works in the Spanish dataset. We also evaluate their performance in the English version with such promising results. Finally, we perform a use case experiment to evaluate the utility of the output of these two architectures in Information Retrieval systems.
引用
收藏
页码:883 / 886
页数:4
相关论文
共 15 条