Active Learning for Uneven Noisy Labeled Data in Mention-Level Relation Extraction

被引:1
作者
Wei Yuliang [1 ]
Xin Guodong [1 ]
Wang Wei [1 ]
Wang Bailing [1 ]
机构
[1] Harbin Inst Technol, Harbin 264209, Heilongjiang, Peoples R China
关键词
Relation extraction; active learning; text mining; deep learning;
D O I
10.1109/ACCESS.2019.2911889
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mention-level relation extraction (mRE) plays an important role in extracting relational information from short texts such as those exchanged in a social network. Deep learning (DL) has made remarkable achievements; the main problem encountered with DL in mRE is a lack of training samples. In this paper, we present a design for a quick sample-marking method. First, we construct an uneven noisy labeled data (UNLD) set using a pattern matching algorithm, and then a relabeling framework is put forward for modifying the UNLD. With regard to the accuracy, the recall rates of categories with sufficient samples increased from 0.4 to nearly 1 using the relabeling framework. We have released our code and other resources for further research (https://github.com/curtainsky/UNLD).
引用
收藏
页码:51648 / 51655
页数:8
相关论文
共 22 条
[1]  
Agichtein E., 2000, ACM 2000. Digital Libraries. Proceedings of the Fifth ACM Conference on Digital Libraries, P85, DOI 10.1145/336597.336644
[2]  
[Anonymous], 2017, P 2017 C EMP METH NA
[3]  
Brin S, 1999, LECT NOTES COMPUT SC, V1590, P172
[4]  
Chen JX, 2006, COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, P129
[5]  
Cheng H T, 2016, P 1 WORKSH DEEP LEAR, P7
[6]  
Gunes Erkan., 2007, Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLF-CoNLL), P228
[7]  
Hoffmann Raphael, 2011, P 49 ANN M ASS COMP, P541
[8]  
Kambhatla Nanda, 2004, P ACL 2004 INT DEM, P22, DOI DOI 10.3115/1219044.1219066
[9]   Deep learning [J].
LeCun, Yann ;
Bengio, Yoshua ;
Hinton, Geoffrey .
NATURE, 2015, 521 (7553) :436-444
[10]  
LIANG HL, 2017, P IEEE C COMP VIS PA, P2340