StuffIE: Semantic Tagging of Unlabeled Facets Using Fine-Grained Information Extraction

被引:8
作者
Prasojo, Radityo Eko [1 ]
Kacimi, Mouna [1 ]
Nutt, Werner [1 ]
机构
[1] Free Univ Bozen Bolzano, Bozen Bolzano, Italy
来源
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT | 2018年
关键词
Facet extraction; distant learning; semantic labeling;
D O I
10.1145/3269206.3271812
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent knowledge extraction methods are moving towards ternary and higher-arity relations to capture more information about binary facts. An example is to include the time, the location, and the duration of a specific fact. These relations can be even more complex to extract in advanced domains such as news, where events typically come with different facets including reasons, consequences, purposes, involved parties, and related events. The main challenge consists in first finding the set of facets related to each fact, and second tagging those facets to the relevant category. In this paper, we tackle the above problems by proposing StuffIE, a fine-grained information extraction approach which is facet-centric. We exploit the Stanford dependency parsing enhanced by lexical databases such as WordNet to extract nested triple relations. Then, we exploit the syntactical dependencies to semantically tag facets using distant learning based on Oxford dictionary. We have tested the accuracy of the extracted facets and their semantic tags using DUC'04 dataset. The results show the high accuracy and coverage of our approach with respect to ClausIE, OLLIE, SEMAFOR SRL and Illinois SRL.
引用
收藏
页码:467 / 476
页数:10
相关论文
共 27 条
[1]  
Angeli G, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P344
[2]  
[Anonymous], 1998, COLING 1998 VOLUME 1, DOI DOI 10.3115/980845.980860
[3]  
Arora Tengyu Ma Sanjeev, 2017, ICLR
[4]  
Bhutani Nikita., 2016, P 2016 C EMP METH NA, P55, DOI [10.18653/v1/D16-1006, DOI 10.18653/V1/D16-1006]
[5]  
Buche Patrice, 2016, P 6 WIMS WIMS 16 ACM
[6]  
Christensen Janara., 2010, Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading, FAM-LbR '10, P52
[7]  
Clarke J., 2012, P 8 INT C LANG RES E
[8]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[9]  
Corder SP, 1968, DOUBLE OBJECT VERBS
[10]  
de Sa Mesquita Filipe, 2013, P 2013 C EMP METH NA, P447