Modeling Instance Interactions for Joint Information Extraction with Neural High-Order Conditional Random Field

被引:0
作者
Jia, Zixia [1 ,2 ]
Yan, Zhaohui [2 ]
Han, Wenjuan [3 ]
Zheng, Zilong [1 ]
Tu, Kewei [2 ]
机构
[1] Beijing Inst Gen Artificial Intelligence BIGAI, Beijing, Peoples R China
[2] ShanghaiTech Univ, Shanghai, Peoples R China
[3] Beijing Jiaotong Univ, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1 | 2023年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prior works on joint Information Extraction (IE) typically model instance (e.g., event triggers, entities, roles, relations) interactions by representation enhancement, type dependencies scoring, or global decoding. We find that the previous models generally consider binary type dependency scoring of a pair of instances, and leverage local search such as beam search to approximate global solutions. To better integrate cross-instance interactions, in this work, we introduce a joint IE framework (CRFIE) that formulates joint IE as a high-order Conditional Random Field. Specifically, we design binary factors and ternary factors to directly model interactions between not only a pair of instances but also triplets. Then, these factors are utilized to jointly predict labels of all instances. To address the intractability problem of exact high-order inference, we incorporate a high-order neural decoder that is unfolded from a mean-field variational inference method, which achieves consistent learning and inference. The experimental results show that our approach achieves consistent improvements on three IE tasks compared with our baseline and prior work.
引用
收藏
页码:13695 / 13710
页数:16
相关论文
共 46 条
  • [1] [Anonymous], 2022, P 2022 C N AM CHAPT
  • [2] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [3] Dugad Rakesh, 1996, SPANN961
  • [4] VITERBI ALGORITHM
    FORNEY, GD
    [J]. PROCEEDINGS OF THE IEEE, 1973, 61 (03) : 268 - 278
  • [5] Fu TJ, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1409
  • [6] Ghamrawi N., 2005, P 14 ACM INT C INF K, P195, DOI DOI 10.1145/1099554.1099591
  • [7] Jia Zixia, 2022, P AAAI C ART INT
  • [8] Khue Le-Huu D., 2021, ARXIV211014759
  • [9] Lafferty J.D., 2001, P INT C MACHINE LEAR, P282
  • [10] Lample G., 2016, P 2016 C N AM CHAPTE, P260, DOI 10.18653/v1/N16-1030