QuAChIE: Question Answering based Chinese Information Extraction System

被引:4
作者
Ru, Dongyu [1 ]
Wang, Zhenghui [1 ]
Qiu, Lin [1 ]
Zhou, Hao [2 ]
Li, Lei [2 ]
Zhang, Weinan [1 ]
Yu, Yong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Bytedance AI Lab, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20) | 2020年
关键词
Information Extraction; Question Answering;
D O I
10.1145/3397271.3401411
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present the design of QuAChIE, a Question Answering based Chinese Information Extraction system. QuAChIE mainly depends on a well-trained question answering model to extract high-quality triples. The group of head entity and relation are regarded as a question given the input text as the context. For the training and evaluation of each model in the system, we build a large-scale information extraction dataset using Wikidata and Wikipedia pages by distant supervision. The advanced models implemented on top of the pre-trained language model and the enormous distant supervision data enable QuAChIE to extract relation triples from documents with cross-sentence correlations. The experimental results on the test set and the case study based on the interactive demonstration show its satisfactory Information Extraction quality on Chinese document-level texts.
引用
收藏
页码:2177 / 2180
页数:4
相关论文
共 13 条
  • [1] [Anonymous], 2017, ABS171107010 CORR
  • [2] Huang Wen-Chin, 2019, ARXIV PREPRINT ARXIV
  • [3] Jacob Devlin, 2018, NAACL
  • [4] Chinese Open Relation Extraction and Knowledge Base Establishment
    Jia, Shengbin
    Shijia, E.
    Li, Maozhen
    Xiang, Yang
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2018, 17 (03)
  • [5] Li XY, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1340
  • [6] Li ZR, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P4377
  • [7] Qiu L., 2018, ARXIV PREPRINT ARXIV
  • [8] Raiman J, 2018, AAAI CONF ARTIF INTE, P5406
  • [9] CoType: Joint Extraction of Typed Entities and Relations with Knowledge Bases
    Ren, Xiang
    Wu, Zeqiu
    He, Wenqi
    Qu, Meng
    Voss, Clare R.
    Ji, Heng
    Abdelzaher, Tarek F.
    Han, Jiawei
    [J]. PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 1015 - 1024
  • [10] Vaswani A, 2017, ADV NEUR IN, V30