QuAChIE: Question Answering based Chinese Information Extraction System

被引:4
作者
Ru, Dongyu [1 ]
Wang, Zhenghui [1 ]
Qiu, Lin [1 ]
Zhou, Hao [2 ]
Li, Lei [2 ]
Zhang, Weinan [1 ]
Yu, Yong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Bytedance AI Lab, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20) | 2020年
关键词
Information Extraction; Question Answering;
D O I
10.1145/3397271.3401411
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present the design of QuAChIE, a Question Answering based Chinese Information Extraction system. QuAChIE mainly depends on a well-trained question answering model to extract high-quality triples. The group of head entity and relation are regarded as a question given the input text as the context. For the training and evaluation of each model in the system, we build a large-scale information extraction dataset using Wikidata and Wikipedia pages by distant supervision. The advanced models implemented on top of the pre-trained language model and the enormous distant supervision data enable QuAChIE to extract relation triples from documents with cross-sentence correlations. The experimental results on the test set and the case study based on the interactive demonstration show its satisfactory Information Extraction quality on Chinese document-level texts.
引用
收藏
页码:2177 / 2180
页数:4
相关论文
共 13 条
[1]  
Devlin Jacob, 2018, Bert: Pre-training of deep bidirectional transformers for language understanding
[2]  
Huang W., 2019, ARXIV PREPRINT ARXIV
[3]   Chinese Open Relation Extraction and Knowledge Base Establishment [J].
Jia, Shengbin ;
Shijia, E. ;
Li, Maozhen ;
Xiang, Yang .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2018, 17 (03)
[4]  
Li XY, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1340
[5]  
Li ZR, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P4377
[6]  
Qiu L., 2018, ARXIV PREPRINT ARXIV
[7]  
Raiman J, 2018, AAAI CONF ARTIF INTE, P5406
[8]   CoType: Joint Extraction of Typed Entities and Relations with Knowledge Bases [J].
Ren, Xiang ;
Wu, Zeqiu ;
He, Wenqi ;
Qu, Meng ;
Voss, Clare R. ;
Ji, Heng ;
Abdelzaher, Tarek F. ;
Han, Jiawei .
PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, :1015-1024
[9]  
Vaswani A, 2017, ADV NEUR IN, V30
[10]   Wikidata: A Free Collaborative Knowledgebase [J].
Vrandecic, Denny ;
Kroetzsch, Markus .
COMMUNICATIONS OF THE ACM, 2014, 57 (10) :78-85