Chinese Triple Extraction Based on BERT Model

被引:2
|
作者
Deng, Weidong [1 ,2 ]
Liu, Yun [1 ,2 ]
机构
[1] Beijing Jiao Tong Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Beijing Municipal Commiss Educ, Key Lab Commun & Informat Syst, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021) | 2021年
基金
中国国家自然科学基金;
关键词
triple; information extraction; relation classification; entity tagging; BERT;
D O I
10.1109/IMCOM51814.2021.9377404
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information extraction (IE) plays a crucial role in natural language processing, which extracts structured facts like entities, attributes, relations and events from unstructured text. The results of information extraction can be applied in many fields including information retrieval, intelligent QA system, to name a few. We define a pair of entities and their relation from a sentence as a triple. Different from most relation extraction tasks, which only extract one relation from a sentence of known entities, we achieved that extracting both relation and entities(a triple, as defined above), from a plain sentence. Until now, there are so many methods proposed to solve information extraction problem and deep learning has made great progress last several years. Among the field of deep learning, the pretrained model BERT has achieved greatly successful results in a lot of NLP tasks. So we divide our triple extraction task into two sub-tasks, relation classification and entity tagging, and design two models based on BERT for these two sub-tasks, including a CNN-BERT and a Simple BERT. We experimented our models on DuIE Chinese dataset and achieved excellent results.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] A Unified Knowledge Extraction Method Based on BERT and Handshaking Tagging Scheme
    Yang, Ning
    Pun, Sio Hang
    Vai, Mang, I
    Yang, Yifan
    Miao, Qingliang
    APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [42] Extraction of temporal information from social media messages using the BERT model
    Kai Ma
    Yongjian Tan
    Miao Tian
    Xuejing Xie
    Qinjun Qiu
    Sanfeng Li
    Xin Wang
    Earth Science Informatics, 2022, 15 : 573 - 584
  • [43] Sentiment Analysis of Chinese E-commerce Reviews Based on BERT
    Xie, Song
    Cao, Jingjing
    Wu, Zhou
    Liu, Kai
    Tao, Xiaohui
    Xie, Haoran
    2020 IEEE 18TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), VOL 1, 2020, : 713 - 718
  • [44] Research on Chinese Keyword Recognition Based on BERT Binary Classification Algorithm
    Zhu, Chunling
    Wu, Di
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 689 - 695
  • [45] A Sentence Classification Method for Chinese Spelling Error Detection Based on BERT
    Jiang, Jin
    Zhou, Yanquan
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 369 - 372
  • [46] BERT with Enhanced Layer for Assistant Diagnosis Based on Chinese Obstetric EMRs
    Zhang, Kunli
    Liu, Chuang
    Duan, Xuemin
    Zhou, Lijuan
    Zhao, Yueshu
    Zan, Hongying
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 384 - 389
  • [47] Multi-label Classification of Chinese Judicial Documents based on BERT
    Dai, Mian
    Liu, Chao-Lin
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1866 - 1867
  • [48] Extraction of temporal information from social media messages using the BERT model
    Ma, Kai
    Tan, Yongjian
    Tian, Miao
    Xie, Xuejing
    Qiu, Qinjun
    Li, Sanfeng
    Wang, Xin
    EARTH SCIENCE INFORMATICS, 2022, 15 (01) : 573 - 584
  • [49] BERT-BiLSTM-Attention model for sentiment analysis on Chinese stock reviews
    Li, Xiaoyan
    Chen, Lei
    Chen, Baoguo
    Ge, Xianlei
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [50] Clinical Trial Information Extraction with BERT
    Liu, Xiong
    Hersch, Greg L.
    Khalil, Iya
    Devarakonda, Murthy
    2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 505 - 506