Extraction of temporal information from social media messages using the BERT model

被引:20
作者
Ma, Kai [1 ]
Tan, Yongjian [1 ]
Tian, Miao [1 ]
Xie, Xuejing [2 ]
Qiu, Qinjun [2 ,3 ,4 ]
Li, Sanfeng [5 ]
Wang, Xin [6 ]
机构
[1] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang 443002, Peoples R China
[2] Natl Engn Res Ctr Geog Informat Syst, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Sch Geog & Informat Engn, Wuhan 430074, Peoples R China
[4] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[5] Wuhan Zondy Cyber Sci & Technol Co Ltd, Wuhan, Peoples R China
[6] Jinan Rail Transit Grp Co Ltd, Jinan, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Temporal information extraction; Temporal expression recognition; BERT; Natural language processing; SYSTEM;
D O I
10.1007/s12145-021-00756-6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Temporal information extraction from social media messages is of critical importance to several geographical applications. Combined with the characteristics of temporal information descriptions in Chinese text, different time expression patterns formed by time unit combinations are summarized. A deep learning-based information extraction algorithm (named BERT-BiLSTM-CRF) for automatically extracting temporal information from social media messages is proposed. Based on the bidirectional long short-term memory-conditional random field (BiLSTM-CRF) model, the BERT (bidirectional encoder representations from transformers) pretrained language model was used to enhance the generalization ability of the word vector model to capture long-range contextual information; then, the trained word vector was input into the BiLSTM-CRF model for further training. The proposed model was then evaluated on the constructed corpus, a set of manually annotated Chinese texts from social media messages. Among the basic models, the BERT-BiLSTM-CRF achieved the highest average F1-score of 85%. The experimental results show that the proposed method outperforms the current state-of-the-art models.
引用
收藏
页码:573 / 584
页数:12
相关论文
共 51 条
  • [11] Annotation projection for temporal information extraction
    Giannella, Chris R.
    Winder, Ransom K.
    Jubinski, Joseph P.
    [J]. NATURAL LANGUAGE ENGINEERING, 2019, 25 (03) : 385 - 403
  • [12] Hyperspectral image classification using multi-task feature leverage with multi-variant deep learning
    Jayapriya, Kalyanakumar
    Jacob, Israel Jeena
    Darney, Paulraj Ebby
    [J]. EARTH SCIENCE INFORMATICS, 2020, 13 (04) : 1093 - 1102
  • [13] Jeong Y.S., 2015, P 19 C COMP NAT LANG, P279
  • [14] Kolomiyets O., 2010, Proceedings of the 5th International Workshop on Semantic Evaluation, P325
  • [15] Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review
    Kreimeyer, Kory
    Foster, Matthew
    Pandey, Abhishek
    Arya, Nina
    Halford, Gwendolyn
    Jones, Sandra F.
    Forshee, Richard
    Walderhaug, Mark
    Botsis, Taxiarchis
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 73 : 14 - 29
  • [16] A Survey on Temporal Reasoning for Temporal Information Extraction from Text
    Leeuwenberg, Artuur
    Moens, Marie-Francine
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 66 : 341 - 380
  • [17] Li J., 2012, COMPUT SCI, V39, P191
  • [18] Li WJ, 2001, J AM SOC INF SCI TEC, V52, P748, DOI 10.1002/asi.1126
  • [19] A system for automatically extracting clinical events with temporal information
    Li, Zhijing
    Li, Chen
    Long, Yu
    Wang, Xuan
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
  • [20] MedTime: A temporal information extraction system for clinical narratives
    Lin, Yu-Kai
    Chen, Hsinchun
    Brown, Randall A.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 : S20 - S28