Extraction of temporal information from social media messages using the BERT model

被引：20

作者：

Ma, Kai ^{[1
]}

Tan, Yongjian ^{[1
]}

Tian, Miao ^{[1
]}

Xie, Xuejing ^{[2
]}

Qiu, Qinjun ^{[2
,3
,4
]}

Li, Sanfeng ^{[5
]}

Wang, Xin ^{[6
]}

机构：

[1] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang 443002, Peoples R China

[2] Natl Engn Res Ctr Geog Informat Syst, Wuhan 430074, Peoples R China

[3] China Univ Geosci, Sch Geog & Informat Engn, Wuhan 430074, Peoples R China

[4] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China

[5] Wuhan Zondy Cyber Sci & Technol Co Ltd, Wuhan, Peoples R China

[6] Jinan Rail Transit Grp Co Ltd, Jinan, Peoples R China

来源：

EARTH SCIENCE INFORMATICS | 2022年 / 15卷 / 01期

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

Temporal information extraction; Temporal expression recognition; BERT; Natural language processing; SYSTEM;

D O I：

10.1007/s12145-021-00756-6

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Temporal information extraction from social media messages is of critical importance to several geographical applications. Combined with the characteristics of temporal information descriptions in Chinese text, different time expression patterns formed by time unit combinations are summarized. A deep learning-based information extraction algorithm (named BERT-BiLSTM-CRF) for automatically extracting temporal information from social media messages is proposed. Based on the bidirectional long short-term memory-conditional random field (BiLSTM-CRF) model, the BERT (bidirectional encoder representations from transformers) pretrained language model was used to enhance the generalization ability of the word vector model to capture long-range contextual information; then, the trained word vector was input into the BiLSTM-CRF model for further training. The proposed model was then evaluated on the constructed corpus, a set of manually annotated Chinese texts from social media messages. Among the basic models, the BERT-BiLSTM-CRF achieved the highest average F1-score of 85%. The experimental results show that the proposed method outperforms the current state-of-the-art models.

引用

页码：573 / 584

页数：12

共 51 条

[1] Ahn D, 2005, DAGST SEM P SCHL DAG
[2] Extraction of temporal relations from clinical free text: A systematic review of current approaches
Alfattni, Ghada
Peek, Niels
Nenadic, Goran
[J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 108 (108)
[3] Amigo E, 2021, SIGIR 2011 WORKSH EN
[4] [Anonymous], 2017, ARXIV PREPRINT ARXIV
[5] TEMPTING system: A hybrid method of rule and machine learning for temporal relation extraction in patient discharge summaries
Chang, Yung-Chun
Dai, Hong-Jie
Wu, Johnny Chi-Yang
Chen, Jian-Ming
Tsai, Richard Tzong-Han
Hsu, Wen-Lian
[J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 : S54 - S62
[6] Pattern-based bootstrapping framework for biomedical relation extraction
Deepika, S. S.
Geetha, T. V.
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 99
[7] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8] FINDING STRUCTURE IN TIME
ELMAN, JL
[J]. COGNITIVE SCIENCE, 1990, 14 (02) : 179 - 211
[9] Ferro L, TIDES 2005 STANDARD
[10] Restricted Boltzmann machines for vector representation of speech in speaker recognition
Ghahabi, Omid
Hernando, Javier
[J]. COMPUTER SPEECH AND LANGUAGE, 2018, 47 : 16 - 29

← 1 2 3 4 5 6 →