Extraction of temporal information from social media messages using the BERT model

被引：20

作者：

Ma, Kai ^{[1
]}

Tan, Yongjian ^{[1
]}

Tian, Miao ^{[1
]}

Xie, Xuejing ^{[2
]}

Qiu, Qinjun ^{[2
,3
,4
]}

Li, Sanfeng ^{[5
]}

Wang, Xin ^{[6
]}

机构：

[1] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang 443002, Peoples R China

[2] Natl Engn Res Ctr Geog Informat Syst, Wuhan 430074, Peoples R China

[3] China Univ Geosci, Sch Geog & Informat Engn, Wuhan 430074, Peoples R China

[4] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China

[5] Wuhan Zondy Cyber Sci & Technol Co Ltd, Wuhan, Peoples R China

[6] Jinan Rail Transit Grp Co Ltd, Jinan, Peoples R China

来源：

EARTH SCIENCE INFORMATICS | 2022年 / 15卷 / 01期

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

Temporal information extraction; Temporal expression recognition; BERT; Natural language processing; SYSTEM;

D O I：

10.1007/s12145-021-00756-6

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Temporal information extraction from social media messages is of critical importance to several geographical applications. Combined with the characteristics of temporal information descriptions in Chinese text, different time expression patterns formed by time unit combinations are summarized. A deep learning-based information extraction algorithm (named BERT-BiLSTM-CRF) for automatically extracting temporal information from social media messages is proposed. Based on the bidirectional long short-term memory-conditional random field (BiLSTM-CRF) model, the BERT (bidirectional encoder representations from transformers) pretrained language model was used to enhance the generalization ability of the word vector model to capture long-range contextual information; then, the trained word vector was input into the BiLSTM-CRF model for further training. The proposed model was then evaluated on the constructed corpus, a set of manually annotated Chinese texts from social media messages. Among the basic models, the BERT-BiLSTM-CRF achieved the highest average F1-score of 85%. The experimental results show that the proposed method outperforms the current state-of-the-art models.

引用

页码：573 / 584

页数：12

共 51 条

[11] Annotation projection for temporal information extraction
Giannella, Chris R.
Winder, Ransom K.
Jubinski, Joseph P.
[J]. NATURAL LANGUAGE ENGINEERING, 2019, 25 (03) : 385 - 403
[12] Hyperspectral image classification using multi-task feature leverage with multi-variant deep learning
Jayapriya, Kalyanakumar
Jacob, Israel Jeena
Darney, Paulraj Ebby
[J]. EARTH SCIENCE INFORMATICS, 2020, 13 (04) : 1093 - 1102
[13] Jeong Y.S., 2015, P 19 C COMP NAT LANG, P279
[14] Kolomiyets O., 2010, Proceedings of the 5th International Workshop on Semantic Evaluation, P325
[15] Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review
Kreimeyer, Kory
Foster, Matthew
Pandey, Abhishek
Arya, Nina
Halford, Gwendolyn
Jones, Sandra F.
Forshee, Richard
Walderhaug, Mark
Botsis, Taxiarchis
[J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 73 : 14 - 29
[16] A Survey on Temporal Reasoning for Temporal Information Extraction from Text
Leeuwenberg, Artuur
Moens, Marie-Francine
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 66 : 341 - 380
[17] Li J., 2012, COMPUT SCI, V39, P191
[18] Li WJ, 2001, J AM SOC INF SCI TEC, V52, P748, DOI 10.1002/asi.1126
[19] A system for automatically extracting clinical events with temporal information
Li, Zhijing
Li, Chen
Long, Yu
Wang, Xuan
[J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
[20] MedTime: A temporal information extraction system for clinical narratives
Lin, Yu-Kai
Chen, Hsinchun
Brown, Randall A.
[J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 : S20 - S28

← 1 2 3 4 5 6 →