Application of named entity recognition on tweets during earthquake disaster: a deep learning-based approach

Cited by: 0

Authors:

Nazmiye Eligüzel

Cihan Çetinkaya

Türkay Dereli

Affiliations:

[1] Gaziantep University, Industrial Engineering

[2] Adana Alparslan Türkeş Science and Technology University, Department of Management Information Systems

[3] Hasan Kalyoncu University, Office of the President

Source:

Soft Computing | 2022 / Vol. 26

Keywords:

Disaster; Earthquake; Named entity recognition; Recurrent Neural Network; Twitter

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

Twitter is heavily used during disaster events and emergencies, which makes it an important source of essential information. Named entity recognition (NER), the process of identifying the elementary units in a text and classifying them into pre-defined categories, plays a significant role in extracting essential and useful information. However, NER is challenging on Twitter because tweets are informal and contain grammatical errors and nonstandard abbreviations. In this paper, recurrent neural network (RNN)-based approaches, covering a range of activation functions and optimization functions, are combined with NER tools to extract named entities such as organization, person, and location from tweets. Inputs for the RNN models are provided by two NER tools: the Natural Language Toolkit (NLTK) and the General Architecture for Text Engineering (GATE). The pre-labeled data are then trained using the GloVe word embedding technique, and RNN variants such as LSTM, BLSTM, and GRU are demonstrated. The best-performing models among the RNN variants are presented for predicting named entities. The Yellowbrick interpreter is used to evaluate the proposed method, and the Wilcoxon signed-rank test is applied to the results on two different data sets to demonstrate the consistency of the proposed method. In addition, a comparison is made with existing machine learning methods. Experiments on the Nepal earthquake Twitter data set show that the RNN-based approaches achieve good results in finding named entities. In emergencies, the results of this paper can help reduce the effort of event location detection and support better disaster management.
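To make the NER task in the abstract concrete, the following is a minimal sketch of token-level entity labeling. It is not the paper's RNN-based method: it swaps in a plain dictionary (gazetteer) lookup purely to show what "classifying text units into pre-defined categories" produces. The gazetteer entries, the function names `tag_tokens` and `extract_entities`, and the example tweet are all hypothetical.

```python
# Toy illustration only: a dictionary (gazetteer) lookup, NOT the RNN models
# described in the abstract. All entries and the example tweet are hypothetical.
GAZETTEER = {
    "kathmandu": "LOCATION",
    "nepal": "LOCATION",
    "unicef": "ORGANIZATION",
}


def tag_tokens(text):
    """Return (token, label) pairs; tokens not in the gazetteer get the label 'O'."""
    tagged = []
    for raw in text.split():
        # Drop surrounding punctuation and hashtag/mention marks typical of tweets.
        token = raw.strip(".,!?#@:;")
        if not token:
            continue
        tagged.append((token, GAZETTEER.get(token.lower(), "O")))
    return tagged


def extract_entities(text):
    """Keep only the tokens that received an entity label."""
    return [(tok, lab) for tok, lab in tag_tokens(text) if lab != "O"]


if __name__ == "__main__":
    tweet = "UNICEF teams reached #Kathmandu after the Nepal earthquake."
    print(extract_entities(tweet))
```

A lookup like this fails on misspellings and nonstandard abbreviations, which is the kind of informal text the abstract identifies as the reason NER on tweets is hard.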

Pages: 395-421

Page count: 26

