Discovering a tourism destination with social media data: BERT-based sentiment analysis

被引:18
作者
Santiago Vinan-Ludena, Marlon [1 ]
de Campos, Luis M. [1 ]
机构
[1] Univ Granada, CITIC UGR, ETSI Informat & Telecomunicac, Dept Ciencias Computac & Inteligencia Artificial, Granada, Spain
关键词
Social media; Deep learning; Tourism; Sentiment analysis; Data analysis; PERFORMANCE;
D O I
10.1108/JHTT-09-2021-0259
中图分类号
F [经济];
学科分类号
02 ;
摘要
Purpose The main purpose of this paper is to analyze a tourist destination using sentiment analysis techniques with data from Twitter and Instagram to find the most representative entities (or places) and perceptions (or aspects) of the users. Design/methodology/approach The authors used 90,725 Instagram posts and 235,755 Twitter tweets to analyze tourism in Granada (Spain) to identify the important places and perceptions mentioned by travelers on both social media sites. The authors used several approaches for sentiment classification for English and Spanish texts, including deep learning models. Findings The best results in a test set were obtained using a bidirectional encoder representations from transformers (BERT) model for Spanish texts and Tweeteval for English texts, and these were subsequently used to analyze the data sets. It was then possible to identify the most important entities and aspects, and this, in turn, provided interesting insights for researchers, practitioners, travelers and tourism managers so that services could be improved and better marketing strategies formulated. Research limitations/implications The authors propose a Spanish-Tourism-BERT model for performing sentiment classification together with a process to find places through hashtags and to reveal the important negative aspects of each place. Practical implications The study enables managers and practitioners to implement the Spanish-BERT model with our Spanish Tourism data set that the authors released for adoption in applications to find both positive and negative perceptions. Originality/value This study presents a novel approach on how to apply sentiment analysis in the tourism domain. First, the way to evaluate the different existing models and tools is presented; second, a model is trained using BERT (deep learning model); third, an approach of how to identify the acceptance of the places of a destination through hashtags is presented and, finally, the evaluation of why the users express positivity (negativity) through the identification of entities and aspects.
引用
收藏
页码:907 / 921
页数:15
相关论文
共 27 条
[1]  
Abirami AM, 2016, J UNIVERS COMPUT SCI, V22, P650
[2]   New Filtering Scheme Based on Term Weighting to Improve Object Based Opinion Mining on Tourism Product Reviews [J].
Afrizal, Ahimsa Denhas ;
Rakhmawati, Nur Aini ;
Tjahyanto, Aris .
FIFTH INFORMATION SYSTEMS INTERNATIONAL CONFERENCE, 2019, 161 :805-812
[3]   Sentiment Analysis in Tourism: Capitalizing on Big Data [J].
Alaei, Ali Reza ;
Becken, Susanne ;
Stantic, Bela .
JOURNAL OF TRAVEL RESEARCH, 2019, 58 (02) :175-191
[4]  
[Anonymous], 2017, CEUR WORKSHOP PROC
[5]  
Canete J, 2020, P PML4DC ICLR 2020
[6]   The Role of User-Generated Content in Tourists' Travel Planning Behavior [J].
Cox, Carmen ;
Burgess, Stephen ;
Sellitto, Carmine ;
Buultjens, Jeremy .
JOURNAL OF HOSPITALITY MARKETING & MANAGEMENT, 2009, 18 (08) :743-764
[7]  
Daugherty T., 2008, Journal of Interactive Advertising, V8, P16, DOI [DOI 10.1080/15252019.2008.10722139, https://doi.org/10.1080/15252019.2008.10722139]
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]   Is Xenios Zeus Still Alive? Destination Image of Athens in the Years of Recession [J].
Gkritzali, Alkmini ;
Gritzalis, Dimitris ;
Stavrou, Vassilis .
JOURNAL OF TRAVEL RESEARCH, 2018, 57 (04) :540-554
[10]  
Goyal N., 2019, CORR