A study on online travel reviews through intelligent data analysis

被引:16
作者
Fazzolari, Michela [1 ]
Petrocchi, Marinella [1 ]
机构
[1] CNR, Inst Informat & Telemat, Pisa, Italy
关键词
Online travel reviews; Frequent itemsets; Reviewers activities; Recurrent destinations; Text mining; Association rule mining; WORD-OF-MOUTH; USER REVIEWS; TOURISM; IMPACT;
D O I
10.1007/s40558-018-0121-z
中图分类号
F [经济];
学科分类号
02 ;
摘要
The purpose of this paper is to show the application of a set of intelligent data analysis techniques to about 7million of online travel reviews, with the aim of automatically extracting useful information. The reviews, collected from two popular online tourism-related review platforms, are all those posted by reviewers about one specific Italian location, from 2010 to 2017. To carry out the study, the following methodology is applied: a preliminary statistical analysis is performed to acquire general knowledge about the datasets, such as the geographical distribution of reviewers, their activities, and a comparison among the time of visits and the average scores of the reviews. Then, Natural Language Processing techniques are applied to extract and compare the most frequent words used in the two platforms. Finally, an Association Rule Learning algorithm is applied, to extract preferred destinations for distinct groups of reviewers, by mining interesting associations among the countries of origin of the reviewers and the most frequent destinations visited. By elaborating the available data, it is possible to automatically disclose valuable information for consumers and providers. The information automatically extracted can be exploited, for example, to build a recommender system for customers or a market analysis tool for service providers.
引用
收藏
页码:37 / 58
页数:22
相关论文
共 38 条
[1]  
Aghdam AR, 2014, 2014 INTERNATIONAL CONFERENCE ON INDUSTRIAL AUTOMATION, INFORMATION AND COMMUNICATIONS TECHNOLOGY (IAICT), P130, DOI 10.1109/IAICT.2014.6922099
[2]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[3]   Factors influencing consumer intention in social commerce adoption [J].
Akman, Ibrahim ;
Mishra, Alok .
INFORMATION TECHNOLOGY & PEOPLE, 2017, 30 (02) :356-370
[4]   Sentiment Analysis in Tourism: Capitalizing on Big Data [J].
Alaei, Ali Reza ;
Becken, Susanne ;
Stantic, Bela .
JOURNAL OF TRAVEL RESEARCH, 2019, 58 (02) :175-191
[5]   powerlaw: A Python']Python Package for Analysis of Heavy-Tailed Distributions [J].
Alstott, Jeff ;
Bullmore, Edward T. ;
Plenz, Dietmar .
PLOS ONE, 2014, 9 (01)
[6]   Social media use for travel purposes: a cross cultural comparison between Portugal and the UK [J].
Amaro S. ;
Duarte P. .
Information Technology & Tourism, 2017, 17 (2) :161-181
[7]   Understanding Satisfied and Dissatisfied Hotel Customers: Text Mining of Online Hotel Reviews [J].
Berezina, Katerina ;
Bilgihan, Anil ;
Cobanoglu, Cihan ;
Okumus, Fevzi .
JOURNAL OF HOSPITALITY MARKETING & MANAGEMENT, 2016, 25 (01) :1-24
[8]  
Bird S, 2009, Natural language processing with python, DOI DOI 10.5555/1717171
[9]   Identifying Helpful Online Reviews with Word Embedding Features [J].
Chen, Jie ;
Zhang, Chunxia ;
Niu, Zhendong .
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2016, 2016, 9983 :123-133
[10]   Predicting consumer product demands via Big Data: the roles of online promotional marketing and online reviews [J].
Chong, Alain Yee Loong ;
Ch'ng, Eugene ;
Liu, Martin J. ;
Li, Boying .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2017, 55 (17) :5142-5156