Combination of Topic Modelling and Decision Tree Classification for Tourist Destination Marketing

Cited by: 7
Authors
Christodoulou, Evripides [1]
Gregoriades, Andreas [1]
Pampaka, Maria [2]
Herodotou, Herodotos [1]
Affiliations
[1] Cyprus Univ Technol, Limassol, Cyprus
[2] Univ Manchester, Manchester, Lancs, England
Source
ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS | 2020 / Vol. 382
Keywords
Topic modelling; Sentiment analysis; Decision tree; Tourists' reviews; SENTIMENT ANALYSIS; CUSTOMER; BEHAVIOR; CULTURE
DOI
10.1007/978-3-030-49165-9_9
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper applies a smart tourism approach to destination marketing campaigns by analysing tourists' reviews from TripAdvisor to identify significant patterns in the data. The proposed method combines topic modelling (Structured Topic Analysis) with sentiment polarity and information on tourists' culture and purchasing power to build a Decision Tree (DT) that predicts tourists' experience. Several custom-made Python scripts were used for data collection and analysis. Before analysis, the data underwent integration, cleansing, incomplete-data processing, and imbalanced-data treatment. The patterns that emerge from the DT are expressed as rules that highlight the variable combinations leading to negative or positive sentiment. Destination management can use the resulting predictive model to tailor marketing strategy by targeting the tourists who are more likely to be satisfied at the destination, given their needs.
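To make the idea of "rules that highlight variable combinations" concrete, here is a minimal sketch of how a decision tree over binary review features can be flattened into IF/THEN rules. It is not the authors' code or data: the feature names (`topic_beach`, `topic_price`, `high_power`), the eight toy records, and the small hand-rolled Gini-based tree are all assumptions made for illustration. A real pipeline would more likely rely on an established library.

```python
from collections import Counter
from itertools import product

# Hypothetical binary features: two topic flags and a purchasing-power flag.
FEATURES = ["topic_beach", "topic_price", "high_power"]

# Toy data (invented, not from the paper): a review is negative only when it
# discusses price and the tourist has low purchasing power.
ROWS = []
for beach, price, power in product((0, 1), repeat=3):
    label = "negative" if price == 1 and power == 0 else "positive"
    ROWS.append({"topic_beach": beach, "topic_price": price,
                 "high_power": power, "sentiment": label})


def gini(rows):
    """Gini impurity of the sentiment labels in rows."""
    counts = Counter(r["sentiment"] for r in rows)
    n = len(rows)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def best_split(rows, features):
    """Return the feature with the lowest weighted Gini, or None if no split is possible."""
    best = None  # (weighted_gini, feature)
    for f in features:
        left = [r for r in rows if r[f] == 0]
        right = [r for r in rows if r[f] == 1]
        if not left or not right:
            continue
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
        if best is None or score < best[0]:
            best = (score, f)
    return None if best is None else best[1]


def extract_rules(rows, features, conditions=()):
    """Grow the tree recursively and return one (conditions, label) pair per leaf."""
    feature = best_split(rows, features) if gini(rows) > 0 else None
    if feature is None:
        # Leaf: predict the majority sentiment of the remaining rows.
        label = Counter(r["sentiment"] for r in rows).most_common(1)[0][0]
        return [(conditions, label)]
    remaining = [f for f in features if f != feature]
    rules = []
    for value in (0, 1):
        subset = [r for r in rows if r[feature] == value]
        rules += extract_rules(subset, remaining, conditions + ((feature, value),))
    return rules


RULES = extract_rules(ROWS, FEATURES)
for conds, label in RULES:
    text = " AND ".join(f"{f}={v}" for f, v in conds) or "(always)"
    print(f"IF {text} THEN {label}")
```

Each root-to-leaf path becomes one rule, so a marketer can read off which combinations of topic, culture, or purchasing-power variables are associated with each sentiment class.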
Pages: 95-108
Page count: 14