Application of Classification and Word Embedding Techniques to Evaluate Tourists' Hotel-revisit Intention

Cited by: 5
Authors
Christodoulou, Evripides [1]
Gregoriades, Andreas [1]
Pampaka, Maria [2]
Herodotou, Herodotos [1]
Affiliations
[1] Cyprus Univ Technol, Limassol, Cyprus
[2] Univ Manchester, Manchester, Lancs, England
Source
PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS 2021), VOL 1 | 2021
Keywords
XGBoost; Topic Analysis; Word2Vec; Revisit Intention; Data Mining; Tourists' Reviews; TWITTER
DOI
10.5220/0010453502160223
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Revisit intention is a key indicator of future business performance in the hospitality industry. This work focuses on identifying patterns in user-generated data that explain why tourists may revisit a hotel they stayed at during their holidays, and aims to identify differences between two classes of hotels (4-5 star and 2-3 star). The method utilises data from TripAdvisor retrieved using a scraper application. Topic modelling is initially performed to identify the main themes discussed in each tourist review. Subsequently, reviews are labelled depending on whether they mention their author's intention to revisit the hotel in the future, using an ontology of revisit intention generated with Word2Vec word embeddings. The topics identified in the labelled reviews are utilised to train an Extreme Gradient Boosting (XGBoost) model to predict revisit intention, which is then used to identify topic patterns in reviews that relate to revisit intention. The learned model achieved satisfactory performance and was used to identify the topics most influential for revisit intention, using an explainable machine learning technique to illustrate visually the rules embedded in the learned XGBoost model. The method is applied to reviews from tourists who visited Cyprus between 2009 and 2019. Results highlight that staff professionalism (e.g., politeness, smile) is critical for both classes of hotels; however, its effect is smaller for 2-3 star hotels, where cleanliness has a greater influence on revisiting.
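The abstract does not give implementation details for the labelling step. The following is a minimal sketch, under stated assumptions, of how a revisit-intention vocabulary might be expanded from seed terms by embedding similarity and then used to label reviews. The toy vectors, the similarity threshold, and the names TOY_EMBEDDINGS, expand_seed_terms and label_review are all illustrative assumptions and not the authors' code; in practice the vectors would come from a Word2Vec model trained on the review corpus.

```python
import math

# Hypothetical 3-d vectors standing in for learned Word2Vec embeddings.
TOY_EMBEDDINGS = {
    "return": (0.9, 0.1, 0.0),
    "revisit": (0.8, 0.2, 0.1),
    "rebook": (0.85, 0.15, 0.05),
    "pool": (0.0, 0.9, 0.2),
    "breakfast": (0.1, 0.1, 0.9),
    "staff": (0.1, 0.8, 0.5),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand_seed_terms(seeds, embeddings, threshold=0.95):
    """Return the seeds plus every vocabulary word that is at least
    `threshold`-similar to some seed (the threshold is an assumed value)."""
    known_seeds = [s for s in seeds if s in embeddings]
    lexicon = set(seeds)
    for word, vec in embeddings.items():
        if any(cosine(vec, embeddings[s]) >= threshold for s in known_seeds):
            lexicon.add(word)
    return lexicon


def label_review(text, lexicon):
    """Label a review 1 if it contains any lexicon term, else 0."""
    tokens = {t.strip(".,!?;:").lower() for t in text.split()}
    return int(bool(tokens & lexicon))


if __name__ == "__main__":
    lexicon = expand_seed_terms(["return"], TOY_EMBEDDINGS)
    for review in ["We will definitely revisit!", "The pool was clean."]:
        print(label_review(review, lexicon), review)
```

Labels produced this way could then serve as the target variable for a classifier trained on per-review topic features, as the abstract describes. Simple keyword matching of this kind ignores negation ("will not return"), so a real pipeline would likely need additional handling.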
Pages: 216-223
Number of pages: 8