Prediction of hotel booking cancellations: Integration of machine learning and probability model based on interpretable feature interaction

被引:28
作者
Chen, Shuixia [1 ,2 ]
Ngai, Eric W. T. [2 ]
Ku, Yaoyao [1 ]
Xu, Zeshui [1 ]
Gou, Xunjie [1 ]
Zhang, Chenxi [1 ]
机构
[1] Sichuan Univ, Business Sch, Chengdu 610064, Peoples R China
[2] Hong Kong Polytech Univ, Dept Management & Mkt, Hung Hom, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Machine learning; Cancellation prediction; Bayesian network; Feature interaction; BAYESIAN NETWORKS; ANALYTICS; SELECTION;
D O I
10.1016/j.dss.2023.113959
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reliable hotel cancellation prediction can help establish appropriate operational strategies for hotel management. In this sector, personal name records (PNR) data may be the most representative information source for prediction tasks. Despite the popularity of PNR, its inherent lack of availability has been commonly disregarded in the literature. Existing studies have directly input PNR into high-dimensional machine learning (ML) models to achieve cancellation predictions. Another type of model generates cancellation prediction based on the probability modeling of samples. In this study, we propose an interpretable feature interaction method to enrich the existing PNR information. Thereafter, we empirically assess the prediction performance of the two model classes. This study specifically determines whether or not the two methods can cross-fertilize each other to improve cancellation prediction. To do so, we propose a model integrating Bayesian networks (BNs) and Lasso regression for this prediction task. This study utilizes BNs for the probability model consistent with our correlated variables and dichotomous prediction setting. Moreover, we use a linear ML model (i.e., Lasso regression), given its advantages in reducing ineffective predictors and transparency for ranking feature importance. Empirical results show that the proposed integration model has better prediction performance, and the obtained BN estimators and interactive features are the most important predictors. This study contributes to the booking cancellation literature by proposing an interpretable feature interaction and a prediction method integrating two types of effective models. The obtained accurate and interpretable cancellation prediction further contributes to offering practical implications to hoteliers in managerial decision-making.
引用
收藏
页数:14
相关论文
共 41 条
[1]   Hotel booking demand datasets [J].
Antonio, Nuno ;
de Almeida, Ana ;
Nunes, Luis .
DATA IN BRIEF, 2019, 22 :41-49
[2]   Predicting Hotel Bookings Cancellation With a Machine Learning Classification Model [J].
Antonio, Nuno ;
de Almeida, Ana ;
Nunes, Luis .
2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, :1049-1054
[3]   Online Decision Making with High-Dimensional Covariates [J].
Bastani, Hamsa ;
Bayati, Mohsen .
OPERATIONS RESEARCH, 2020, 68 (01) :276-294
[4]   An Analytics Approach to Designing Combination Chemotherapy Regimens for Cancer [J].
Bertsimas, Dimitris ;
O'Hair, Allison ;
Relyea, Stephen ;
Silberholz, John .
MANAGEMENT SCIENCE, 2016, 62 (05) :1511-1531
[5]   Spline-rule ensemble classifiers with structured sparsity regularization for interpretable customer churn modeling [J].
Bock, Koen W. De ;
De Caigny, Arno .
DECISION SUPPORT SYSTEMS, 2021, 150
[6]   Predicting traffic flow using Bayesian networks [J].
Castillo, Enrique ;
Maria Menendez, Jose ;
Sanchez-Cambronero, Santos .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2008, 42 (05) :482-509
[7]   Customer purchase forecasting for online tourism: A data-driven method with multiplex behavior data [J].
Chen, Shui-xia ;
Wang, Xiao-kang ;
Zhang, Hong-yu ;
Wang, Jian-qiang ;
Peng, Juan-juan .
TOURISM MANAGEMENT, 2021, 87
[8]   Customer purchase prediction from the perspective of imbalanced data: A machine learning framework based on factorization machine [J].
Chen, Shui-xia ;
Wang, Xiao-kang ;
Zhang, Hong-yu ;
Wang, Jian-qiang .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 173
[9]   Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning [J].
Chou, Ping ;
Chuang, Howard Hao-Chun ;
Chou, Yen-Chun ;
Liang, Ting-Peng .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 296 (02) :635-651
[10]   Exploring investors' expectancies and its impact on project funding success likelihood in crowdfunding by using text analytics and Bayesian networks [J].
Costello, Francis Joseph ;
Lee, Kun Chang .
DECISION SUPPORT SYSTEMS, 2022, 154