TY - JOUR
T1 - Prediction of hotel booking cancellations
T2 - Integration of machine learning and probability model based on interpretable feature interaction
AU - Chen, Shuixia
AU - Ngai, Eric W.T.
AU - Ku, Yaoyao
AU - Xu, Zeshui
AU - Gou, Xunjie
AU - Zhang, Chenxi
N1 - Funding Information:
Zeshui Xu received the Ph.D. degree in management science and engineering from Southeast University, Nanjing, China, in 2003. He is the Distinguished Young Scholar of the National Natural Science Foundation of China and the Chang Jiang Scholars of the Ministry of Education of China. He is currently a Professor with the Business School, Sichuan University, Chengdu. He is currently the Academician of EASA, IASCYS; Fellow of IEEE, IFSA, RSA, ORS, IET, BCS, IAAM, VEBLEO, AAIA, the Associate Editor-in-Chief of Applied Intelligence, Senior Editor of IEEE Access, the Associate Editor or Editorial Board Member for >40 professional journals. His research interests include information fusion, group decision making, and aggregation operators.
Funding Information:
The authors are grateful for the constructive comments of the referees on an earlier version of this article. This work was supported by the National Natural Science Foundation of China (No. 72071135 ), Chinese National Funding of Social Sciences (No. 22FGLB005 ), and Humanities and Social Science Fund of Ministry of Education of the People's Republic of China (No. 21YJC630030 ).
Publisher Copyright:
© 2023 Elsevier B.V.
PY - 2023/7
Y1 - 2023/7
N2 - Reliable hotel cancellation prediction can help establish appropriate operational strategies for hotel management. In this sector, personal name records (PNR) data may be the most representative information source for prediction tasks. Despite the popularity of PNR, its inherent lack of availability has been commonly disregarded in the literature. Existing studies have directly input PNR into high-dimensional machine learning (ML) models to achieve cancellation predictions. Another type of model generates cancellation prediction based on the probability modeling of samples. In this study, we propose an interpretable feature interaction method to enrich the existing PNR information. Thereafter, we empirically assess the prediction performance of the two model classes. This study specifically determines whether or not the two methods can cross-fertilize each other to improve cancellation prediction. To do so, we propose a model integrating Bayesian networks (BNs) and Lasso regression for this prediction task. This study utilizes BNs for the probability model consistent with our correlated variables and dichotomous prediction setting. Moreover, we use a linear ML model (i.e., Lasso regression), given its advantages in reducing ineffective predictors and transparency for ranking feature importance. Empirical results show that the proposed integration model has better prediction performance, and the obtained BN estimators and interactive features are the most important predictors. This study contributes to the booking cancellation literature by proposing an interpretable feature interaction and a prediction method integrating two types of effective models. The obtained accurate and interpretable cancellation prediction further contributes to offering practical implications to hoteliers in managerial decision-making.
AB - Reliable hotel cancellation prediction can help establish appropriate operational strategies for hotel management. In this sector, personal name records (PNR) data may be the most representative information source for prediction tasks. Despite the popularity of PNR, its inherent lack of availability has been commonly disregarded in the literature. Existing studies have directly input PNR into high-dimensional machine learning (ML) models to achieve cancellation predictions. Another type of model generates cancellation prediction based on the probability modeling of samples. In this study, we propose an interpretable feature interaction method to enrich the existing PNR information. Thereafter, we empirically assess the prediction performance of the two model classes. This study specifically determines whether or not the two methods can cross-fertilize each other to improve cancellation prediction. To do so, we propose a model integrating Bayesian networks (BNs) and Lasso regression for this prediction task. This study utilizes BNs for the probability model consistent with our correlated variables and dichotomous prediction setting. Moreover, we use a linear ML model (i.e., Lasso regression), given its advantages in reducing ineffective predictors and transparency for ranking feature importance. Empirical results show that the proposed integration model has better prediction performance, and the obtained BN estimators and interactive features are the most important predictors. This study contributes to the booking cancellation literature by proposing an interpretable feature interaction and a prediction method integrating two types of effective models. The obtained accurate and interpretable cancellation prediction further contributes to offering practical implications to hoteliers in managerial decision-making.
KW - Bayesian network
KW - Cancellation prediction
KW - Feature interaction
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85150826602&partnerID=8YFLogxK
U2 - 10.1016/j.dss.2023.113959
DO - 10.1016/j.dss.2023.113959
M3 - Journal article
AN - SCOPUS:85150826602
SN - 0167-9236
VL - 170
JO - Decision Support Systems
JF - Decision Support Systems
M1 - 113959
ER -