TY - JOUR
T1 - Cruise dynamic pricing based on SARSA algorithm
AU - Wang, Jing
AU - Yang, Dong
AU - Chen, Kaimin
AU - Sun, Xiaodong
N1 - Funding Information:
This work was supported by grants from the National Natural Science Foundation of China [No. 71572057], the Shanghai Pujiang Program [No. 17PJC033] and the NSFC/RGC Joint Research Scheme [No. 71661167009 and N_PolyU531/16].
Publisher Copyright:
© 2021 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2021/7
Y1 - 2021/7
N2 - It is a common practice to promote highly discounted fares by cruise companies to enlarge the market share, ignoring economically sustainable development. In some regions, the continuous discounted fares leading to the unsatisfying revenue may be the main cause of decline in ports calls. Cruise companies have learned that dynamic pricing would be much more advantageous at revenue management instead of blindly lowering fares. This paper illustrates such an attempt. We try to dynamically price multiple types of staterooms with various occupancies and evaluate the effect on demand and revenue from different discount and refund policies. We first formulate the cruise pricing problem as Markov Decision Process and Reinforcement Learning (RL), more specifically, state-action-reward-state-action (SARSA) algorithm, is applied to solve it. We then use empirical data to validate the feasibility of RL. Results show that both revenue and demand could be improved under reasonable discount policies. In addition, we demonstrate that reasonable refund policies can also facilitate revenue growth. Finally, a comparison between SARSA algorithm and Q-learning algorithm is discussed. Our finding suggests that SARSA results in higher revenues but takes more time to converge.
AB - It is a common practice to promote highly discounted fares by cruise companies to enlarge the market share, ignoring economically sustainable development. In some regions, the continuous discounted fares leading to the unsatisfying revenue may be the main cause of decline in ports calls. Cruise companies have learned that dynamic pricing would be much more advantageous at revenue management instead of blindly lowering fares. This paper illustrates such an attempt. We try to dynamically price multiple types of staterooms with various occupancies and evaluate the effect on demand and revenue from different discount and refund policies. We first formulate the cruise pricing problem as Markov Decision Process and Reinforcement Learning (RL), more specifically, state-action-reward-state-action (SARSA) algorithm, is applied to solve it. We then use empirical data to validate the feasibility of RL. Results show that both revenue and demand could be improved under reasonable discount policies. In addition, we demonstrate that reasonable refund policies can also facilitate revenue growth. Finally, a comparison between SARSA algorithm and Q-learning algorithm is discussed. Our finding suggests that SARSA results in higher revenues but takes more time to converge.
KW - Cruise industry
KW - discount policy
KW - dynamic pricing
KW - refund policy
KW - reinforcement Learning
UR - https://www.scopus.com/pages/publications/85101077487
U2 - 10.1080/03088839.2021.1887529
DO - 10.1080/03088839.2021.1887529
M3 - Journal article
AN - SCOPUS:85101077487
SN - 0308-8839
VL - 48
SP - 259
EP - 282
JO - Maritime Policy and Management
JF - Maritime Policy and Management
IS - 2
ER -