Reinforcement learning applied to airline revenue management

被引：25

作者：

Bondoux, Nicolas ^{[1
]}

Anh Quan Nguyen ^{[1
]}

Fiig, Thomas ^{[2
]}

Acuna-Agost, Rodrigo ^{[1
]}

机构：

[1] Amadeus SAS, Res Innovat & Ventures, 485 Route Pin Montard, F-06902 Sophia Antipolis, France

[2] Amadeus IT Grp, Lufthavnsboulevarden 14, DK-2770 Kastrup, Denmark

来源：

JOURNAL OF REVENUE AND PRICING MANAGEMENT | 2020年 / 19卷 / 05期

关键词：

Revenue Management System; Machine Learning; Reinforcement Learning; Deep Reinforcement Learning; Q-Learning; Deep Q-Learning; DEMAND; OPTIMIZATION; ALGORITHM;

D O I：

10.1057/s41272-020-00228-4

中图分类号：

F8 [财政、金融];

学科分类号：

0202 ;

摘要：

Reinforcement learning (RL) is an area of machine learning concerned with how agents take actions to optimize a given long-term reward by interacting with the environment they are placed in. Some well-known recent applications include self-driving cars and computers playing games with super-human performance. One of the main advantages of this approach is that there is no need to explicitly model the nature of the interactions with the environment. In this work, we present a new airline Revenue Management System (RMS) based on RL, which does not require a demand forecaster. The optimization module remains but works in a different way. It is theoretically proven that RL converges to the optimal solution; however, in practice, the system may require a significant amount of data (a booking history with millions of daily departures) to learn the optimal policies. To overcome these difficulties, we present a novel model that integrates domain knowledge with a deep neural network trained on GPUs. The results are very encouraging in different scenarios and open the door for a new generation of RMSs that could automatically learn by directly interacting with customers.

引用

页码：332 / 348

页数：17

共 26 条

[1] [Anonymous], 2016, DEEP LEARNING
[2] Aviv Y., 2005, Dynamic pricing of short life-cycle products through active learning
[3] ACTIVELY LEARNING ABOUT DEMAND AND THE DYNAMICS OF PRICE ADJUSTMENT
BALVERS, RJ
COSIMANO, TF
[J]. ECONOMIC JOURNAL, 1990, 100 (402) : 882 - 898
[4] Benchmarking filter-based demand estimates for airline revenue management
Bartke, Philipp
Kliewer, Natalia
Cleophas, Catherine
[J]. EURO JOURNAL ON TRANSPORTATION AND LOGISTICS, 2018, 7 (01) : 57 - 88
[5] Carvalho AX, 2005, J REVENUE PRICING MA, V3, P320, DOI 10.1057/palgrave.rpm.5170118
[6] den Boer A.V, 2012, THESIS
[7] Dynamic Pricing and Learning with Finite Inventories
den Boer, Arnoud V.
Zwart, Bert
[J]. OPERATIONS RESEARCH, 2015, 63 (04) : 965 - 978
[8] Simultaneously Learning and Optimizing Using Controlled Variance Pricing
den Boer, Arnoud V.
Zwart, Bert
[J]. MANAGEMENT SCIENCE, 2014, 60 (03) : 770 - 783
[9] Optimization of mixed fare structures: Theory and applications
Fiig, Thomas
Isler, Karl
Hopperstad, Craig
Belobaba, Peter
[J]. JOURNAL OF REVENUE AND PRICING MANAGEMENT, 2010, 9 (1-2) : 152 - 170
[10] OPTIMAL DYNAMIC PRICING OF INVENTORIES WITH STOCHASTIC DEMAND OVER FINITE HORIZONS
GALLEGO, G
VANRYZIN, G
[J]. MANAGEMENT SCIENCE, 1994, 40 (08) : 999 - 1020

← 1 2 3 →