Reinforcement learning applied to airline revenue management

被引:25
作者
Bondoux, Nicolas [1 ]
Anh Quan Nguyen [1 ]
Fiig, Thomas [2 ]
Acuna-Agost, Rodrigo [1 ]
机构
[1] Amadeus SAS, Res Innovat & Ventures, 485 Route Pin Montard, F-06902 Sophia Antipolis, France
[2] Amadeus IT Grp, Lufthavnsboulevarden 14, DK-2770 Kastrup, Denmark
关键词
Revenue Management System; Machine Learning; Reinforcement Learning; Deep Reinforcement Learning; Q-Learning; Deep Q-Learning; DEMAND; OPTIMIZATION; ALGORITHM;
D O I
10.1057/s41272-020-00228-4
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Reinforcement learning (RL) is an area of machine learning concerned with how agents take actions to optimize a given long-term reward by interacting with the environment they are placed in. Some well-known recent applications include self-driving cars and computers playing games with super-human performance. One of the main advantages of this approach is that there is no need to explicitly model the nature of the interactions with the environment. In this work, we present a new airline Revenue Management System (RMS) based on RL, which does not require a demand forecaster. The optimization module remains but works in a different way. It is theoretically proven that RL converges to the optimal solution; however, in practice, the system may require a significant amount of data (a booking history with millions of daily departures) to learn the optimal policies. To overcome these difficulties, we present a novel model that integrates domain knowledge with a deep neural network trained on GPUs. The results are very encouraging in different scenarios and open the door for a new generation of RMSs that could automatically learn by directly interacting with customers.
引用
收藏
页码:332 / 348
页数:17
相关论文
共 26 条
  • [1] [Anonymous], 2016, DEEP LEARNING
  • [2] Aviv Y., 2005, Dynamic pricing of short life-cycle products through active learning
  • [3] ACTIVELY LEARNING ABOUT DEMAND AND THE DYNAMICS OF PRICE ADJUSTMENT
    BALVERS, RJ
    COSIMANO, TF
    [J]. ECONOMIC JOURNAL, 1990, 100 (402) : 882 - 898
  • [4] Benchmarking filter-based demand estimates for airline revenue management
    Bartke, Philipp
    Kliewer, Natalia
    Cleophas, Catherine
    [J]. EURO JOURNAL ON TRANSPORTATION AND LOGISTICS, 2018, 7 (01) : 57 - 88
  • [5] Carvalho AX, 2005, J REVENUE PRICING MA, V3, P320, DOI 10.1057/palgrave.rpm.5170118
  • [6] den Boer A.V, 2012, THESIS
  • [7] Dynamic Pricing and Learning with Finite Inventories
    den Boer, Arnoud V.
    Zwart, Bert
    [J]. OPERATIONS RESEARCH, 2015, 63 (04) : 965 - 978
  • [8] Simultaneously Learning and Optimizing Using Controlled Variance Pricing
    den Boer, Arnoud V.
    Zwart, Bert
    [J]. MANAGEMENT SCIENCE, 2014, 60 (03) : 770 - 783
  • [9] Optimization of mixed fare structures: Theory and applications
    Fiig, Thomas
    Isler, Karl
    Hopperstad, Craig
    Belobaba, Peter
    [J]. JOURNAL OF REVENUE AND PRICING MANAGEMENT, 2010, 9 (1-2) : 152 - 170
  • [10] OPTIMAL DYNAMIC PRICING OF INVENTORIES WITH STOCHASTIC DEMAND OVER FINITE HORIZONS
    GALLEGO, G
    VANRYZIN, G
    [J]. MANAGEMENT SCIENCE, 1994, 40 (08) : 999 - 1020