Dynamic pricing policies for interdependent perishable products or services using reinforcement learning

被引:52
作者
Rana, Rupal [1 ]
Oliveira, Fernando S. [2 ]
机构
[1] Univ Loughborough, Management Sci & Operat Management Dept, Loughborough LE11 3TU, Leics, England
[2] ESSEC Business Sch, Operat Management & Decis Sci Dept, Singapore 188064, Singapore
关键词
Dynamic pricing; Reinforcement learning; Revenue management; Service management; Simulation; REVENUE MANAGEMENT; STOCHASTIC DEMAND; STRATEGIES; ALGORITHM; SYSTEMS;
D O I
10.1016/j.eswa.2014.07.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many businesses offer multiple products or services that are interdependent, in which the demand for one is often affected by the prices of others. This article considers a revenue management problem of multiple interdependent products, in which dynamically adjusted over a finite sales horizon to maximize expected revenue, given an initial inventory for each product. The main contribution of this article is to use reinforcement learning to model the optimal pricing of perishable interdependent products when demand is stochastic and its functional form unknown. We show that reinforcement learning can be used to price interdependent products. Moreover, we analyze the performance of the Q-learning with eligibility traces algorithm under different conditions. We illustrate our analysis with the pricing of services. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:426 / 436
页数:11
相关论文
共 48 条
[11]   Pricing and promotion strategies of an online shop based on customer segmentation and multiple objective decision making [J].
Chan, C. -C. Henry ;
Cheng, Chi-Bin ;
Hsien, Wen-Chen .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) :14585-14591
[12]   Dynamic packaging in e-retailing with stochastic demand over finite horizons: A Q-learning approach [J].
Cheng, Yan .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (01) :472-480
[13]  
Cheng Y, 2007, I C WIREL COMM NETW, P5476
[14]   Models of the spiral-down effect in revenue management [J].
Cooper, William L. ;
Homem-de-Mello, Tito ;
Kleywegt, Anton J. .
OPERATIONS RESEARCH, 2006, 54 (05) :968-987
[15]   Dynamic pricing of airline tickets with competition [J].
Currie, C. S. M. ;
Cheng, R. C. H. ;
Smith, H. K. .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2008, 59 (08) :1026-1037
[16]   Comparing strategies for modeling students learning styles through reinforcement learning in adaptive and intelligent educational systems: An experimental analysis [J].
Dorca, Fabiano A. ;
Lima, Luciano V. ;
Fernandes, Marcia A. ;
Lopes, Carlos R. .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (06) :2092-2101
[17]   Dynamic pricing in the presence of inventory considerations: Research overview, current practices, and future directions [J].
Elmaghraby, W ;
Keskinocak, P .
MANAGEMENT SCIENCE, 2003, 49 (10) :1287-1309
[18]   A survey of operational expert systems in business (1980-1993) [J].
Eom, SB .
INTERFACES, 1996, 26 (05) :50-70
[19]   A multiproduct dynamic pricing problem and its applications to network yield management [J].
Gallego, G ;
VanRyzin, G .
OPERATIONS RESEARCH, 1997, 45 (01) :24-41
[20]   OPTIMAL DYNAMIC PRICING OF INVENTORIES WITH STOCHASTIC DEMAND OVER FINITE HORIZONS [J].
GALLEGO, G ;
VANRYZIN, G .
MANAGEMENT SCIENCE, 1994, 40 (08) :999-1020