Dynamic pricing policies for interdependent perishable products or services using reinforcement learning

被引:52
作者
Rana, Rupal [1 ]
Oliveira, Fernando S. [2 ]
机构
[1] Univ Loughborough, Management Sci & Operat Management Dept, Loughborough LE11 3TU, Leics, England
[2] ESSEC Business Sch, Operat Management & Decis Sci Dept, Singapore 188064, Singapore
关键词
Dynamic pricing; Reinforcement learning; Revenue management; Service management; Simulation; REVENUE MANAGEMENT; STOCHASTIC DEMAND; STRATEGIES; ALGORITHM; SYSTEMS;
D O I
10.1016/j.eswa.2014.07.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many businesses offer multiple products or services that are interdependent, in which the demand for one is often affected by the prices of others. This article considers a revenue management problem of multiple interdependent products, in which dynamically adjusted over a finite sales horizon to maximize expected revenue, given an initial inventory for each product. The main contribution of this article is to use reinforcement learning to model the optimal pricing of perishable interdependent products when demand is stochastic and its functional form unknown. We show that reinforcement learning can be used to price interdependent products. Moreover, we analyze the performance of the Q-learning with eligibility traces algorithm under different conditions. We illustrate our analysis with the pricing of services. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:426 / 436
页数:11
相关论文
共 48 条
[41]  
Rana R., 2013, OMEGA
[42]   A comparison of fixed and dynamic pricing policies in revenue management [J].
Sen, Alper .
OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2013, 41 (03) :586-597
[43]   Reinforcement learning approach to goal-regulation in a self-evolutionary manufacturing system [J].
Shin, Moonsoo ;
Ryu, Kwangyeol ;
Jung, Mooyoung .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (10) :8736-8743
[44]   Optimal pricing for mobile manufacturers in competitive market using genetic algorithm [J].
Sohn, So Young ;
Moon, Tale Hee ;
Seok, Kim Jong .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :3448-3453
[45]  
Talluri K., 2005, The Theory and Practice of Revenue Management
[46]   An analysis of temporal-difference learning with function approximation [J].
Tsitsiklis, JN ;
VanRoy, B .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (05) :674-690
[47]   Revenue management for parallel flights with customer-choice behavior [J].
Zhang, D ;
Cooper, WL .
OPERATIONS RESEARCH, 2005, 53 (03) :415-431
[48]   Optimal dynamic pricing for perishable assets with nonhomogeneous demand [J].
Zhao, W ;
Zheng, YS .
MANAGEMENT SCIENCE, 2000, 46 (03) :375-388