Dynamic pricing policies for interdependent perishable products or services using reinforcement learning

被引:52
作者
Rana, Rupal [1 ]
Oliveira, Fernando S. [2 ]
机构
[1] Univ Loughborough, Management Sci & Operat Management Dept, Loughborough LE11 3TU, Leics, England
[2] ESSEC Business Sch, Operat Management & Decis Sci Dept, Singapore 188064, Singapore
关键词
Dynamic pricing; Reinforcement learning; Revenue management; Service management; Simulation; REVENUE MANAGEMENT; STOCHASTIC DEMAND; STRATEGIES; ALGORITHM; SYSTEMS;
D O I
10.1016/j.eswa.2014.07.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many businesses offer multiple products or services that are interdependent, in which the demand for one is often affected by the prices of others. This article considers a revenue management problem of multiple interdependent products, in which dynamically adjusted over a finite sales horizon to maximize expected revenue, given an initial inventory for each product. The main contribution of this article is to use reinforcement learning to model the optimal pricing of perishable interdependent products when demand is stochastic and its functional form unknown. We show that reinforcement learning can be used to price interdependent products. Moreover, we analyze the performance of the Q-learning with eligibility traces algorithm under different conditions. We illustrate our analysis with the pricing of services. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:426 / 436
页数:11
相关论文
共 48 条
[1]   Joint Dynamic Pricing of Multiple Perishable Products Under Consumer Choice [J].
Akcay, Yalcin ;
Natarajan, Harihara Prasad ;
Xu, Susan H. .
MANAGEMENT SCIENCE, 2010, 56 (08) :1345-1361
[2]   Maximizing revenue in the airline industry under one-way pricing [J].
Anjos, MF ;
Cheng, RCH ;
Currie, CSM .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2004, 55 (05) :535-541
[3]   Optimal pricing policies for perishable products [J].
Anjos, MF ;
Cheng, RCH ;
Currie, CSM .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2005, 166 (01) :246-254
[4]  
[Anonymous], 1998, Reinforcement Learning: An Introduction
[5]   Dynamic pricing of multiple home delivery options [J].
Asdemir, Kursad ;
Jacob, Varghese S. ;
Krishnan, Ramayya .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2009, 196 (01) :246-257
[6]   Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms [J].
Besbes, Omar ;
Zeevi, Assaf .
OPERATIONS RESEARCH, 2009, 57 (06) :1407-1420
[7]  
Bitran G., 2003, Manufacturing & Service Operations Management, V5, P203, DOI 10.1287/msom.5.3.203.16031
[8]  
Bitran G R., 2004, Pricing Policies for Perishable Products with Demand Substitution (Working Paper)
[9]  
Burkart W. R., 2012, EUR J OPER RES, V219
[10]   Smart pricing scheme: A multi-layered scoring rule application [J].
Chakraborty, Shantanu ;
Ito, Takayuki ;
Senjyu, Tomonobu .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (08) :3726-3735