Least squares approximate policy iteration for learning bid prices in choice-based revenue management

被引：11

作者：

Koch, Sebastian ^{[1
]}

机构：

[1] Univ Augsburg, Chair Analyt & Optimizat, Univ Str 16, D-86159 Augsburg, Germany

来源：

COMPUTERS & OPERATIONS RESEARCH | 2017年 / 77卷

关键词：

Revenue management; Capacity control; Approximate dynamic programming; Approximate policy iteration; DYNAMIC-PROGRAMMING ALGORITHM; VIRTUAL NESTING CONTROLS; CAPACITY CONTROL; SIMULATION; MODEL;

D O I：

10.1016/j.cor.2016.07.015

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

We consider the revenue management problem of capacity control under customer choice behavior. An exact solution of the underlying stochastic dynamic program is difficult because of the multi-dimensional state space and, thus, approximate dynamic programming (ADP) techniques are widely used. The key idea of ADP is to encode the multi-dimensional state space by a small number of basis functions, often leading to a parametric approximation of the dynamic program's value function. In general, two classes of ADP techniques for learning value function approximations exist: mathematical programming and simulation. So far, the literature on capacity control largely focuses on the first class. In this paper, we develop a least squares approximate policy iteration (API) approach which belongs to the second class. Thereby, we suggest value function approximations that are linear in the parameters, and we estimate the parameters via linear least squares regression. Exploiting both exact and heuristic knowledge from the value function, we enforce structural constraints on the parameters to facilitate learning a good policy. We perform an extensive simulation study to investigate the performance of our approach. The results show that it is able to obtain competitive revenues compared to and often out-performs state-of-the-art capacity control methods in reasonable computational time. Depending on the scarcity of capacity and the point in time, revenue improvements of around 1% or more can be observed. Furthermore, the proposed approach contributes to simulation-based ADP, bringing forth research on numerically estimating piecewise linear value function approximations and their application in revenue management environments. (C) 2016 Elsevier Ltd. All rights reserved.

引用

页码：240 / 253

页数：14

共 48 条

[31]

Phillips R, 2005, PRICING REVENUE OPTI, DOI 10.1515/9781503614260

[32] On complexity of unconstrained hyperbolic 0-1 programming problems [J].

Prokopyev, OA ;

Huang, HX ;

Pardalos, PA .

OPERATIONS RESEARCH LETTERS, 2005, 33 (03) :312-318

[33] Solving the dynamic ambulance relocation and dispatching problem using approximate dynamic programming [J].

Schmid, Verena .

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 219 (03) :611-621

[34] Approximate dynamic programming for capacity allocation in the service industry [J].

Schuetz, Hans-Joerg ;

Kolisch, Rainer .

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 218 (01) :239-250

[35]

Schwind M, 2007, LECT NOTES ECON MATH, V589, P1, DOI 10.1007/978-3-540-68003-1

[36] Integrated revenue management approaches for capacity control with planned upgrades [J].

Steinhardt, Claudius ;

Goensch, Jochen .

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 223 (02) :380-391

[37] Revenue management under a general discrete choice model of consumer behavior [J].

Talluri, K ;

van Ryzin, G .

MANAGEMENT SCIENCE, 2004, 50 (01) :15-33

[38]

Talluri KT, 2004, INT SER OPER RES MAN, P1

[39] On the Approximate Linear Programming Approach for Network Revenue Management Problems [J].

Tong, Chaoxu ;

Topaloglu, Huseyin .

INFORMS JOURNAL ON COMPUTING, 2014, 26 (01) :121-134

[40] Decomposing Inventory Routing Problems with Approximate Value Functions [J].

Toriello, Alejandro ;

Nemhauser, George ;

Savelsbergh, Martin .

NAVAL RESEARCH LOGISTICS, 2010, 57 (08) :718-727

← 1 2 3 4 5 →