Nonparametric Learning Algorithms for Joint Pricing and Inventory Control with Lost Sales and Censored Demand

被引：37

作者：

Chen, Boxiao ^{[1
]}

Chao, Xiuli ^{[2
]}

Shi, Cong ^{[2
]}

机构：

[1] Univ Illinois, Coll Business Adm, Chicago, IL 60607 USA

[2] Univ Michigan, Ind & Operat Engn, Ann Arbor, MI 48109 USA

来源：

MATHEMATICS OF OPERATIONS RESEARCH | 2021年 / 46卷 / 02期

基金：

美国国家科学基金会;

关键词：

nonparametric algorithm; joint pricing and inventory control; lost sales; censored demand; FIXED ORDERING COST; NEWSVENDOR PROBLEM; CONTROL POLICY; SYSTEMS; STRATEGIES; MANAGEMENT; OPTIMIZATION; PRODUCTS; BOUNDS;

D O I：

10.1287/moor.2020.1084

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

We consider a joint pricing and inventory control problem in which the customer's response to selling price and the demand distribution are not known a priori. Unsatisfied demand is lost and unobserved, and the only available information for decision making is the observed sales data (also known as censored demand). Conventional approaches, such as stochastic approximation, online convex optimization, and continuum-armed bandit algorithms, cannot be employed, because neither the realized values of the profit function nor its derivatives are known. A major challenge of this problem lies in that the estimated profit function constructed from observed sales data is multimodal in price. We develop a nonparametric spline approximation-based learning algorithm. The algorithm separates the planning horizon into a disjoint exploration phase and an exploitation phase. During the exploration phase, a spline approximation of the demand-price function is constructed based on sales data, and then the corresponding surrogate optimization problem is solved on a sparse grid to obtain a pair of recommended price and target inventory level. During the exploitation phase, the algorithm implements the recommended strategies. We establish a (nearly) square-root regret rate, which (almost) matches the theoretical lower bound.

引用

页码：726 / 756

页数：31

共 61 条

[1] Dynamic Pricing for Nonperishable Products with Demand Learning [J].

Araman, Victor F. ;

Caldentey, Rene .

OPERATIONS RESEARCH, 2009, 57 (05) :1169-1188

[2]

Aviv Y., 2012, Chapter 23 in Handbook of Pricing Management, P522

[3]

Bertsimas D, 2006, APPL OPTIMIZAT, V101, P45

[4] On the (Surprising) Sufficiency of Linear Models for Dynamic Pricing with Demand Learning [J].

Besbes, Omar ;

Zeevi, Assaf .

MANAGEMENT SCIENCE, 2015, 61 (04) :723-739

[5] On Implications of Demand Censoring in the Newsvendor Problem [J].

Besbes, Omar ;

Muharremoglu, Alp .

MANAGEMENT SCIENCE, 2013, 59 (06) :1407-1424

[6] Blind Network Revenue Management [J].

Besbes, Omar ;

Zeevi, Assaf .

OPERATIONS RESEARCH, 2012, 60 (06) :1537-1550

[7] Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms [J].

Besbes, Omar ;

Zeevi, Assaf .

OPERATIONS RESEARCH, 2009, 57 (06) :1407-1420

[8] ESTIMATION OF INVENTORY REORDER LEVELS USING THE BOOTSTRAP STATISTICAL PROCEDURE [J].

BOOKBINDER, JH ;

LORDAHL, AE .

IIE TRANSACTIONS, 1989, 21 (04) :302-312

[9] General Bounds and Finite-Time Improvement for the Kiefer-Wolfowitz Stochastic Approximation Algorithm [J].

Broadie, Mark ;

Cicek, Deniz ;

Zeevi, Assaf .

OPERATIONS RESEARCH, 2011, 59 (05) :1211-1224

[10] Dynamic Pricing Under a General Parametric Choice Model [J].

Broder, Josef ;

Rusmevichientong, Paat .

OPERATIONS RESEARCH, 2012, 60 (04) :965-980

← 1 2 3 4 5 6 7 →