Coordinating Pricing and Inventory Replenishment with Nonparametric Demand Learning

被引：57

作者：

Chen, Boxiao ^{[1
]}

Chao, Xiuli ^{[2
]}

Ahn, Hyun-Soo ^{[3
]}

机构：

[1] Univ Illinois, Dept Informat & Decis Sci, Coll Business Adm, Chicago, IL 60607 USA

[2] Univ Michigan, Dept Ind & Operat Engn, Ann Arbor, MI 48109 USA

[3] Univ Michigan, Ross Sch Business, Dept Technol & Operat, Ann Arbor, MI 48109 USA

来源：

OPERATIONS RESEARCH | 2019年 / 67卷 / 04期

基金：

美国国家科学基金会;

关键词：

dynamic pricing; inventory control; demand learning; nonparametric estimation; nonperishable products; asymptotic optimality; NEWSVENDOR PROBLEM; BANDIT;

D O I：

10.1287/opre.2018.1808

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

We consider a firm (e.g., retailer) selling a single nonperishable product over a finite-period planning horizon. Demand in each period is stochastic and price sensitive, and unsatisfied demands are backlogged. At the beginning of each period, the firm determines its selling price and inventory replenishment quantity with the objective of maximizing total profit, but it knows neither the average demand (as a function of price) nor the distribution of demand uncertainty a priori; hence, it has to make pricing and ordering decisions based on observed demand data. We propose a nonparametric, data-driven algorithm that learns about the demand on the fly and, concurrently, applies learned information to make replenishment and pricing decisions. The algorithm integrates learning and action in a sense that the firm actively experiments on pricing and inventory levels to collect demand information with minimum profit loss. Besides convergence of optimal policies, we show that the regret of the algorithm, defined as the average profit loss compared with that of the optimal solution had the firm known the underlying demand information, vanishes at the fastest possible rate as the planning horizon increases.

引用

页码：1035 / 1052

页数：18

共 40 条

[1]

Agrawal S, 2016, ADV NEUR IN, V29

[2]

Agrawal Shipra, 2014, P 15 ACM C EC COMP, P989, DOI [10.1145/2600057.2602844, DOI 10.1145/2600057.2602844]

[3] Improved rates for the stochastic continuum-armed bandit problem [J].

Auer, Peter ;

Ortner, Ronald ;

Szepesvari, Csaba .

LEARNING THEORY, PROCEEDINGS, 2007, 4539 :454-+

[4] Bandits with Knapsacks (Extended Abstract) [J].

Badanidiyuru, Ashwinkumar ;

Kleinberg, Robert ;

Slivkins, Aleksandrs .

2013 IEEE 54TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2013, :207-216

[5] On the (Surprising) Sufficiency of Linear Models for Dynamic Pricing with Demand Learning [J].

Besbes, Omar ;

Zeevi, Assaf .

MANAGEMENT SCIENCE, 2015, 61 (04) :723-739

[6] On Implications of Demand Censoring in the Newsvendor Problem [J].

Besbes, Omar ;

Muharremoglu, Alp .

MANAGEMENT SCIENCE, 2013, 59 (06) :1407-1424

[7] Blind Network Revenue Management [J].

Besbes, Omar ;

Zeevi, Assaf .

OPERATIONS RESEARCH, 2012, 60 (06) :1537-1550

[8] Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms [J].

Besbes, Omar ;

Zeevi, Assaf .

OPERATIONS RESEARCH, 2009, 57 (06) :1407-1420

[9]

Buche R, 2001, SIAM J CONTROL OPTIM, V40, P1011, DOI 10.1137/S0363012999361639

[10] Adaptive ordering and pricing for perishable products [J].

Burnetas, AN ;

Smith, CE .

OPERATIONS RESEARCH, 2000, 48 (03) :436-443

← 1 2 3 4 →