Dynamic Pricing Under a General Parametric Choice Model

被引：155

作者：

Broder, Josef ^{[1
]}

Rusmevichientong, Paat ^{[2
]}

机构：

[1] Cornell Univ, Ctr Appl Math, Ithaca, NY 14850 USA

[2] Univ So Calif, Marshall Sch Business, Los Angeles, CA 90089 USA

来源：

OPERATIONS RESEARCH | 2012年 / 60卷 / 04期

基金：

美国国家科学基金会;

关键词：

MAXIMUM-LIKELIHOOD-ESTIMATION; STOCHASTIC-APPROXIMATION; DEMAND; BOUNDS;

D O I：

10.1287/opre.1120.1057

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

We consider a stylized dynamic pricing model in which a monopolist prices a product to a sequence of T customers who independently make purchasing decisions based on the price offered according to a general parametric choice model. The parameters of the model are unknown to the seller, whose objective is to determine a pricing policy that minimizes the regret, which is the expected difference between the seller's revenue and the revenue of a clairvoyant seller who knows the values of the parameters in advance and always offers the revenue-maximizing price. We show that the regret of the optimal pricing policy in this model is Theta(root T), by establishing an Omega(root T) lower bound on the worst-case regret under an arbitrary policy, and presenting a pricing policy based on maximum-likelihood estimation whose regret is O(root T) across all problem instances. Furthermore, we show that when the demand curves satisfy a "well-separated" condition, the T-period regret of the optimal policy is Theta(log T). Numerical experiments show that our policies perform well.

引用

页码：965 / 980

页数：16

共 35 条

[1] THE CONTINUUM-ARMED BANDIT PROBLEM [J].

AGRAWAL, R .

SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1995, 33 (06) :1926-1951

[2]

[Anonymous], 1997, ART COMPUTER PROGRAM

[3]

[Anonymous], 1998, Mathematical statistics

[4] Finite-time analysis of the multiarmed bandit problem [J].

Auer, P ;

Cesa-Bianchi, N ;

Fischer, P .

MACHINE LEARNING, 2002, 47 (2-3) :235-256

[5] Improved rates for the stochastic continuum-armed bandit problem [J].

Auer, Peter ;

Ortner, Ronald ;

Szepesvari, Csaba .

LEARNING THEORY, PROCEEDINGS, 2007, 4539 :454-+

[6]

BARSHALOM Y, 1971, J R STAT SOC B, V33, P72

[7]

Basawa IV., 1976, Sankhya: The Indian Journal of Statistics, Series A, P259

[8]

Ben-Akiva M. E., 1985, Discrete choice analysis: Theory and application to travel demand, V9

[9]

Bertsimas D, 2003, MATH COMPUTATIONAL M, V101, P45

[10] On the Minimax Complexity of Pricing in a Changing Environment [J].

Besbes, Omar ;

Zeevi, Assaf .

OPERATIONS RESEARCH, 2011, 59 (01) :66-79

← 1 2 3 4 →