Dynamic Pricing Under a General Parametric Choice Model

被引:155
作者
Broder, Josef [1 ]
Rusmevichientong, Paat [2 ]
机构
[1] Cornell Univ, Ctr Appl Math, Ithaca, NY 14850 USA
[2] Univ So Calif, Marshall Sch Business, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
MAXIMUM-LIKELIHOOD-ESTIMATION; STOCHASTIC-APPROXIMATION; DEMAND; BOUNDS;
D O I
10.1287/opre.1120.1057
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We consider a stylized dynamic pricing model in which a monopolist prices a product to a sequence of T customers who independently make purchasing decisions based on the price offered according to a general parametric choice model. The parameters of the model are unknown to the seller, whose objective is to determine a pricing policy that minimizes the regret, which is the expected difference between the seller's revenue and the revenue of a clairvoyant seller who knows the values of the parameters in advance and always offers the revenue-maximizing price. We show that the regret of the optimal pricing policy in this model is Theta(root T), by establishing an Omega(root T) lower bound on the worst-case regret under an arbitrary policy, and presenting a pricing policy based on maximum-likelihood estimation whose regret is O(root T) across all problem instances. Furthermore, we show that when the demand curves satisfy a "well-separated" condition, the T-period regret of the optimal policy is Theta(log T). Numerical experiments show that our policies perform well.
引用
收藏
页码:965 / 980
页数:16
相关论文
共 35 条
[1]   THE CONTINUUM-ARMED BANDIT PROBLEM [J].
AGRAWAL, R .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1995, 33 (06) :1926-1951
[2]  
[Anonymous], 1997, ART COMPUTER PROGRAM
[3]  
[Anonymous], 1998, Mathematical statistics
[4]   Finite-time analysis of the multiarmed bandit problem [J].
Auer, P ;
Cesa-Bianchi, N ;
Fischer, P .
MACHINE LEARNING, 2002, 47 (2-3) :235-256
[5]   Improved rates for the stochastic continuum-armed bandit problem [J].
Auer, Peter ;
Ortner, Ronald ;
Szepesvari, Csaba .
LEARNING THEORY, PROCEEDINGS, 2007, 4539 :454-+
[6]  
BARSHALOM Y, 1971, J R STAT SOC B, V33, P72
[7]  
Basawa IV., 1976, Sankhya: The Indian Journal of Statistics, Series A, P259
[8]  
Ben-Akiva M. E., 1985, Discrete choice analysis: Theory and application to travel demand, V9
[9]  
Bertsimas D, 2003, MATH COMPUTATIONAL M, V101, P45
[10]   On the Minimax Complexity of Pricing in a Changing Environment [J].
Besbes, Omar ;
Zeevi, Assaf .
OPERATIONS RESEARCH, 2011, 59 (01) :66-79