Feature-Based Dynamic Pricing

被引:69
作者
Cohen, Maxime C. [1 ]
Lobel, Ilan [2 ]
Leme, Renato Paes [3 ]
机构
[1] McGill Univ, Desaultels Fac Management, Montreal, PQ H3A 1G5, Canada
[2] NYU, Stern Sch Business, New York, NY 10012 USA
[3] Google Res, New York, NY 10011 USA
关键词
online learning; contextual bandits; ellipsoid method; revenue management; POLYHEDRAL METHODS; OPTIMIZATION; ALGORITHMS; POLICIES; REGRET;
D O I
10.1287/mnsc.2019.3485
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We consider the problem faced by a firm that receives highly differentiated products in an online fashion. The firm needs to price these products to sell them to its customer base. Products are described by vectors of features and the market value of each product is linear in the values of the features. The firm does not initially know the values of the different features, but can learn the values of the features based on whether products were sold at the posted prices in the past. This model is motivated by applications such as online marketplaces, online flash sales, and loan pricing. We first consider a multidimensional version of binary search over polyhedral sets and show that it has a worst-case regret which is exponential in the dimension of the feature space. We then propose a modification of the prior algorithm where uncertainty sets are replaced by their Lowner-John ellipsoids. We show that this algorithm has a worst-case regret which is quadratic in the dimension of the feature space and logarithmic in the time horizon. We also show how to adapt our algorithm to the case where valuations are noisy. Finally, we present computational experiments to illustrate the performance of our algorithm.
引用
收藏
页码:4921 / 4943
页数:23
相关论文
共 55 条
[1]  
Abbasi-Yadkori Y, 2011, ADV NEURAL INFORM PR, V24, P2312
[2]  
Agarwal A, 2014, PR MACH LEARN RES, V32, P1638
[3]  
Agrawal S., 2016, P 29 C LEARNING THEO, P4
[4]  
Agrawal S, 2016, ADV NEURAL INFORM PR, V29, P3458
[5]   Bandits with Global Convex Constraints and Objective [J].
Agrawal, Shipra ;
Devanur, Nikhil R. .
OPERATIONS RESEARCH, 2019, 67 (05) :1486-1502
[6]  
Amin K., 2011, P 24 ANN C LEARNING, P87
[7]  
Amin K, 2015, AAAI CONF ARTIF INTE, P770
[8]  
Amin K, 2014, ADV NEUR IN, V27
[9]  
[Anonymous], 1993, GEOMETRIC ALGORITHMS
[10]   Dynamic Pricing for Nonperishable Products with Demand Learning [J].
Araman, Victor F. ;
Caldentey, Rene .
OPERATIONS RESEARCH, 2009, 57 (05) :1169-1188