Dynamic pricing under competition using reinforcement learning

被引:0
作者
Alexander Kastius
Rainer Schlosser
机构
[1] University of Potsdam,Hasso Plattner Institute
来源
Journal of Revenue and Pricing Management | 2022年 / 21卷
关键词
Dynamic pricing; Competition; Reinforcement learning; E-commerce; Price collusion;
D O I
暂无
中图分类号
学科分类号
摘要
Dynamic pricing is considered a possibility to gain an advantage over competitors in modern online markets. The past advancements in Reinforcement Learning (RL) provided more capable algorithms that can be used to solve pricing problems. In this paper, we study the performance of Deep Q-Networks (DQN) and Soft Actor Critic (SAC) in different market models. We consider tractable duopoly settings, where optimal solutions derived by dynamic programming techniques can be used for verification, as well as oligopoly settings, which are usually intractable due to the curse of dimensionality. We find that both algorithms provide reasonable results, while SAC performs better than DQN. Moreover, we show that under certain conditions, RL algorithms can be forced into collusion by their competitors without direct communication.
引用
收藏
页码:50 / 63
页数:13
相关论文
共 44 条
[1]  
Bondoux N(2020)Reinforcement learning applied to airline revenue management Journal of Revenue and Pricing Management 19 332-348
[2]  
Nguyen AQ(2015)Dynamic pricing and learning: Historical origins, current research, and new directions Surveys in Operations Research and Management Science 20 1-18
[3]  
Fiig T(2002)Inventory management in supply chains: A reinforcement learning approach International Journal of Production Economics 78 153-161
[4]  
Acuna-Agost R(2002)A reinforcement learning approach to a single leg airline revenue management problem with multiple fare classes and overbooking IIE Transactions (Institute of Industrial Engineers) 34 729-742
[5]  
den Boer AV(2004)A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis Machine Learning 55 5-29
[6]  
Giannoccaro I(2016)Dynamic pricing and energy consumption scheduling with reinforcement learning IEEE Transactions on Smart Grid 7 2187-2198
[7]  
Pontrandolfo P(2006)Dynamic pricing based on asymmetric multiagent reinforcement learning International Journal of Intelligent Systems 21 73-98
[8]  
Gosavi A(2003)Learning competitive pricing strategies by multi-agent reinforcement learning Journal of Economic Dynamics and Control 27 2207-2218
[9]  
Bandla N(2015)Human-level control through deep reinforcement learning Nature 518 529-533
[10]  
Das TK(2014)Real-time dynamic pricing in a non-stationary environment using model-free reinforcement learning Omega (United Kingdom) 47 116-126