How Active is Active Learning: Value Function Method Versus an Approximation Method

被引:0
作者
Hans M. Amman
Marco P. Tucci
机构
[1] University of Amsterdam,Faculty of Economics and Business
[2] University of Siena,Dipartimento di Economia Politica e Statistica
来源
Computational Economics | 2020年 / 56卷
关键词
Optimal experimentation; Value function; Approximation method; Adaptive control; Active learning; Time-varying parameters; Numerical experiments; C63; E61; E62;
D O I
暂无
中图分类号
学科分类号
摘要
In a previous paper Amman et al. (Macroecon Dyn, 2018) compare the two dominant approaches for solving models with optimal experimentation (also called active learning), i.e. the value function and the approximation method. By using the same model and dataset as in Beck and Wieland (J Econ Dyn Control 26:1359–1377, 2002), they find that the approximation method produces solutions close to those generated by the value function approach and identify some elements of the model specifications which affect the difference between the two solutions. They conclude that differences are small when the effects of learning are limited. However the dataset used in the experiment describes a situation where the controller is dealing with a nonstationary process and there is no penalty on the control. The goal of this paper is to see if their conclusions hold in the more commonly studied case of a controller facing a stationary process and a positive penalty on the control.
引用
收藏
页码:675 / 693
页数:18
相关论文
共 44 条
[1]  
Aghion P(1991)Optimal learning by experimentation Review of Economic Studies 58 621-654
[2]  
Bolton P(1997)Numerical solution methods of the algebraic matrix riccati equation Journal of Economic Dynamics and Control 21 363-370
[3]  
Harris C(1969)On the optimal control of discrete-time linear systems with random parameters IEEE Transactions on Automatic Control 14 3-8
[4]  
Jullien B(2002)Learning and control in a changing economic environment Journal of Economic Dynamics and Control 26 1359-1377
[5]  
Amman HM(1999)Strategic experimentation Econometrica 67 349-374
[6]  
Neudecker H(2011)Learning the wealth of nations Econometrica 79 1-45
[7]  
Bar-Shalom Y(2005)Data uncertainty and the role of money as an information variable for monetary policy European Economic Review 49 975-1006
[8]  
Sivan R(2008)Optimal experimentation and the perturbation method in the neighborhood of the augmented linear regulator problem Journal of Economics, Dynamics and Control 32 1857-1894
[9]  
Beck G(1988)Controlling a stochastic process with unknown parameters Econometrica 56 1045-1064
[10]  
Wieland V(1989)A value function arising in the economics of information Journal of Economic Dynamics and Control 13 201-223