Thermodynamics as a theory of decision-making with information-processing costs

被引:132
作者
Ortega, Pedro A. [1 ,2 ]
Braun, Daniel A. [1 ,2 ]
机构
[1] Max Planck Inst Biol Cybernet, D-72076 Tubingen, Germany
[2] Max Planck Inst Intelligent Syst, D-72076 Tubingen, Germany
来源
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2013年 / 469卷 / 2153期
关键词
decision-making; bounded rationality; information processing; RISK-SENSITIVITY; BOUNDED RATIONALITY; PROSPECT-THEORY; REPRESENTATION; PERFORMANCE; COMPUTATION; AMBIGUITY; ENERGY; CHOICE;
D O I
10.1098/rspa.2012.0683
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Perfectly rational decision-makers maximize expected utility, but crucially ignore the resource costs incurred when determining optimal actions. Here, we propose a thermodynamically inspired formalization of bounded rational decision-making where information processing is modelled as state changes in thermodynamic systems that can be quantified by differences in free energy. By optimizing a free energy, bounded rational decision-makers trade off expected utility gains and information-processing costs measured by the relative entropy. As a result, the bounded rational decision-making problem can be rephrased in terms of well-known variational principles from statistical physics. In the limit when computational costs are ignored, the maximum expected utility principle is recovered. We discuss links to existing decision-making frameworks and applications to human decision-making experiments that are at odds with expected utility theory. Since most of the mathematical machinery can be borrowed from statistical physics, the main contribution is to reinterpret the formalism of thermodynamic free-energy differences in terms of bounded rational decision-making and to discuss its relationship to human decision-making experiments.
引用
收藏
页数:18
相关论文
共 92 条
[31]   STATE-SPACE FORMULAS FOR ALL STABILIZING CONTROLLERS THAT SATISFY AN H INFINITY-NORM BOUND AND RELATIONS TO RISK SENSITIVITY [J].
GLOVER, K ;
DOYLE, JC .
SYSTEMS & CONTROL LETTERS, 1988, 11 (03) :167-172
[32]   Risk-Sensitivity in Bayesian Sensorimotor Integration [J].
Grau-Moya, Jordi ;
Ortega, Pedro A. ;
Braun, Daniel A. .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (09)
[33]  
Gumbel E. J., 1958, Statistics of extremes
[34]  
Hansen Lars P., 2011, Robustness
[35]   A GENERALIZATION OF THE QUASILINEAR MEAN WITH APPLICATIONS TO THE MEASUREMENT OF INCOME INEQUALITY AND DECISION-THEORY RESOLVING THE ALLAIS PARADOX [J].
HONG, CS .
ECONOMETRICA, 1983, 51 (04) :1065-1092
[37]   Maps of bounded rationality: Psychology for behavioral economics [J].
Kahneman, D .
AMERICAN ECONOMIC REVIEW, 2003, 93 (05) :1449-1475
[38]   PROSPECT THEORY - ANALYSIS OF DECISION UNDER RISK [J].
KAHNEMAN, D ;
TVERSKY, A .
ECONOMETRICA, 1979, 47 (02) :263-291
[39]  
Kahnemann D, 2002, CHOICES VALUES FRAME
[40]   Optimal control as a graphical model inference problem [J].
Kappen, Hilbert J. ;
Gomez, Vicenc ;
Opper, Manfred .
MACHINE LEARNING, 2012, 87 (02) :159-182