Thermodynamics as a theory of decision-making with information-processing costs

被引:131
作者
Ortega, Pedro A. [1 ,2 ]
Braun, Daniel A. [1 ,2 ]
机构
[1] Max Planck Inst Biol Cybernet, D-72076 Tubingen, Germany
[2] Max Planck Inst Intelligent Syst, D-72076 Tubingen, Germany
来源
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2013年 / 469卷 / 2153期
关键词
decision-making; bounded rationality; information processing; RISK-SENSITIVITY; BOUNDED RATIONALITY; PROSPECT-THEORY; REPRESENTATION; PERFORMANCE; COMPUTATION; AMBIGUITY; ENERGY; CHOICE;
D O I
10.1098/rspa.2012.0683
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Perfectly rational decision-makers maximize expected utility, but crucially ignore the resource costs incurred when determining optimal actions. Here, we propose a thermodynamically inspired formalization of bounded rational decision-making where information processing is modelled as state changes in thermodynamic systems that can be quantified by differences in free energy. By optimizing a free energy, bounded rational decision-makers trade off expected utility gains and information-processing costs measured by the relative entropy. As a result, the bounded rational decision-making problem can be rephrased in terms of well-known variational principles from statistical physics. In the limit when computational costs are ignored, the maximum expected utility principle is recovered. We discuss links to existing decision-making frameworks and applications to human decision-making experiments that are at odds with expected utility theory. Since most of the mathematical machinery can be borrowed from statistical physics, the main contribution is to reinterpret the formalism of thermodynamic free-energy differences in terms of bounded rational decision-making and to discuss its relationship to human decision-making experiments.
引用
收藏
页数:18
相关论文
共 92 条
[2]  
[Anonymous], 2011, ADAPTIVE DYNAMIC PRO
[3]  
[Anonymous], 1985, Maximum-Entropy and Bayesian Methods in Inverse Problems
[4]  
[Anonymous], 2005, Blink: the power of thinking without thinking
[5]  
[Anonymous], 2010, UAI
[6]  
[Anonymous], 2006, P 20 C NEUR INF PROC
[7]  
[Anonymous], 2000, Utility of gains and losses: Measurement-theoretical and experimental approaches
[8]  
[Anonymous], 1982, Models of bounded rationality: Empirically grounded economic reason
[9]  
[Anonymous], 2006, Pattern recognition and machine learning
[10]   Rationality and bounded rationality [J].
Aumann, RJ .
GAMES AND ECONOMIC BEHAVIOR, 1997, 21 (1-2) :2-14