Toward a Model of Intelligence as an Economy of Agents

Author
Eric B. Baum
Affiliation
[1] NEC Research Institute
Source
Machine Learning | 1999 / Vol. 35
Keywords
reinforcement learning; multi-agent systems; planning; evolutionary economics; tragedy of the commons; classifier systems; agoric systems; autonomous programming; cognition; artificial intelligence; Hayek; complex adaptive systems; temporal difference learning; evolutionary computation; economic models of mind; economic models of computation; Blocks World; reasoning; learning; computational learning theory; learning to reason; meta-reasoning;
Abstract
A market-based algorithm is presented which autonomously apportions complex tasks to multiple cooperating agents, giving each agent the motivation of improving performance of the whole system. A specific model, called “The Hayek Machine”, is proposed and tested on a simulated Blocks World (BW) planning problem. Hayek learns to solve more complex BW problems than any previous learning algorithm. Given intermediate reward and simple features, it has learned to efficiently solve arbitrary BW problems. The Hayek Machine can also be seen as a model of evolutionary economics.
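The abstract does not spell out the market mechanism. As a rough illustration only, and not the paper's implementation, the sketch below assumes a simple auction: in each round the highest bidder buys control of the world, pays its bid to the previous owner, acts, and the last owner collects the external reward. The `Agent` class, the `run_episode` function, the counter-style toy world, and all numeric values are hypothetical stand-ins chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class Agent:
    name: str
    bid: float           # hypothetical fixed bid for control of the world
    step: int            # hypothetical action: how far the agent advances the state
    wealth: float = 0.0


def run_episode(agents, goal, reward, max_rounds=10):
    """Run one episode of an assumed auction economy on a toy counter world."""
    state = 0
    owner = None  # agent that currently owns the world
    for _ in range(max_rounds):
        if state == goal:
            break
        # Assumption: only agents whose action would not overshoot may bid.
        bidders = [a for a in agents if state + a.step <= goal]
        if not bidders:
            break
        winner = max(bidders, key=lambda a: a.bid)
        winner.wealth -= winner.bid
        if owner is not None:
            owner.wealth += winner.bid  # payment flows to the previous owner
        state += winner.step
        owner = winner
    if state == goal and owner is not None:
        owner.wealth += reward  # external reward goes to the final owner
    return state


a = Agent("A", bid=1.0, step=1)
b = Agent("B", bid=3.0, step=2)
final_state = run_episode([a, b], goal=3, reward=10.0)
print(final_state, a.wealth, b.wealth)
```

In this toy run, an agent that bids more than it later recovers from its successor ends the episode with less wealth than it started with. That is the kind of pressure an evolutionary economy could use to remove unprofitable agents, although the selection step itself is not modeled here.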
Pages: 155-185
Page count: 30