Multi-Agent Reinforcement Learning and Chimpanzee Hunting

被引:3
作者
Sauter, Michael Z. [1 ]
Shi, Dongqing [1 ]
Kralik, Jerald D. [1 ]
机构
[1] Dartmouth Coll, Dept Psychol & Brain Sci, Hanover, NH 03755 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2009), VOLS 1-4 | 2009年
关键词
D O I
10.1109/ROBIO.2009.5420602
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of multi-agent reinforcement learning is growing because of it's ability to scale in complexity and its lack of need for knowledge of the state and other agents. Chimpanzee hunting behavior is a suitable complex and interesting model for which multi-agent reinforcement learning is appropriate. Chimpanzee hunting strategies vary in both use and complexity and ultimately depend on the environment for which they are applied. Learning to use the varying strategies and learning when they are most effective is what this paper addresses and provides initial results and framework to build upon.
引用
收藏
页码:622 / 626
页数:5
相关论文
共 13 条
[1]  
[Anonymous], 1994, P 11 INT C INT C MAC
[2]  
Boesch C., 1996, CHIMPANZEE CULTURES, P77
[3]  
Boesch Christophe, 1989, AM J PHYS ANTHR
[4]   Multiagent learning using a variable learning rate [J].
Bowling, M ;
Veloso, M .
ARTIFICIAL INTELLIGENCE, 2002, 136 (02) :215-250
[5]  
Conitzer V.., 2003, MACHINE LEARNING P 2, P83
[6]   A division of labour with role specialization in group-hunting bottlenose dolphins (Tursiops truncatus) off Cedar Key, Florida [J].
Gazda, SK ;
Connor, RC ;
Edgar, RK ;
Cox, F .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2005, 272 (1559) :135-140
[7]  
Hespanha J., 1999, P 38 C DEC CONTR
[8]  
Junling Hu, 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P242
[9]  
Littman M. L., 2001, Cognitive Systems Research, V2, P55, DOI 10.1016/S1389-0417(01)00015-8
[10]   Ecological change, group territoriality, and population dynamics in Serengeti lions [J].
Packer, C ;
Hilborn, R ;
Mosser, A ;
Kissui, B ;
Borner, M ;
Hopcraft, G ;
Wilmshurst, J ;
Mduma, S ;
Sinclair, ARE .
SCIENCE, 2005, 307 (5708) :390-393