Evolving coordinated group behaviours through maximisation of mean mutual information

被引:28
作者
Sperati V. [1 ]
Trianni V. [1 ]
Nolfi S. [1 ]
机构
[1] Laboratory of Autonomous Robotics and Artificial Life, Institute of Cognitive Sciences and Technologies, CNR, 44-00185 Rome, via S. Martino della Battaglia
基金
欧盟地平线“2020”;
关键词
Evolutionary robotics; Information theory; Mutual information;
D O I
10.1007/s11721-008-0017-1
中图分类号
学科分类号
摘要
A well known problem in the design of the control system for a swarm of robots concerns the definition of suitable individual rules that result in the desired coordinated behaviour. A possible solution to this problem is given by the automatic synthesis of the individual controllers through evolutionary or learning processes. These processes offer the possibility to freely search the space of the possible solutions for a given task, under the guidance of a user-defined utility function. Nonetheless, there exist no general principles to follow in the definition of such a utility function in order to reward coordinated group behaviours. As a consequence, task dependent functions must be devised each time a new coordination problem is under study. In this paper, we propose the use of measures developed in Information Theory as task-independent, implicit utility functions. We present two experiments in which three robots are trained to produce generic coordinated behaviours. Each robot is provided with rich sensory and motor apparatus, which can be exploited to explore the environment and to communicate with other robots. We show how coordinated behaviours can be synthesised through a simple evolutionary process. The only criteria used to evaluate the performance of the robotic group is the estimate of mutual information between the motor states of the robots. © Springer Science + Business Media, LLC 2008.
引用
收藏
页码:73 / 95
页数:22
相关论文
共 32 条
  • [1] Baldassarre G., Trianni V., Bonani M., Mondada F., Dorigo M., Nolfi S., Self-organised coordinated motion in groups of physically connected robots, IEEE Transactions on Systems, Man and Cybernetics, Part B: Cybernetics, 37, 1, pp. 224-239, (2007)
  • [2] Brenner N., Bialek W., de Ruyter van Steveninck R., Adaptive rescaling maximizes information transmission, Neuron, 26, pp. 695-702, (2000)
  • [3] Capdepuy P., Polani D., Nehaniv C., Maximization of potential information flow as a universal utility for collective behaviour, Proceedings of the 2007 IEEE Symposium on Artificial Life (CI-ALife 2007), pp. 207-213, (2007)
  • [4] Cianci C.M., Raemy C., Pugh J., Martinoli A., Communication in a swarm of miniature robots: The e-puck as an educational tool for swarm robotics, Lecture Notes in Computer Science, 4433, pp. 103-115, (2007)
  • [5] Feldman D., A Brief Introduction To: Information Theory, Excess Entropy and Computational Mechanics, (2002)
  • [6] Funes P., Orme B., Bonabeau E., Evolving emergent group behaviors for simple humans agents, Lecture Notes in Artificial Intelligence: Vol. 2801. Advances in Artificial Life. Proceedings of the 7th European Conference on Artificial Life (ECAL 2003), pp. 76-89, (2003)
  • [7] Jakobi N., Evolutionary robotics and the radical envelope of noise hypothesis, Adaptive Behavior, 6, pp. 325-368, (1997)
  • [8] Klyubin A., Polani D., Nehaniv C., All else being equal being empowered, Lecture Notes in Artificial Intelligence: Vol. 3630. Advances in Artificial Life. Proceedings of the 8th European Conference on Artificial Life (ECAL 2005), pp. 744-753, (2005)
  • [9] Klyubin A., Polani D., Nehaniv C., Empowerment: A universal agent-centric measure of control, Proceedings of the 2005 IEEE Congress on Evolutionary Computation, pp. 128-135, (2005)
  • [10] Lungarella M., Pfeifer R., Robots as cognitive tools: Information theoretic analysis of sensory-motor data, Proceedings of the 2nd International IEEE/RSJ Conference on Humanoid Robotics, pp. 245-252, (2001)