From Reward to Histone: Combining Temporal-Difference Learning and Epigenetic Inheritance for Swarm's Coevolving Decision Making

Cited by: 3

Authors:

Mukhlish, Faqihza [1]

Page, John [1]

Bain, Michael [2]

Affiliations:

[1] Univ New South Wales, Sch Mech & Mfg Engn, Sydney, NSW, Australia

[2] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia

Source:

10TH IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL-EPIROB 2020) | 2020

Keywords:

Epigenetic; Swarm; Coevolving; Learning; Multi-Agent; Decision-Making; Robotics

DOI:

10.1109/icdl-epirob48136.2020.9278049

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Applying intelligence to a group of simple robots known as swarm robots has become an exciting technology in assisting or replacing humans to fulfil complex, dangerous and harsh missions. However, building a strategy for a swarm to thrive in a dynamic environment is challenging because of control decentralisation and interactions between agents. The decision-making process in a robotic task commonly takes place in sequential stages. By understanding the subsequent action-reaction process, a strategy to make optimal decisions in a respective environment can be learnt. Hence, using the concept of epigenetic inheritance, novel evolutionary-learning mechanisms for a swarm will be discussed in this paper. Reinforcement evolutionary learning using epigenetic inheritance (RELEpi) is proposed in this article. This method utilizes reward, temporal difference and epigenetic inheritance to approximate optimal action and behaviour policies. The proposed method opens possibilities to combine reward-based learning and evolutionary methods as a stacked process where histone value is used rather than fitness function. The formulation consists of methylation and epigenetic mechanisms, inspired by the epigenome studies. The methylation process helps the accumulation of the reward to histone value of the gene. Epigenetic mechanisms give the ability to mate genetic information along with their histone value.
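The abstract names the ingredients of RELEpi (reward, temporal difference, a per-gene histone value, and mating that carries histone values along) but gives no update equations. The Python sketch below is therefore only an illustration of how such pieces could fit together, not the authors' formulation. It assumes a TD(0)-style update for the methylation step, uniform crossover for mating, and histone-proportional parent selection in place of a fitness function; all function names, the (action, histone) gene layout, and the parameters alpha and gamma are hypothetical.

```python
import random


def methylate(histone, reward, next_histone, alpha=0.1, gamma=0.9):
    """Assumed TD(0)-style step: move a gene's histone value toward
    reward + gamma * next_histone (not taken from the paper)."""
    return histone + alpha * (reward + gamma * next_histone - histone)


def mate(parent_a, parent_b, rng):
    """Uniform crossover over genomes of (action, histone) pairs.
    Each inherited action keeps the histone value learnt for it."""
    return [rng.choice(pair) for pair in zip(parent_a, parent_b)]


def select_parent(population, rng):
    """Pick one genome with probability proportional to its summed
    histone value, shifted so that every weight is positive."""
    scores = [sum(h for _, h in genome) for genome in population]
    low = min(scores)
    weights = [s - low + 1e-6 for s in scores]
    return rng.choices(population, weights=weights, k=1)[0]


if __name__ == "__main__":
    rng = random.Random(0)
    # Two hypothetical genomes: one (action, histone) gene per decision stage.
    a = [("left", 0.2), ("right", 0.5)]
    b = [("right", 0.9), ("left", 0.1)]
    # Methylation: a reward of 1.0 raises the first gene's histone value.
    action, h = a[0]
    a[0] = (action, methylate(h, reward=1.0, next_histone=a[1][1]))
    # Mating: the child inherits actions together with their histone values.
    child = mate(select_parent([a, b], rng), select_parent([a, b], rng), rng)
    print(child)
```

In this sketch the histone value plays the role the abstract assigns to it: it accumulates reward during an agent's lifetime and then drives selection and inheritance, so no separate fitness evaluation is needed.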


Pages: 6
