Enriching behavioral ecology with reinforcement learning methods

被引：35

作者：

Frankenhuis, Willem E. ^{[1
]}

Panchanathan, Karthik ^{[2
]}

Barto, Andrew G. ^{[3
]}

机构：

[1] Radboud Univ Nijmegen, Behav Sci Inst, Montessorilaan 3,POB 9104, NL-6500 HE Nijmegen, Netherlands

[2] Univ Missouri, Dept Anthropol, 200 Swallow Hall, Columbia, MO 65211 USA

[3] Univ Massachusetts, Coll Informat & Comp Sci, Amherst, MA 01003 USA

来源：

BEHAVIOURAL PROCESSES | 2019年 / 161卷

关键词：

Adaptation; Evolution; Development; Learning; Dynamic programming; Reinforcement learning; EVOLUTIONARY PSYCHOLOGY; INFORMATION; ADAPTATION; MODEL; PLASTICITY; GAME; PERSPECTIVE; UNCERTAIN; SELECTION; GENETICS;

D O I：

10.1016/j.beproc.2018.01.008

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

This article focuses on the division of labor between evolution and development in solving sequential, state-dependent decision problems. Currently, behavioral ecologists tend to use dynamic programming methods to study such problems. These methods are successful at predicting animal behavior in a variety of contexts. However, they depend on a distinct set of assumptions. Here, we argue that behavioral ecology will benefit from drawing more than it currently does on a complementary collection of tools, called reinforcement learning methods. These methods allow for the study of behavior in highly complex environments, which conventional dynamic programming methods do not feasibly address. In addition, reinforcement learning methods are well-suited to studying how biological mechanisms solve developmental and learning problems. For instance, we can use them to study simple rules that perform well in complex environments. Or to investigate under what conditions natural selection favors fixed, non-plastic traits (which do not vary across individuals), cue-driven-switch plasticity (innate instructions for adaptive behavioral development based on experience), or developmental selection (the incremental acquisition of adaptive behavior based on experience). If natural selection favors developmental selection, which includes learning from environmental feedback, we can also make predictions about the design of reward systems. Our paper is written in an accessible manner and for a broad audience, though we believe some novel insights can be drawn from our discussion. We hope our paper will help advance the emerging bridge connecting the fields of behavioral ecology and reinforcement learning.

引用

页码：94 / 100

页数：7

共 100 条

[11] Evolutionary tipping points in the capacity to adapt to environmental change [J].

Botero, Carlos A. ;

Weissing, Franz J. ;

Wright, Jonathan ;

Rubenstein, Dustin R. .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (01) :184-189

[12] SCIENCE AND STATISTICS [J].

BOX, GEP .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1976, 71 (356) :791-799

[13]

Clark C. W., 2000, OX ECOL EV

[14] Evolutionary Psychology: New Perspectives on Cognition and Motivation [J].

Cosmides, Leda ;

Tooby, John .

ANNUAL REVIEW OF PSYCHOLOGY, VOL 64, 2013, 64 :201-+

[15] Genes as cues: phenotypic integration of genetic and epigenetic information from a Darwinian perspective [J].

Dall, Sasha R. X. ;

McNamara, John M. ;

Leimar, Olof .

TRENDS IN ECOLOGY & EVOLUTION, 2015, 30 (06) :327-333

[16] ADAPTIVE FLEXIBILITY IN THE FORAGING BEHAVIOR OF FISHES [J].

DILL, LM .

CANADIAN JOURNAL OF FISHERIES AND AQUATIC SCIENCES, 1983, 40 (04) :398-408

[17]

Donaldson-Matasci MC, 2008, EVOL ECOL RES, V10, P493

[18] Learning to Cooperate: The Evolution of Social Rewards in Repeated Interactions [J].

Dridi, Slimane ;

Akcay, Erol .

AMERICAN NATURALIST, 2018, 191 (01) :58-73

[19] Environmental complexity favors the evolution of learning [J].

Dridi, Slimane ;

Lehmann, Laurent .

BEHAVIORAL ECOLOGY, 2016, 27 (03) :842-850

[20] A model for the evolution of reinforcement learning in fluctuating games [J].

Dridi, Slimane ;

Lehmann, Laurent .

ANIMAL BEHAVIOUR, 2015, 104 :87-114

← 1 2 3 4 5 6 7 8 9 10 →