Exploration and Exploitation During Sequential Search

被引:13
作者
Dam, Gregory [1 ,2 ]
Koerding, Konrad [1 ,2 ]
机构
[1] Rehabil Inst Chicago, Chicago, IL 60611 USA
[2] Northwestern Univ, Feinberg Sch Med, Dept Physiol, Evanston, IL 60208 USA
关键词
Human search behaviour; Neuroeconomics; Skill acquisition; Decision making; Motor control; Mathematical modeling; UNCERTAINTY; STRATEGIES; DECISION; BEHAVIOR;
D O I
10.1111/j.1551-6709.2009.01021.x
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
When we learn how to throw darts we adjust how we throw based oil where the darts stick. Much of skill learning is computationally similar in that we learn using feedback obtained after the completion of individual actions. We can formalize such tasks as a search problem among the set of all possible actions, find the action that leads to the highest reward. In such cases our actions have two objectives: we want to best utilize what we already know (exploitation), but we also want to learn to be more successful in the future (exploration). Here we tested how participants learn movement trajectories where feedback is provided as a monetary reward that depends on the chosen trajectory. We mathematically derived the optimal search policy for our experiment using decision theory. The search behavior of participants is well predicted by an ideal searcher model that optimally combines exploration and exploitation.
引用
收藏
页码:530 / 541
页数:12
相关论文
共 50 条
[31]   Energetic state regulates the exploration-exploitation trade-off in honeybees [J].
Katz, Keziah ;
Naug, Dhruba .
BEHAVIORAL ECOLOGY, 2015, 26 (04) :1045-1050
[32]   Social Distancing in Robot Swarms: Modulating Exploitation and Exploration Without Signal Exchange [J].
Vogrin, Michael ;
Stefanec, Martin ;
Schmickl, Thomas .
2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, :2233-2240
[33]   Frontopolar cortex and decision-making efficiency: comparing brain activity of experts with different professional background during an exploration-exploitation task [J].
Laureiro-Martinez, Daniella ;
Canessa, Nicola ;
Brusoni, Stefano ;
Zollo, Maurizio ;
Hare, Todd ;
Alemanno, Federica ;
Cappa, Stefano F. .
FRONTIERS IN HUMAN NEUROSCIENCE, 2014, 7
[34]   Disentangling the roles of dopamine and noradrenaline in the exploration-exploitation tradeoff during human decision-making [J].
Cremer, Anna ;
Kalbe, Felix ;
Mueller, Jana Christina ;
Wiedemann, Klaus ;
Schwabe, Lars .
NEUROPSYCHOPHARMACOLOGY, 2023, 48 (07) :1078-1086
[35]   Exploitation of phage battery in the search for bioactive actinomycetes [J].
Kurtboeke, D. Ipek .
APPLIED MICROBIOLOGY AND BIOTECHNOLOGY, 2011, 89 (04) :931-937
[36]   Dopaminergic control of the exploration-exploitation trade-off via the basal ganglia [J].
Humphries, Mark D. ;
Khamassi, Mehdi ;
Gurney, Kevin .
FRONTIERS IN NEUROSCIENCE, 2012, 6
[37]   The link between nascent entrepreneurs' role identity aspirations and their opportunity exploration and exploitation activities [J].
Roelandt, Jolien ;
Rijssegem, Laurence ;
Andries, Petra .
APPLIED PSYCHOLOGY-AN INTERNATIONAL REVIEW-PSYCHOLOGIE APPLIQUEE-REVUE INTERNATIONALE, 2023, 72 (03) :1134-1159
[38]   Sequential Search Beats a Two-Parameter Search [J].
Cheng, Raymond .
SEQUENTIAL ANALYSIS-DESIGN METHODS AND APPLICATIONS, 2014, 33 (03) :287-297
[39]   Continuous human learning optimization with enhanced exploitation and exploration [J].
Wang, Ling ;
Jia, Yihao ;
Huang, Bowen ;
Wu, Xian ;
Zhou, Wenju ;
Fei, Minrui .
SOFT COMPUTING, 2023, 28 (7-8) :5795-5852
[40]   The influence of exploration and exploitation on born globals' speed of internationalization [J].
Lin, Song ;
Si, Steven .
MANAGEMENT DECISION, 2019, 57 (01) :193-210