Exploration and Exploitation During Sequential Search

被引:13
作者
Dam, Gregory [1 ,2 ]
Koerding, Konrad [1 ,2 ]
机构
[1] Rehabil Inst Chicago, Chicago, IL 60611 USA
[2] Northwestern Univ, Feinberg Sch Med, Dept Physiol, Evanston, IL 60208 USA
关键词
Human search behaviour; Neuroeconomics; Skill acquisition; Decision making; Motor control; Mathematical modeling; UNCERTAINTY; STRATEGIES; DECISION; BEHAVIOR;
D O I
10.1111/j.1551-6709.2009.01021.x
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
When we learn how to throw darts we adjust how we throw based oil where the darts stick. Much of skill learning is computationally similar in that we learn using feedback obtained after the completion of individual actions. We can formalize such tasks as a search problem among the set of all possible actions, find the action that leads to the highest reward. In such cases our actions have two objectives: we want to best utilize what we already know (exploitation), but we also want to learn to be more successful in the future (exploration). Here we tested how participants learn movement trajectories where feedback is provided as a monetary reward that depends on the chosen trajectory. We mathematically derived the optimal search policy for our experiment using decision theory. The search behavior of participants is well predicted by an ideal searcher model that optimally combines exploration and exploitation.
引用
收藏
页码:530 / 541
页数:12
相关论文
共 50 条
[41]   Exploration-Exploitation Strategy is Dependent on Early Experience [J].
Humphreys, Kathryn L. ;
Lee, Steve S. ;
Telzer, Eva H. ;
Gabard-Durnam, Laurel J. ;
Goff, Bonnie ;
Flannery, Jessica ;
Tottenham, Nim .
DEVELOPMENTAL PSYCHOBIOLOGY, 2015, 57 (03) :313-321
[42]   Sequential search with a price freeze option: theory and experimental evidence [J].
Marcu, Emanuel ;
Noussair, Charles N. .
EXPERIMENTAL ECONOMICS, 2024, 27 (05) :1106-1139
[43]   Modeling Search Behaviors during the Acquisition of Expertise in a Sequential Decision-Making Task [J].
Moenne-Loccoz, Cristobal ;
Vergara, Rodrigo C. ;
Lopez, Vladimir ;
Mery, Domingo ;
Cosmelli, Diego .
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2017, 11
[44]   Effects of theta transcranial alternating current stimulation (tACS) on exploration and exploitation during uncertain decision-making [J].
Wischnewski, Miles ;
Compen, Boukje .
BEHAVIOURAL BRAIN RESEARCH, 2022, 426
[45]   Learning the value of information and reward over time when solving exploration-exploitation problems [J].
Dezza, Irene Cogliati ;
Yu, Angela J. ;
Cleeremans, Axel ;
Alexander, William .
SCIENTIFIC REPORTS, 2017, 7
[46]   Internal states drive nutrient homeostatis by modulating exploration-exploitation trade-off [J].
Corrales-Carvajal, Veronica Maria ;
Faisal, Aldo A. ;
Ribeiro, Carlos .
ELIFE, 2016, 5
[47]   Sequential Search and Learning from Rank Feedback: Theory and Experimental Evidence [J].
Palley, Asa B. ;
Kremer, Mirko .
MANAGEMENT SCIENCE, 2014, 60 (10) :2525-2542
[48]   OPTIMAL SEQUENTIAL FILE-SEARCH [J].
MONAHAN, GE .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1994, 77 (02) :224-240
[49]   The role of anticipated emotions and the value of information in determining sequential search incentives [J].
Di Caprio, Debora ;
Santos-Arteaga, Francisco J. ;
Tavana, Madjid .
OPERATIONS RESEARCH PERSPECTIVES, 2019, 6
[50]   The Impact of Feature Exploitation and Exploration on Mobile Application Evolution and Success [J].
Shuraida, Shadi ;
Gao, Qiang ;
Safadi, Hani ;
Jain, Radhika .
JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2024, 25 (03) :648-686