A shared temporal window of integration across cognitive control and reinforcement learning paradigms: A correlational study

被引：1

作者：

Vasta, Nicola ^{[1
]}

Xu, Shengjie ^{[2
]}

Verguts, Tom ^{[2
]}

Braem, Senne ^{[2
]}

机构：

[1] Univ Trento, Dept Psychol & Cognit Sci, Corso Bettini 31, I-38068 Rovereto, TN, Italy

[2] Univ Ghent, Dept Expt Psychol, Ghent, Belgium

来源：

MEMORY & COGNITION | 2025年 / 53卷 / 03期

基金：

欧洲研究理事会;

关键词：

Cognitive control; Reinforcement learning; Congruency sequence effect; Time scale of control; Learning rate; Shared strategy; WORKING-MEMORY; PREFRONTAL CORTEX; ADAPTATION; MODEL; TIME; INFORMATION; PROPORTION;

D O I：

10.3758/s13421-024-01626-4

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

Cognitive control refers to the ability to override prepotent response tendencies to achieve goal-directed behavior. On the other hand, reinforcement learning refers to the learning of actions through feedback and reward. Although cognitive control and reinforcement learning are often viewed as opposing forces in driving behavior, recent theories have emphasized possible similarities in their underling processes. With this study, we aimed to investigate whether a similar time window of integration could be observed during the learning of control on the one hand, and the learning rate in reinforcement learning paradigms on the other. To this end, we performed a correlational analysis on a large public dataset (n = 522) including data from two reinforcement learning tasks, i.e., a probabilistic selection task and a probabilistic Wisconsin Card Sorting Task (WCST), and data from a classic conflict task (i.e., the Stroop task). Results showed expected correlations between the time scale of control indices and learning rate in the probabilistic WCST. Moreover, the learning-rate parameters of the two reinforcement learning tasks did not correlate with each other. Together, these findings suggest a reliance on a shared learning mechanism between these two traditionally distinct domains, while at the same time emphasizing that value updating processes can still be very task-specific. We speculate that updating processes in the Stroop and WCST may be more related because both tasks require task-specific updating of stimulus features (e.g., color, word meaning, pattern, shape), as opposed to stimulus identity.

引用

页码：1008 / 1021

页数：14

共 65 条

[61]

von Bastian C.C., 2020, arXiv

[62] An EZ-diffusion model for response time and accuracy [J].

Wagenmakers, Eric-Jan ;

van der Maas, Han L. J. ;

Grasman, Raoul P. P. P. .

PSYCHONOMIC BULLETIN & REVIEW, 2007, 14 (01) :3-22

[63] Prefrontal cortex as a meta-reinforcement learning system [J].

Wang, Jane X. ;

Kurth-Nelson, Zeb ;

Kumaran, Dharshan ;

Tirumala, Dhruva ;

Soyer, Hubert ;

Leibo, Joel Z. ;

Hassabis, Demis ;

Botvinick, Matthew .

NATURE NEUROSCIENCE, 2018, 21 (06) :860-+

[64] Different Levels of Learning Interact to Shape the Congruency Sequence Effect [J].

Weissman, Daniel H. ;

Hawks, Zoe W. ;

Egner, Tobias .

JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2016, 42 (04) :566-583

[65] Are Cognitive Control Processes Reliable? [J].

Whitehead, Peter S. ;

Brewer, Gene A. ;

Blais, Chris .

JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2019, 45 (05) :765-778

← 1 2 3 4 5 6 7 →