Testing Stimulus Equivalence in Transformer-Based Agents

被引:0
作者
Carrillo, Alexis [1 ]
Betancort, Moises [1 ]
机构
[1] Univ La Laguna, Dept Psicol Clin Psicobiol & Metodol, Campus Guajara,Apartado 456, San Cristobal De La Lagun 38200, Spain
关键词
stimulus equivalence; transformers; BERT; GPT; matching to sample; reject control; training structures; CONDITIONAL DISCRIMINATION; EMERGENCE; BEHAVIOR;
D O I
10.3390/fi16080289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study investigates the ability of transformer-based models (TBMs) to form stimulus equivalence (SE) classes. We employ BERT and GPT as TBM agents in SE tasks, evaluating their performance across training structures (linear series, one-to-many and many-to-one) and relation types (select-reject, select-only). Our findings demonstrate that both models performed above mastery criterion in the baseline phase across all simulations (n = 12). However, they exhibit limited success in reflexivity, transitivity, and symmetry tests. Notably, both models achieved success only in the linear series structure with select-reject relations, failing in one-to-many and many-to-one structures, and all select-only conditions. These results suggest that TBM may be forming decision rules based on learned discriminations and reject relations, rather than responding according to equivalence class formation. The absence of reject relations appears to influence their responses and the occurrence of hallucinations. This research highlights the potential of SE simulations for: (a) comparative analysis of learning mechanisms, (b) explainability techniques for TBM decision-making, and (c) TBM bench-marking independent of pre-training or fine-tuning. Future investigations can explore upscaling simulations and utilize SE tasks within a reinforcement learning framework.
引用
收藏
页数:24
相关论文
共 52 条
[1]   The Problem of Class Breakdown in Sidman′s (1994,2000) Theory about the Origin of Stimulus Equivalence [J].
Alonso-Alvarez, Benigno .
PERSPECTIVES ON BEHAVIOR SCIENCE, 2023, 46 (01) :217-235
[2]  
Alpaydin E., 2020, INTRO MACHINE LEARNI
[3]   Effects of Procedural Variations in the Training of Negative Relations for the Emergence of Equivalence Relations [J].
Antonio Plazas, Elberto ;
Eduardo Pena, Telmo .
PSYCHOLOGICAL RECORD, 2016, 66 (01) :109-125
[4]  
Arntzen E., 2012, European Journal of Behavior Analysis, V13, P123, DOI DOI 10.1080/15021149.2012.11434412
[5]   STIMULUS EQUIVALENCE AND CONNECTIONISM - IMPLICATIONS FOR BEHAVIOR ANALYSIS AND COGNITIVE SCIENCE [J].
BARNES, D ;
HAMPSON, PJ .
PSYCHOLOGICAL RECORD, 1993, 43 (04) :617-638
[6]   On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? [J].
Bender, Emily M. ;
Gebru, Timnit ;
McMillan-Major, Angelina ;
Shmitchell, Shmargaret .
PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, :610-623
[7]  
Berglund L, 2024, Arxiv, DOI [arXiv:2309.12288, 10.48550/arXiv.2309.12288]
[8]   CONDITIONAL DISCRIMINATION AND EQUIVALENCE-RELATIONS - A THEORETICAL-ANALYSIS OF CONTROL BY NEGATIVE STIMULI [J].
CARRIGAN, PF ;
SIDMAN, M .
JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR, 1992, 58 (01) :183-204
[9]   Differences of Training Structures on Stimulus Class Formation in Computational Agents [J].
Carrillo, Alexis ;
Betancort, Moises .
MULTIMODAL TECHNOLOGIES AND INTERACTION, 2023, 7 (04)
[10]  
Casper S, 2023, Arxiv, DOI arXiv:2307.15217