Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset

被引:0
|
作者
Ozeki, Kentaro [1 ,2 ]
Ando, Risako [1 ]
Morishita, Takanobu [1 ]
Abe, Hirohiko [1 ]
Mineshima, Koji [1 ]
Okada, Mitsuhiro [1 ]
机构
[1] Keio University, Tokyo, Japan
[2] University of Tokyo, Tokyo, Japan
来源
arXiv |
关键词
Compilation and indexing terms; Copyright 2025 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 8 条
  • [1] Evaluating Large Language Models with NeuBAROCO: Syllogistic Reasoning Ability and Human-like Biases
    Ando, Risako
    Morishita, Takanobu
    Abe, Hirohiko
    Mineshima, Koji
    Okada, Mitsuhiro
    arXiv, 2023,
  • [2] Discovering Research Areas in Dataset Applications through Knowledge Graphs and Large Language Models
    Gerasimov, Irina
    Mehrabian, Armin
    Kc, Binita
    Alfred, Jerome
    Mcguire, Michael P.
    Proceedings - 2024 IEEE 20th International Conference on e-Science, e-Science 2024, 2024,
  • [3] Trojan Detection in Large Language Models: Insights from The Trojan Detection Challenge
    Maloyan, Narek
    Verma, Ekansh
    Nutfullin, Bulat
    Ashinov, Bislan
    arXiv,
  • [4] PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models
    Zhang, Lemei
    Liu, Peng
    Tiedemann, Marcus
    Henriksboe, Oekland
    Lauvrak, Even W.
    Gulla, Jon Atle
    Ramampiaro, Heri
    arXiv,
  • [5] Realizing the cooking recipe of materials synthesis through large language models
    Thik, Jaydeep
    Wang, Siwen
    Wang, Chuhong
    Mansourifar, Hadi
    Lin, Honghong
    Okubo, Keiichi
    Ling, Chen
    JOURNAL OF MATERIALS CHEMISTRY A, 2023, 11 (47) : 25849 - 25853
  • [6] WikiFactDiff: A Large, Realistic, and Temporally Adaptable Dataset for Atomic Factual Knowledge Update in Causal Language Models
    Khodja, Hichem Ammar
    Béchet, Frédéric
    Brabant, Quentin
    Nasr, Alexis
    Lecorvé, Gwénolé
    2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, 2024, : 17614 - 17624
  • [7] Predicting seizure recurrence from medical records using large language models
    Mbizvo, Gashirai K.
    Buchan, Ian
    LANCET DIGITAL HEALTH, 2023, 5 (12): : E851 - E852
  • [8] Think from Words(TFW): Initiating Human-Like Cognition in Large Language Models Through Think from Words for Japanese Text-level Classification
    Gan, Chengguang
    Zhang, Qinghao
    Mori, Tatsunori
    arXiv, 2023,