Do Large Language Models Show Human-like Biases? Exploring Confidence-Competence Gap in AI

Cited by: 2
Authors
Singh, Aniket Kumar [1 ]
Lamichhane, Bishal [2 ]
Devkota, Suman [3 ]
Dhakal, Uttam [3 ]
Dhakal, Chandra [4 ]
Affiliations
[1] Youngstown State Univ, Dept Comp Sci & Informat Syst, Youngstown, OH 44555 USA
[2] Univ Nevada, Dept Math & Stat, Reno, NV 89557 USA
[3] Youngstown State Univ, Dept Elect & Comp Engn, Youngstown, OH 44555 USA
[4] Univ Georgia, Dept Agr & Appl Econ, Athens, GA 30602 USA
Keywords
Large Language Models; Dunning-Kruger effects; ChatGPT; BARD; Claude; LLaMA; cognitive biases; artificial intelligence; AI ethics; Natural Language Processing; confidence assessment
DOI
10.3390/info15020092
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology]
Subject Classification Code
0812
Abstract
This study investigates self-assessment tendencies in Large Language Models (LLMs), examining whether their patterns resemble human cognitive biases such as the Dunning-Kruger effect. LLMs, including GPT, BARD, Claude, and LLaMA, are evaluated using confidence scores on reasoning tasks. The models provide self-assessed confidence levels both before and after responding to each question. The results show cases where high confidence does not correlate with correctness, suggesting overconfidence; conversely, low confidence despite accurate responses indicates potential underestimation. Confidence scores vary across problem categories and difficulty levels, with confidence declining for more complex queries. GPT-4 displays consistent confidence, while LLaMA and Claude show more variation. Some of these patterns resemble the Dunning-Kruger effect, in which incompetence leads to inflated self-evaluation. While not conclusive, these observations parallel that phenomenon and provide a foundation for further exploring the alignment of competence and confidence in LLMs. As LLMs continue to expand their societal roles, further research into their self-assessment mechanisms is warranted to fully understand their capabilities and limitations.
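The abstract describes a pre-/post-response confidence-elicitation protocol. The following is a minimal Python sketch of that protocol, not the authors' actual harness: `query_model`, the prompt wordings, and the 0-100 scale are hypothetical placeholders standing in for whichever model API is under test.

```python
# Minimal sketch (not the paper's code) of eliciting self-assessed
# confidence before and after a model answers a question.
# `query_model` is a hypothetical placeholder for the LLM API under
# test (GPT, BARD, Claude, LLaMA, ...).

def query_model(prompt: str) -> str:
    """Placeholder: send `prompt` to the LLM under test and return its reply."""
    raise NotImplementedError("wire this to the model API being evaluated")

def assess_question(question: str, correct_answer: str) -> dict:
    # 1. Elicit confidence BEFORE the model produces an answer.
    pre = query_model(
        "On a scale of 0-100, how confident are you that you can answer "
        f"the following question correctly? Reply with a number only.\n{question}"
    )
    # 2. Obtain the model's actual answer.
    answer = query_model(question)
    # 3. Elicit confidence AFTER the model has answered.
    post = query_model(
        f"You answered: {answer}\n"
        "On a scale of 0-100, how confident are you that this answer is "
        "correct? Reply with a number only."
    )
    # Parsing is simplified; a real harness would validate the numeric reply.
    return {
        "pre_confidence": float(pre),
        "post_confidence": float(post),
        "correct": answer.strip().lower() == correct_answer.strip().lower(),
    }
```

Under this setup, overconfidence corresponds to high pre-/post-confidence on incorrect answers, and underestimation to low confidence on correct ones.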
Pages: 20