Do large language models "understand" their knowledge?

被引：0

作者：

Venkatasubramanian, Venkat ^{[1
]}

机构：

[1] Columbia Univ, Dept Chem Engn, Complex Resilient Intelligent Syst Lab, New York, NY 10027 USA

来源：

AICHE JOURNAL | 2025年 / 71卷 / 03期

关键词：

Knowledge representation; LLM; Industrial revolution 4.0; LKM; Transformers; PROCESS FAULT-DETECTION; QUANTITATIVE MODEL; PART I; FRAMEWORK; DESIGN; SYSTEM;

D O I：

10.1002/aic.18661

中图分类号：

TQ [化学工业];

学科分类号：

0817 ;

摘要：

Large language models (LLMs) are often criticized for lacking true "understanding" and the ability to "reason" with their knowledge, being seen merely as autocomplete engines. I suggest that this assessment might be missing a nuanced insight. LLMs do develop a kind of empirical "understanding" that is "geometry"-like, which is adequate for many applications. However, this "geometric" understanding, built from incomplete and noisy data, makes them unreliable, difficult to generalize, and lacking in inference capabilities and explanations. To overcome these limitations, LLMs should be integrated with an "algebraic" representation of knowledge that includes symbolic AI elements used in expert systems. This integration aims to create large knowledge models (LKMs) grounded in first principles that can reason and explain, mimicking human expert capabilities. Furthermore, we need a conceptual breakthrough, such as the transformation from Newtonian mechanics to statistical mechanics, to create a new science of LLMs.

引用

页数：10

共 50 条

[41] Demystifying Data Management for Large Language Models
Miao, Xupeng
Jia, Zhihao
Cui, Bin
COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 547 - 555
[42] Social Value Alignment in Large Language Models
Abbol, Giulio Antonio
Marchesi, Serena
Wykowska, Agnieszka
Belpaeme, Tony
VALUE ENGINEERING IN ARTIFICIAL INTELLIGENCE, VALE 2023, 2024, 14520 : 83 - 97
[43] Large Language Models on Graphs: A Comprehensive Survey
Jin, Bowen
Liu, Gang
Han, Chi
Jiang, Meng
Ji, Heng
Han, Jiawei
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 8622 - 8642
[44] BioLORD-2023: semantic textual representations fusing large language models and clinical knowledge graph insights
Remy, Francois
Demuynck, Kris
Demeester, Thomas
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024,
[45] LegalReasoner: A Multi-Stage Framework for Legal Judgment Prediction via Large Language Models and Knowledge Integration
Wang, Xuran
Zhang, Xinguang
Hoo, Vanessa
Shao, Zhouhang
Zhang, Xuguang
IEEE ACCESS, 2024, 12 : 166843 - 166854
[46] Applications of natural language processing and large language models in materials discovery
Jiang, Xue
Wang, Weiren
Tian, Shaohan
Wang, Hao
Lookman, Turab
Su, Yanjing
NPJ COMPUTATIONAL MATERIALS, 2025, 11 (01)
[47] Performance of Recent Large Language Models for a Low-Resourced Language
Jayakody, Ravindu
Dias, Gihan
2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 162 - 167
[48] The Life Cycle of Knowledge in Big Language Models: A Survey
Cao, Boxi
Lin, Hongyu
Han, Xianpei
Sun, Le
MACHINE INTELLIGENCE RESEARCH, 2024, 21 (02) : 217 - 238
[49] Large Language Models for Conducting Advanced Text Analytics Information Systems Research
Ampel, Benjamin
Yang, Chi-heng
Hu, James
Chen, Hsinchun
ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2025, 16 (01)
[50] Automatic detection of contextual laterality in Mammography Reports using Large Language Models
Godoy, Eduardo
de Ferrari, Joaquin
Mellado, Diego
Chabert, Steren
Salas, Rodrigo
2024 14TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS, ICPRS, 2024,

← 1 2 3 4 5 →