Do large language models "understand" their knowledge?

被引:0
|
作者
Venkatasubramanian, Venkat [1 ]
机构
[1] Columbia Univ, Dept Chem Engn, Complex Resilient Intelligent Syst Lab, New York, NY 10027 USA
关键词
Knowledge representation; LLM; Industrial revolution 4.0; LKM; Transformers; PROCESS FAULT-DETECTION; QUANTITATIVE MODEL; PART I; FRAMEWORK; DESIGN; SYSTEM;
D O I
10.1002/aic.18661
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Large language models (LLMs) are often criticized for lacking true "understanding" and the ability to "reason" with their knowledge, being seen merely as autocomplete engines. I suggest that this assessment might be missing a nuanced insight. LLMs do develop a kind of empirical "understanding" that is "geometry"-like, which is adequate for many applications. However, this "geometric" understanding, built from incomplete and noisy data, makes them unreliable, difficult to generalize, and lacking in inference capabilities and explanations. To overcome these limitations, LLMs should be integrated with an "algebraic" representation of knowledge that includes symbolic AI elements used in expert systems. This integration aims to create large knowledge models (LKMs) grounded in first principles that can reason and explain, mimicking human expert capabilities. Furthermore, we need a conceptual breakthrough, such as the transformation from Newtonian mechanics to statistical mechanics, to create a new science of LLMs.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Demystifying Data Management for Large Language Models
    Miao, Xupeng
    Jia, Zhihao
    Cui, Bin
    COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 547 - 555
  • [42] Social Value Alignment in Large Language Models
    Abbol, Giulio Antonio
    Marchesi, Serena
    Wykowska, Agnieszka
    Belpaeme, Tony
    VALUE ENGINEERING IN ARTIFICIAL INTELLIGENCE, VALE 2023, 2024, 14520 : 83 - 97
  • [43] Large Language Models on Graphs: A Comprehensive Survey
    Jin, Bowen
    Liu, Gang
    Han, Chi
    Jiang, Meng
    Ji, Heng
    Han, Jiawei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 8622 - 8642
  • [44] BioLORD-2023: semantic textual representations fusing large language models and clinical knowledge graph insights
    Remy, Francois
    Demuynck, Kris
    Demeester, Thomas
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024,
  • [45] LegalReasoner: A Multi-Stage Framework for Legal Judgment Prediction via Large Language Models and Knowledge Integration
    Wang, Xuran
    Zhang, Xinguang
    Hoo, Vanessa
    Shao, Zhouhang
    Zhang, Xuguang
    IEEE ACCESS, 2024, 12 : 166843 - 166854
  • [46] Applications of natural language processing and large language models in materials discovery
    Jiang, Xue
    Wang, Weiren
    Tian, Shaohan
    Wang, Hao
    Lookman, Turab
    Su, Yanjing
    NPJ COMPUTATIONAL MATERIALS, 2025, 11 (01)
  • [47] Performance of Recent Large Language Models for a Low-Resourced Language
    Jayakody, Ravindu
    Dias, Gihan
    2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 162 - 167
  • [48] The Life Cycle of Knowledge in Big Language Models: A Survey
    Cao, Boxi
    Lin, Hongyu
    Han, Xianpei
    Sun, Le
    MACHINE INTELLIGENCE RESEARCH, 2024, 21 (02) : 217 - 238
  • [49] Large Language Models for Conducting Advanced Text Analytics Information Systems Research
    Ampel, Benjamin
    Yang, Chi-heng
    Hu, James
    Chen, Hsinchun
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2025, 16 (01)
  • [50] Automatic detection of contextual laterality in Mammography Reports using Large Language Models
    Godoy, Eduardo
    de Ferrari, Joaquin
    Mellado, Diego
    Chabert, Steren
    Salas, Rodrigo
    2024 14TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS, ICPRS, 2024,