PrimeNet: A Framework for Commonsense Knowledge Representation and Reasoning Based on Conceptual Primitives

被引:1
作者
Liu, Qian [1 ]
Han, Sooji [2 ]
Cambria, Erik [3 ]
Li, Yang [4 ]
Kwok, Kenneth [5 ]
机构
[1] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
[2] Intapp, Berlin, Germany
[3] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore, Singapore
[4] Northwestern Polytech Univ, Sch Automat, Xian, Peoples R China
[5] ASTAR, Inst High Performance Comp, Singapore, Singapore
关键词
Commonsense acquisition; Knowledge representation and reasoning; Conceptual primitives; LARGE-SCALE; LANGUAGE; CONSTRUCTION; DBPEDIA; SENSE; MIND;
D O I
10.1007/s12559-024-10345-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Commonsense knowledge acquisition and representation is a core topic in artificial intelligence (AI), which is crucial for building more sophisticated and human-like AI systems. However, existing commonsense knowledge bases organize facts in an isolated manner like bag of facts, lacking the cognitive-level connections that humans commonly possess. People have the ability to efficiently organize vast amounts of knowledge by linking or generalizing concepts using a limited set of conceptual primitives that serve as the fundamental building blocks of reasoning. These conceptual primitives are basic, foundational elements of thought that humans use to make sense of the world. By combining and recombining these primitives, people can construct complex ideas, solve problems, and understand new concepts. To emulate this cognitive mechanism, we design a new commonsense knowledge base, termed PrimeNet, organized in a three-layer structure: a small core of conceptual primitives (e.g., FOOD), a bigger set of concepts that connect to such primitives (e.g., fruit), and an even larger layer of entities connecting to the concepts (e.g., banana). First, we collect commonsense knowledge and employ a gradual expansion strategy for knowledge integration. After refinement, PrimeNet contains 6 million edges between 2 million nodes, with 34 different types of relations. Then, we design a new conceptualization method by leveraging a probabilistic taxonomy, to build the concept layer of PrimeNet. Finally, we conduct primitive detection to build the primitive layer, where a lexical substitution task is used to identify related concepts, and large language models are employed to generate a rational primitive to label each concept cluster as well as verify the primitive detection process.
引用
收藏
页码:3429 / 3456
页数:28
相关论文
共 108 条
  • [1] AlKhamissi B, 2022, ABS220406031 CORR
  • [2] [Anonymous], 2016, P 2016 C EMPIRICAL M, DOI DOI 10.18653/V1/D16-1235
  • [3] [Anonymous], 2006, Conference on Human Factors in Computing Systems
  • [4] [Anonymous], 2001, P 10 INT C WORLD WID, DOI DOI 10.1145/371920.372094
  • [5] DBpedia: A nucleus for a web of open data
    Auer, Soeren
    Bizer, Christian
    Kobilarov, Georgi
    Lehmann, Jens
    Cyganiak, Richard
    Ives, Zachary
    [J]. SEMANTIC WEB, PROCEEDINGS, 2007, 4825 : 722 - +
  • [6] Baker C.F., 1998, P 17 INT C COMPUTAT
  • [7] Baker Simon, 2014, P C EMP METH NAT LAN, P278
  • [8] Banerjee P, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P151
  • [9] Bhagavatula C, 2020, P INT C LEARN REPR I
  • [10] Bhargava P, 2022, AAAI CONF ARTIF INTE, P12317