Refining neural network predictions using background knowledge

被引：6

作者：

Daniele, Alessandro ^{[1
]}

van Krieken, Emile ^{[2
]}

Serafini, Luciano ^{[1
]}

van Harmelen, Frank ^{[2
]}

机构：

[1] Fdn Bruno Kessler, Data & Knowledge Management Unit, Via Sommar 18, I-38123 Trento, Italy

[2] Vrije Univ Amsterdam, Dept Comp Sci, De Boelelaan 1081a, NL-1081 HV Amsterdam, Netherlands

来源：

MACHINE LEARNING | 2023年 / 112卷 / 09期

关键词：

Neurosymbolic AI; Fuzzy logic; Optimization; TRIANGULAR NORMS;

D O I：

10.1007/s10994-023-06310-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent work has shown learning systems can use logical background knowledge to compensate for a lack of labeled training data. Many methods work by creating a loss function that encodes this knowledge. However, often the logic is discarded after training, even if it is still helpful at test time. Instead, we ensure neural network predictions satisfy the knowledge by refining the predictions with an extra computation step. We introduce differentiable refinement functions that find a corrected prediction close to the original prediction. We study how to effectively and efficiently compute these refinement functions. Using a new algorithm called iterative local refinement (ILR), we combine refinement functions to find refined predictions for logical formulas of any complexity. ILR finds refinements on complex SAT formulas in significantly fewer iterations and frequently finds solutions where gradient descent can not. Finally, ILR produces competitive results in the MNIST addition task.

引用

页码：3293 / 3331

页数：39

共 35 条

[1]

Ahmed K., 2022, ARXIV

[2]

Alsina C., 1984, General Inequalities, V4, P241

[3] Logic Tensor Networks [J].

Badreddine, Samy ;

Garcez, Artur d'Avila ;

Serafini, Luciano ;

Spranger, Michael .

ARTIFICIAL INTELLIGENCE, 2022, 303

[4]

Calvo T, 2002, STUD FUZZ SOFT COMP, V97, P3

[5]

Chowdhery Aakanksha, 2022, arXiv

[6] SUBGRADIENT CRITERIA FOR MONOTONICITY, THE LIPSCHITZ CONDITION, AND CONVEXITY [J].

CLARKE, FH ;

STERN, RJ ;

WOLENSKI, PR .

CANADIAN JOURNAL OF MATHEMATICS-JOURNAL CANADIEN DE MATHEMATIQUES, 1993, 45 (06) :1167-1183

[7]

Daniele A., 2022, ARXIV

[8] Knowledge Enhanced Neural Networks [J].

Daniele, Alessandro ;

Serafini, Luciano .

PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2019, 11670 :542-554

[9] Semantic-based regularization for learning and inference [J].

Diligenti, Michelangelo ;

Gori, Marco ;

Sacca, Claudio .

ARTIFICIAL INTELLIGENCE, 2017, 244 :143-165

[10]

Donadello I, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1596

← 1 2 3 4 →