Regularizing deep networks with prior knowledge: A constraint-based approach

Cited by: 30
Authors
Roychowdhury, Soumali [1]
Diligenti, Michelangelo [2]
Gori, Marco [2, 3]
Affiliations
[1] IMT Lucca, Lucca, Italy
[2] Univ Siena, Dept Informat Engn & Math, Siena, Italy
[3] Univ Cote Azur, CNRS, I3S, INRIA, Maasai, Cote Dazur, France
Funding
European Union Horizon 2020;
Keywords
Deep learning; Convolutional neural networks; Image classification; Neuro symbolic methods; First-order logic; Learning from constraints; LOGIC;
DOI
10.1016/j.knosys.2021.106989
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep learning architectures can develop feature representations and classification models in an integrated way during training. This joint learning process requires large networks with many parameters, and it is successful when a large amount of training data is available. Instead of making the learner develop its entire understanding of the world from scratch from the input examples, the injection of prior knowledge into the learner seems to be a principled way to reduce the amount of required training data, as the learner does not need to induce the rules from the data. This paper presents a general framework to integrate arbitrary prior knowledge into learning. The domain knowledge is provided as a collection of first-order logic (FOL) clauses, where each task to be learned corresponds to a predicate in the knowledge base. The logic statements are translated into a set of differentiable constraints, which can be integrated into the learning process to distill the knowledge into the network, or used during inference to enforce the consistency of the predictions with the prior knowledge. Experiments were carried out on multiple image datasets and show that the integration of the prior knowledge boosts the accuracy of several state-of-the-art deep architectures on image classification tasks. (C) 2021 The Authors. Published by Elsevier B.V.
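The translation of a logic clause into a differentiable constraint can be illustrated with a small sketch. The abstract does not say which translation the paper uses, so everything below is an assumption made for illustration: the rule "for all x, cat(x) implies animal(x)", the predicate names, the choice of the Lukasiewicz implication, the averaging used for the universal quantifier, and the weight on the penalty. The penalty is piecewise linear, so it is differentiable almost everywhere and could be added to a training loss.

```python
from typing import Sequence


def implication_truth(antecedent: float, consequent: float) -> float:
    """Truth degree in [0, 1] of `antecedent -> consequent`.

    Uses the Lukasiewicz implication min(1, 1 - a + b). This is an
    assumed choice, not necessarily the one used in the paper.
    """
    return min(1.0, 1.0 - antecedent + consequent)


def constraint_penalty(
    p_antecedent: Sequence[float], p_consequent: Sequence[float]
) -> float:
    """Penalty for the rule `forall x: A(x) -> B(x)` over a batch.

    Each input holds the network's predicted probabilities for one
    predicate, one value per example. The universal quantifier is
    approximated by averaging the per-example violation (assumption).
    """
    if len(p_antecedent) != len(p_consequent):
        raise ValueError("both predicates need one score per example")
    if not p_antecedent:
        raise ValueError("batch must not be empty")
    violations = [
        1.0 - implication_truth(a, c)
        for a, c in zip(p_antecedent, p_consequent)
    ]
    return sum(violations) / len(violations)


def regularized_loss(
    supervised_loss: float, penalty: float, weight: float = 0.1
) -> float:
    """Supervised loss plus the weighted logic penalty (hypothetical weight)."""
    return supervised_loss + weight * penalty


if __name__ == "__main__":
    # Hypothetical predictions for cat(x) and animal(x) on two images.
    p_cat = [0.9, 0.2]
    p_animal = [0.1, 0.8]
    penalty = constraint_penalty(p_cat, p_animal)
    print(regularized_loss(supervised_loss=1.0, penalty=penalty))
```

In this sketch only the first example violates the rule (a confident "cat" with a low "animal" score), so it alone contributes to the penalty. The same penalty could also be evaluated at inference time to check whether a set of predictions is consistent with the rule.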
Pages: 10