CCN plus : A neuro-symbolic framework for deep learning with requirements

被引:3
作者
Giunchiglia, Eleonora [1 ]
Tatomir, Alex [2 ]
Stoian, Mihaela Catalina [2 ]
Lukasiewicz, Thomas [1 ,2 ]
机构
[1] TU Wien, Inst Log & Computat, Vienna, Austria
[2] Univ Oxford, Dept Comp Sci, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
Machine learning with requirements; Neuro-symbolic AI; Machine learning; Deep learning; Safe AI; Artificial intelligence; CLASSIFICATION; NETWORKS;
D O I
10.1016/j.ijar.2024.109124
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For their outstanding ability of finding hidden patterns in data, deep learning models have been extensively applied in many different domains. However, recent works have shown that, if a set of requirements expressing inherent knowledge about the problem at hand is given, then neural networks often fail to comply with them. This represents a major drawback for deep learning models, as requirements compliance is normally considered a necessary condition for standard software deployment. In this paper, we propose a novel neuro-symbolic framework able to make any neural network compliant by design to a given set of requirements over the output space expressed in full propositional logic. This framework, called CCN + , integrates the requirements into the output layer of the neural network by applying multiple inference rules that ensure compliance with the requirements and adapts the standard binary cross -entropy loss function to the requirement output layer. As a result, not only the outputted predictions are guaranteed to be compliant with the requirements, but the neural network itself learns how to exploit the domain knowledge expressed by the requirements to get better performance. We conduct an extensive experimental evaluation of CCN + on 19 real -world multi -label classification datasets with propositional logic requirements, including a challenging dataset for autonomous driving. Our experimental analysis confirms that CCN + is able to outperform both its neural counterparts and the state-of-the-art models.
引用
收藏
页数:27
相关论文
共 69 条
  • [1] Ahmed K., 2022, P NEURIPS
  • [2] Ahmed K., 2022, P UAI
  • [3] Logic Tensor Networks
    Badreddine, Samy
    Garcez, Artur d'Avila
    Serafini, Luciano
    Spranger, Michael
    [J]. ARTIFICIAL INTELLIGENCE, 2022, 303
  • [4] Bergstra J., 2013, P 30 INT C MACHINE L, V28, P115
  • [5] Biere A., 2021, Frontiers in Artificial Intelligence and Applications, V336
  • [6] Learning multi-label scene classification
    Boutell, MR
    Luo, JB
    Shen, XP
    Brown, CM
    [J]. PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
  • [7] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [8] Reduction strategies for hierarchical multi-label classification in protein function prediction
    Cerri, Ricardo
    Barros, Rodrigo C.
    de Carvalho, Andre C. P. L. F.
    Jin, Yaochu
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [9] A knowledge compilation map
    Darwiche, A
    Marquis, P
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2002, 17 : 229 - 264
  • [10] De Raedt L, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2468