Injecting Domain Knowledge in Neural Networks: A Controlled Experiment on a Constrained Problem

Cited by: 6
Authors
Silvestri, Mattia [1]
Lombardi, Michele [1]
Milano, Michela [1]
Affiliation
[1] Univ Bologna, Bologna, Italy
Source
INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH | 2021, Vol. 12735
DOI
10.1007/978-3-030-78230-6_17
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recent research has shown how Deep Neural Networks trained on historical solution pools can tackle CSPs to some degree, with potential applications in problems with implicit soft and hard constraints. In this paper, we consider a setup where one has offline access to symbolic, incomplete, problem knowledge, which cannot however be employed at search time. We show how such knowledge can be generally treated as a propagator, we devise an approach to distill it in the weights of a network, and we define a simple procedure to extensively exploit even small solution pools. Rather than tackling a real-world application directly, we perform experiments in a controlled setting, i.e. the classical Partial Latin Square completion problem, aimed at identifying patterns, potential advantages, and challenges. Our analysis shows that injecting knowledge at training time can be very beneficial with small solution pools, but may have less reliable effects with large solution pools. Scalability appears as the greatest challenge, as it affects the reliability of the incomplete knowledge and necessitates larger solution pools.
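To make the idea of "incomplete knowledge treated as a propagator" concrete, the following is a minimal sketch for the Partial Latin Square setting. It is not the authors' implementation. It assumes a simple forward-checking-style rule (a value is removed from an empty cell's domain if it already appears in the same row or column), and the names `Grid`, `feasible_values`, `example`, and `domains` are hypothetical. The rule is incomplete in the sense that a value it leaves in a domain may still not lead to a full completion.

```python
from typing import List, Set

# Assumption: a partial Latin square is an n x n grid of ints,
# where 0 marks an empty cell and filled cells hold values 1..n.
Grid = List[List[int]]


def feasible_values(grid: Grid) -> List[List[Set[int]]]:
    """Return, for every cell, the values not ruled out by its row and column.

    This is a forward-checking-style filter: it only removes values that
    already appear in the same row or column. It does not guarantee that
    a remaining value can be extended to a complete Latin square.
    """
    n = len(grid)
    domains: List[List[Set[int]]] = []
    for r in range(n):
        row_domains: List[Set[int]] = []
        for c in range(n):
            if grid[r][c] != 0:
                # A filled cell keeps its assigned value as its only option.
                row_domains.append({grid[r][c]})
                continue
            # Collect values already used in this row and this column.
            used = {grid[r][k] for k in range(n)} | {grid[k][c] for k in range(n)}
            used.discard(0)  # 0 is the empty marker, not a value
            row_domains.append(set(range(1, n + 1)) - used)
        domains.append(row_domains)
    return domains


# Hypothetical 3 x 3 example with two filled cells.
example: Grid = [
    [1, 0, 0],
    [0, 2, 0],
    [0, 0, 0],
]
domains = feasible_values(example)
print(domains[0][1])  # values still allowed in row 0, column 1
```

Per-cell sets of this kind could serve, under the same assumptions, as a training-time signal that penalizes a network for assigning probability to filtered-out values, which is one way to read the abstract's phrase "distill it in the weights of a network".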
Pages: 266-282
Page count: 17
Related papers
50 in total
  • [1] Injecting Domain Knowledge Into Deep Neural Networks for Tree Crown Delineation
    Harmon, Ira
    Marconi, Sergio
    Weinstein, Ben
    Graves, Sarah
    Wang, Daisy Zhe
    Zare, Alina
    Bohlman, Stephanie
    Singh, Aditya
    White, Ethan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [2] Injecting Domain Knowledge from Empirical Interatomic Potentials to Neural Networks for Predicting Material Properties
    Shui, Zeren
    Karls, Daniel S.
    Wen, Mingjian
    Nikiforov, Ilia A.
    Tadmor, Ellad B.
    Karypis, George
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [3] Judgment Prediction via Injecting Legal Knowledge into Neural Networks
    Gan, Leilei
    Kuang, Kun
    Yang, Yi
    Wu, Fei
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12866 - 12874
  • [4] Injecting Semantic Background Knowledge into Neural Networks using Graph Embeddings
    Ziegler, Konstantin
    Caelen, Olivier
    Garchery, Mathieu
    Granitzer, Michael
    He-Guelton, Liyun
    Jurgovsky, Johannes
    Portier, Pierre-Edouard
    Zwicklbauer, Stefan
    2017 IEEE 26TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES - INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2017, : 200 - 205
  • [5] Incorporating symbolic domain knowledge into graph neural networks
    Dash, Tirtharaj
    Srinivasan, Ashwin
    Vig, Lovekesh
    MACHINE LEARNING, 2021, 110 (07) : 1609 - 1636
  • [6] Incorporating symbolic domain knowledge into graph neural networks
    Tirtharaj Dash
    Ashwin Srinivasan
    Lovekesh Vig
    Machine Learning, 2021, 110 : 1609 - 1636
  • [7] Incorporating Prior Domain Knowledge into Deep Neural Networks
    Muralidhar, Nikhil
    Islam, Mohammad Raihanul
    Marwah, Manish
    Karpatne, Anuj
    Ramakrishnan, Naren
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 36 - 45
  • [8] Modeling a user's domain knowledge with neural networks
    Chen, QY
    Norcio, AF
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 1997, 9 (01) : 25 - 40
  • [9] Injecting Chaos in Feedforward Neural Networks
    Ahmed, Sultan Uddin
    Shahjahan, Md.
    Murase, Kazuyuki
    NEURAL PROCESSING LETTERS, 2011, 34 (01) : 87 - 100
  • [10] Injecting Chaos in Feedforward Neural Networks
    Sultan Uddin Ahmed
    Md. Shahjahan
    Kazuyuki Murase
    Neural Processing Letters, 2011, 34 : 87 - 100