TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning

被引:0
|
作者
Wu, Minchao [1 ]
Norrish, Michael [2 ]
Walder, Christian [1 ]
Dezfouli, Amir [3 ]
机构
[1] Australian Natl Univ, CSIRO, Data61, Canberra, ACT, Australia
[2] Australian Natl Univ, Canberra, ACT, Australia
[3] CSIRO, Data61, Canberra, ACT, Australia
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel approach to interactive theorem proving (ITP) using deep reinforcement learning. The proposed framework is able to learn proof search strategies as well as tactic and arguments prediction in an end-to-end manner. We formulate the process of ITP as a Markov decision process (MDP) in which each state represents a set of potential derivation paths. This structure allows us to introduce a search mechanism which enables the agent to efficiently discard (predicted) dead-end derivations and restart from promising alternatives. We implement the framework in the HOL4 theorem prover. Experimental results show that the framework using learned search strategies outperforms existing automated theorem provers (i.e. hammers) available in HOL4 when evaluated on unseen problems. We further elaborate the role of key components of the framework using ablation studies.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Learning to Prove Theorems by Learning to Generate Theorems
    Wang, Mingzhe
    Deng, Jia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Learning to Play Precision Ball Sports from scratch: a Deep Reinforcement Learning Approach
    Antao, Liliana
    Sousa, Armando
    Reis, Luis Paulo
    Goncalves, Gil
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [3] From Reinforcement Learning to Deep Reinforcement Learning: An Overview
    Agostinelli, Forest
    Hocquet, Guillaume
    Singh, Sameer
    Baldi, Pierre
    BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 : 298 - 328
  • [4] Learning low level skills from scratch for humanoid robot soccer using deep reinforcement learning
    Abreu, Miguel
    Lau, Nuno
    Sousa, Armando
    Reis, Luis Paulo
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2019), 2019, : 256 - 263
  • [5] A Case Study on Learning a Steering Controller from Scratch with Reinforcement Learning
    Lauer, Martin
    2011 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2011, : 260 - 265
  • [6] From Scratch to Sketch: Deep Decoupled Hierarchical Reinforcement Learning for Robotic Sketching Agent
    Lee, Ganghun
    Kim, Minji
    Lee, Minsu
    Zhang, Byoung-Tak
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5553 - 5559
  • [7] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Morales, Eduardo F.
    Murrieta-Cid, Rafael
    Becerra, Israel
    Esquivel-Basaldua, Marco A.
    INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805
  • [8] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Eduardo F. Morales
    Rafael Murrieta-Cid
    Israel Becerra
    Marco A. Esquivel-Basaldua
    Intelligent Service Robotics, 2021, 14 : 773 - 805
  • [9] The Advance of Reinforcement Learning and Deep Reinforcement Learning
    Lyu, Le
    Shen, Yang
    Zhang, Sicheng
    2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 644 - 648
  • [10] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning
    Tan, Fuxiao
    Yan, Pengfei
    Guan, Xinping
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483