TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning

被引：0

作者：

Wu, Minchao ^{[1
]}

Norrish, Michael ^{[2
]}

Walder, Christian ^{[1
]}

Dezfouli, Amir ^{[3
]}

机构：

[1] Australian Natl Univ, CSIRO, Data61, Canberra, ACT, Australia

[2] Australian Natl Univ, Canberra, ACT, Australia

[3] CSIRO, Data61, Canberra, ACT, Australia

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel approach to interactive theorem proving (ITP) using deep reinforcement learning. The proposed framework is able to learn proof search strategies as well as tactic and arguments prediction in an end-to-end manner. We formulate the process of ITP as a Markov decision process (MDP) in which each state represents a set of potential derivation paths. This structure allows us to introduce a search mechanism which enables the agent to efficiently discard (predicted) dead-end derivations and restart from promising alternatives. We implement the framework in the HOL4 theorem prover. Experimental results show that the framework using learned search strategies outperforms existing automated theorem provers (i.e. hammers) available in HOL4 when evaluated on unseen problems. We further elaborate the role of key components of the framework using ablation studies.

引用

页数：13

共 50 条

[1] Learning to Prove Theorems by Learning to Generate Theorems
Wang, Mingzhe
Deng, Jia
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[2] Learning to Play Precision Ball Sports from scratch: a Deep Reinforcement Learning Approach
Antao, Liliana
Sousa, Armando
Reis, Luis Paulo
Goncalves, Gil
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[3] From Reinforcement Learning to Deep Reinforcement Learning: An Overview
Agostinelli, Forest
Hocquet, Guillaume
Singh, Sameer
Baldi, Pierre
BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 : 298 - 328
[4] Learning low level skills from scratch for humanoid robot soccer using deep reinforcement learning
Abreu, Miguel
Lau, Nuno
Sousa, Armando
Reis, Luis Paulo
2019 19TH IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2019), 2019, : 256 - 263
[5] A Case Study on Learning a Steering Controller from Scratch with Reinforcement Learning
Lauer, Martin
2011 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2011, : 260 - 265
[6] From Scratch to Sketch: Deep Decoupled Hierarchical Reinforcement Learning for Robotic Sketching Agent
Lee, Ganghun
Kim, Minji
Lee, Minsu
Zhang, Byoung-Tak
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5553 - 5559
[7] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
Morales, Eduardo F.
Murrieta-Cid, Rafael
Becerra, Israel
Esquivel-Basaldua, Marco A.
INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805
[8] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
Eduardo F. Morales
Rafael Murrieta-Cid
Israel Becerra
Marco A. Esquivel-Basaldua
Intelligent Service Robotics, 2021, 14 : 773 - 805
[9] The Advance of Reinforcement Learning and Deep Reinforcement Learning
Lyu, Le
Shen, Yang
Zhang, Sicheng
2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 644 - 648
[10] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning
Tan, Fuxiao
Yan, Pengfei
Guan, Xinping
NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483

← 1 2 3 4 5 →