TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning

被引：0

作者：

Wu, Minchao ^{[1
]}

Norrish, Michael ^{[2
]}

Walder, Christian ^{[1
]}

Dezfouli, Amir ^{[3
]}

机构：

[1] Australian Natl Univ, CSIRO, Data61, Canberra, ACT, Australia

[2] Australian Natl Univ, Canberra, ACT, Australia

[3] CSIRO, Data61, Canberra, ACT, Australia

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel approach to interactive theorem proving (ITP) using deep reinforcement learning. The proposed framework is able to learn proof search strategies as well as tactic and arguments prediction in an end-to-end manner. We formulate the process of ITP as a Markov decision process (MDP) in which each state represents a set of potential derivation paths. This structure allows us to introduce a search mechanism which enables the agent to efficiently discard (predicted) dead-end derivations and restart from promising alternatives. We implement the framework in the HOL4 theorem prover. Experimental results show that the framework using learned search strategies outperforms existing automated theorem provers (i.e. hammers) available in HOL4 when evaluated on unseen problems. We further elaborate the role of key components of the framework using ablation studies.

引用

页数：13

共 50 条

[31] Learning Dynamics and Generalization in Deep Reinforcement Learning
Lyle, Clare
Rowland, Mark
Dabney, Will
Kwiatkowksa, Marta
Gal, Yarin
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[32] Learning Global Optimization by Deep Reinforcement Learning
da Silva Filho, Moesio Wenceslau
Barbosa, Gabriel A.
Miranda, Pericles B. C.
INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 417 - 433
[33] Learning Macromanagement in Starcraft by Deep Reinforcement Learning
Huang, Wenzhen
Yin, Qiyue
Zhang, Junge
Huang, Kaiqi
SENSORS, 2021, 21 (10)
[34] Bayesian Deep Reinforcement Learning via Deep Kernel Learning
Junyu Xuan
Jie Lu
Zheng Yan
Guangquan Zhang
International Journal of Computational Intelligence Systems, 2018, 12 : 164 - 171
[35] Bayesian Deep Reinforcement Learning via Deep Kernel Learning
Xuan, Junyu
Lu, Jie
Yan, Zheng
Zhang, Guangquan
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (01) : 164 - 171
[36] Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning
Hua, Jiang
Zeng, Liangcai
Li, Gongfa
Ju, Zhaojie
SENSORS, 2021, 21 (04) : 1 - 21
[37] Learning for a robot: Deep reinforcement learning, imitation learning, transfer learning
Hua, Jiang
Zeng, Liangcai
Li, Gongfa
Ju, Zhaojie
Sensors (Switzerland), 2021, 21 (04): : 1 - 21
[38] Learning to Drive via Apprenticeship Learning and Deep Reinforcement Learning
Huang, Wenhui
Braghin, Francesco
Wang, Zhuo
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1536 - 1540
[39] Participants Selection for From-Scratch Mobile Crowdsensing via Reinforcement Learning
Hu, Yunfan
Wang, Jiangtao
Wu, Bo
Helal, Sumi
2020 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS (PERCOM 2020), 2020,
[40] Explainability in deep reinforcement learning
Heuillet, Alexandre
Couthouis, Fabien
Diaz-Rodriguez, Natalia
KNOWLEDGE-BASED SYSTEMS, 2021, 214 (214)

← 1 2 3 4 5 →