Can GPT-3 Perform Statutory Reasoning?

被引：19

作者：

Blair-Stanek, Andrew ^{[1
]}

Holzenberger, Nils ^{[2
]}

Van Durme, Benjamin ^{[3
]}

机构：

[1] Univ Maryland, Sch Law, Baltimore, MD 21201 USA

[2] Inst Polytech Paris, Palaiseau, France

[3] Johns Hopkins Univ, Baltimore, MD USA

来源：

PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND LAW, ICAIL 2023 | 2023年

基金：

美国国家科学基金会;

关键词：

natural language processing; reasoning; law; statutes; GPT-3;

D O I：

10.1145/3594536.3595163

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Statutory reasoning is the task of reasoning with facts and statutes, which are rules written in natural language by a legislature. It is a basic legal skill. In this paper we explore the capabilities of the most capable GPT-3 model, text-davinci-003, on an established statutory-reasoning dataset called SARA. We consider a variety of approaches, including dynamic few-shot prompting, chain-of-thought prompting, and zero-shot prompting. While we achieve results with GPT-3 that are better than the previous best published results, we also identify several types of clear errors it makes. We investigate why these errors happen. We discover that GPT-3 has imperfect prior knowledge of the actual U.S. statutes on which SARA is based. More importantly, we create simple synthetic statutes, which GPT-3 is guaranteed not to have seen during training. We find GPT-3 performs poorly at answering straightforward questions about these simple synthetic statutes.

引用

页码：22 / 31

页数：10

共 27 条

[1]

BenWang Aran, 2021, GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model

[2] Judicial knowledge-enhanced magnitude-aware reasoning for numerical legal judgment prediction [J].

Bi, Sheng ;

Zhou, Zhiyao ;

Pan, Lu ;

Qi, Guilin .

ARTIFICIAL INTELLIGENCE AND LAW, 2023, 31 (04) :773-806

[3]

Bommarito I I., 2022, arXiv

[4]

Brown TB, 2020, ADV NEUR IN, V33

[5]

Choi JH, 2022, J LEGAL EDUC, V71, P387

[6]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

[7]

Gutierrez B. J., 2022, Findings of the Association for Computational Linguistics, EMNLP 2022, P4497, DOI 10.18653/v1/2022.findings-emnlp.329,

[8]

Hendrycks Dan., 2021, 9 INT C LEARNING REP

[9]

Holzenberger N, 2020, CEUR WORKSHOP P, V2645, P31

[10]

Holzenberger N, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P2742

← 1 2 3 →