The unreasonable effectiveness of large language models in zero-shot semantic annotation of legal texts

被引:11
作者
Savelka, Jaromir [1 ]
Ashley, Kevin D. [2 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Sch Law, Pittsburgh, PA 15260 USA
来源
FRONTIERS IN ARTIFICIAL INTELLIGENCE | 2023年 / 6卷
关键词
legal text analytics; large language models (LLM); zero-shot classification; semantic annotation; text annotation; CLASSIFICATION; EXTRACTION; DECISIONS; SEARCH;
D O I
10.3389/frai.2023.1279794
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The emergence of ChatGPT has sensitized the general public, including the legal profession, to large language models' (LLMs) potential uses (e.g., document drafting, question answering, and summarization). Although recent studies have shown how well the technology performs in diverse semantic annotation tasks focused on legal texts, an influx of newer, more capable (GPT-4) or cost-effective (GPT-3.5-turbo) models requires another analysis. This paper addresses recent developments in the ability of LLMs to semantically annotate legal texts in zero-shot learning settings. Given the transition to mature generative AI systems, we examine the performance of GPT-4 and GPT-3.5-turbo(-16k), comparing it to the previous generation of GPT models, on three legal text annotation tasks involving diverse documents such as adjudicatory opinions, contractual clauses, or statutory provisions. We also compare the models' performance and cost to better understand the trade-offs. We found that the GPT-4 model clearly outperforms the GPT-3.5 models on two of the three tasks. The cost-effective GPT-3.5-turbo matches the performance of the 20x more expensive text-davinci-003 model. While one can annotate multiple data points within a single prompt, the performance degrades as the size of the batch increases. This work provides valuable information relevant for many practical applications (e.g., in contract review) and research projects (e.g., in empirical legal studies). Legal scholars and practicing lawyers alike can leverage these findings to guide their decisions in integrating LLMs in a wide range of workflows involving semantic annotation of legal texts.
引用
收藏
页数:14
相关论文
共 41 条
  • [31] Zero-shot urban function inference with street view images through prompting a pretrained vision-language model
    Huang, Weiming
    Wang, Jing
    Cong, Gao
    [J]. INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2024, 38 (07) : 1414 - 1442
  • [32] On the Effectiveness of Pre-Trained Language Models for Legal Natural Language Processing: An Empirical Study
    Song, Dezhao
    Gao, Sally
    He, Baosheng
    Schilder, Frank
    [J]. IEEE ACCESS, 2022, 10 : 75835 - 75858
  • [33] Web-Scale Semantic Product Search with Large Language Models
    Muhamed, Aashiq
    Srinivasan, Sriram
    Teo, Choon-Hui
    Cui, Qingjun
    Zeng, Belinda
    Chilimbi, Trishul
    Vishwanathan, S. V. N.
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III, 2023, 13937 : 73 - 85
  • [34] Negation typology and general representation models for cross-lingual zero-shot negation scope resolution in Russian, French, and Spanish
    Shaitarova, Anastassia
    Rinaldi, Fabio
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 15 - 23
  • [35] Advancing Large Language Models for Spatiotemporal and Semantic Association Mining of Similar Environmental Events
    Tian, Yuanyuan
    Li, Wenwen
    Hu, Lei
    Chen, Xiao
    Brook, Michael
    Brubaker, Michael
    Zhang, Fan
    Liljedahl, Anna K.
    [J]. TRANSACTIONS IN GIS, 2025, 29 (01)
  • [36] Generalized Zero-Shot Chest X-Ray Diagnosis Through Trait-Guided Multi-View Semantic Embedding With Self-Training
    Paul, Angshuman
    Shen, Thomas C.
    Lee, Sungwon
    Balachandar, Niranjan
    Peng, Yifan
    Lu, Zhiyong
    Summers, Ronald M.
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (10) : 2642 - 2655
  • [37] Construction of Legal Knowledge Graph Based on Knowledge-Enhanced Large Language Models
    Li, Jun
    Qian, Lu
    Liu, Peifeng
    Liu, Taoxiong
    [J]. INFORMATION, 2024, 15 (11)
  • [38] DILF: Differentiable rendering-based multi-view Image-Language Fusion for zero-shot 3D shape understanding
    Ning, Xin
    Yu, Zaiyang
    Li, Lusi
    Li, Weijun
    Tiwari, Prayag
    [J]. INFORMATION FUSION, 2024, 102
  • [39] From Text to Structure: Using Large Language Models to Support the Development of Legal Expert Systems
    Janatian, Samyar
    Westermann, Hannes
    Tan, Jinzhe
    Savelka, Jaromir
    Benyekhlef, Karim
    [J]. LEGAL KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 379 : 167 - 176
  • [40] Using large language models for extracting and pre-annotating texts on mental health from noisy data in a low-resource language
    Koltcov, Sergei
    Surkov, Anton
    Koltsova, Olessia
    Ignatenko, Vera
    [J]. PEERJ COMPUTER SCIENCE, 2024, 10 : 1 - 19