The unreasonable effectiveness of large language models in zero-shot semantic annotation of legal texts

被引:11
作者
Savelka, Jaromir [1 ]
Ashley, Kevin D. [2 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Sch Law, Pittsburgh, PA 15260 USA
来源
FRONTIERS IN ARTIFICIAL INTELLIGENCE | 2023年 / 6卷
关键词
legal text analytics; large language models (LLM); zero-shot classification; semantic annotation; text annotation; CLASSIFICATION; EXTRACTION; DECISIONS; SEARCH;
D O I
10.3389/frai.2023.1279794
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The emergence of ChatGPT has sensitized the general public, including the legal profession, to large language models' (LLMs) potential uses (e.g., document drafting, question answering, and summarization). Although recent studies have shown how well the technology performs in diverse semantic annotation tasks focused on legal texts, an influx of newer, more capable (GPT-4) or cost-effective (GPT-3.5-turbo) models requires another analysis. This paper addresses recent developments in the ability of LLMs to semantically annotate legal texts in zero-shot learning settings. Given the transition to mature generative AI systems, we examine the performance of GPT-4 and GPT-3.5-turbo(-16k), comparing it to the previous generation of GPT models, on three legal text annotation tasks involving diverse documents such as adjudicatory opinions, contractual clauses, or statutory provisions. We also compare the models' performance and cost to better understand the trade-offs. We found that the GPT-4 model clearly outperforms the GPT-3.5 models on two of the three tasks. The cost-effective GPT-3.5-turbo matches the performance of the 20x more expensive text-davinci-003 model. While one can annotate multiple data points within a single prompt, the performance degrades as the size of the batch increases. This work provides valuable information relevant for many practical applications (e.g., in contract review) and research projects (e.g., in empirical legal studies). Legal scholars and practicing lawyers alike can leverage these findings to guide their decisions in integrating LLMs in a wide range of workflows involving semantic annotation of legal texts.
引用
收藏
页数:14
相关论文
共 41 条
  • [21] Generative Zero-Shot Learning via Low-Rank Embedded Semantic Dictionary
    Ding, Zhengming
    Shao, Ming
    Fu, Yun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (12) : 2861 - 2874
  • [22] Towards Unbiased Multi-Label Zero-Shot Learning With Pyramid and Semantic Attention
    Liu, Ziming
    Guo, Song
    Guo, Jingcai
    Xu, Yuanyuan
    Huo, Fushuo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7441 - 7455
  • [23] Large-scale zero-shot learning in the wild: Classifying zoological illustrations
    Stork, Lise
    Weber, Andreas
    van den Herik, Jaap
    Plaat, Aske
    Verbeek, Fons
    Wolstencroft, Katherine
    ECOLOGICAL INFORMATICS, 2021, 62
  • [24] Best Practices for Text Annotation with Large Language Models
    Toernberg, Petter
    SOCIOLOGICA-INTERNATIONAL JOURNAL FOR SOCIOLOGICAL DEBATE, 2024, 18 (02): : 67 - 85
  • [25] Big data in myoelectric control: large multi-user models enable robust zero-shot EMG-based discrete gesture recognition
    Eddy, Ethan
    Campbell, Evan
    Bateman, Scott
    Scheme, Erik
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2024, 12
  • [26] A Novel Perspective to Zero-Shot Learning: Towards an Alignment of Manifold Structures via Semantic Feature Expansion
    Guo, Jingcai
    Guo, Song
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 524 - 537
  • [27] Zero-shot transfer learned generic AI models for prediction of optimally ripe climacteric fruits
    Dutta, Jayita
    Patwardhan, Manasi
    Deshpande, Parijat
    Karande, Shirish
    Rai, Beena
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [28] Domain Adaptation Meets Zero-Shot Learning: An Annotation-Efficient Approach to Multi-Modality Medical Image Segmentation
    Bian, Cheng
    Yuan, Chenglang
    Ma, Kai
    Yu, Shuang
    Wei, Dong
    Zheng, Yefeng
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (05) : 1043 - 1056
  • [29] A Fast Hybrid Model for Large-Scale Zero-Shot Image Recognition Based on Knowledge Graphs
    Xiao, Bo
    Du, Yujiao
    Wu, Q. M. Jonathan
    Xu, Qianfang
    Yan, Liping
    IEEE ACCESS, 2019, 7 : 119309 - 119318
  • [30] Using Large Language Models to Support Thematic Analysis in Empirical Legal Studies
    Drapal, Jakub
    Westermann, Hannes
    Savelka, Jaromir
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 379 : 197 - 206