Extracting Legal Norm Analysis Categories from German Law Texts with Large Language Models

被引:0
|
作者
Bachinger, Sarah T. [1 ,2 ]
Feddoul, Leila [1 ]
Mauch, Marianne [1 ,2 ]
Koenig-Ries, Birgitta [1 ]
机构
[1] Friedrich Schiller Univ Jena, Heinz Nixdorf Chair Distributed Informat Syst, Jena, Germany
[2] Friedrich Schiller Univ Jena, Competence Ctr Digital Res Zedif, Jena, Germany
来源
PROCEEDINGS OF THE 25TH ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH, DGO 2024 | 2024年
关键词
Named Entity Recognition; Large Language Models; Federal Information Management; Digital Transformation; Public Administration;
D O I
10.1145/3657054.3657277
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The digitization of public services in Germany is always based on a legal basis (e.g., laws). In the digitization process, first relevant entities in law documents (e.g., actors) are detected, then a list of possible process steps of their interactions is derived. The final process is constructed and transformed to a digital service for citizens and companies. Today, the discovery of custom entities in German law documents is still manual high effort work. In our study, we investigate the capabilities of Large Language Models (LLMs) to automate this task, choose five LLMs from 61 evaluated candidates, and perform prompt engineering to create five different prompt variants with differing parts. We examine the automatic annotation by two LLMs (LeoLM and BLOOM CLP German) in detail and find that the inclusion of more information in the prompts as well as an increased number of examples per prompt are beneficial. We report micro F1-scores for the optimal scenario of 0.91 for BLOOM CLP German, and 0.82 for LeoLM, with a higher balanced accuracy for LeoLM. The results indicate that LLMs have a good potential to perform named entity recognition, especially for supporting legal norm analysis in the context of the digitization of public administration.
引用
收藏
页码:481 / 493
页数:13
相关论文
共 50 条
  • [41] The Language of Creativity: Evidence from Humans and Large Language Models
    Orwig, William
    Edenbaum, Emma R.
    Greene, Joshua D.
    Schacter, Daniel L.
    JOURNAL OF CREATIVE BEHAVIOR, 2024, 58 (01) : 128 - 136
  • [42] Large language models as tax attorneys: a case study in legal capabilities emergence
    Nay, John J.
    Karamardian, David
    Lawsky, Sarah B.
    Tao, Wenting
    Bhat, Meghana
    Jain, Raghav
    Lee, Aaron Travis
    Choi, Jonathan H.
    Kasai, Jungo
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2024, 382 (2270):
  • [43] From manual to machine: assessing the efficacy of large language models in content analysis
    Pilny, Andrew
    Mcaninch, Kelly
    Slone, Amanda
    Moore, Kelsey
    COMMUNICATION RESEARCH REPORTS, 2024, 41 (02) : 61 - 70
  • [44] Using Large Language Models to Improve Sentiment Analysis in Latvian Language
    Purvins, Pauls
    Urtans, Evalds
    Caune, Vairis
    BALTIC JOURNAL OF MODERN COMPUTING, 2024, 12 (02): : 165 - 175
  • [45] Large Language Models in Targeted Sentiment Analysis for Russian
    Rusnachenko, N.
    Golubev, A.
    Loukachevitch, N.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2024, 45 (07) : 3148 - 3158
  • [46] Large language models in textual analysis for gesture selection
    Hensel, Laura B.
    Yongsatianchot, Nutchanon
    Torshizi, Parisa
    Minucci, Elena
    Marsella, Stacy
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 378 - 387
  • [47] LARGE LANGUAGE MODELS "AD REFERENDUM": HOW GOOD ARE THEY AT MACHINE TRANSLATION IN THE LEGAL DOMAIN?
    Briva-Iglesias, Vicent
    Camargo, Joao Lucas Cavalheiro
    Dogru, Gokhan
    MONTI, 2024, 16 : 75 - 107
  • [48] LARGE LANGUAGE MODELS "AD REFERENDUM": HOW GOOD ARE THEY AT MACHINE TRANSLATION IN THE LEGAL DOMAIN?
    Briva-Iglesias, Vicent
    Camargo, Joao Lucas Cavalheiro
    Dogru, Gokhan
    MONTI, 2024, 16 : 75 - 107
  • [49] Can Large Language Models Assist in Hazard Analysis?
    Diemert, Simon
    Weber, Jens H.
    COMPUTER SAFETY, RELIABILITY, AND SECURITY, SAFECOMP 2023 WORKSHOPS, 2023, 14182 : 410 - 422
  • [50] Application of large language models in healthcare: A bibliometric analysis
    Zhang, Lanping
    Zhao, Qing
    Zhang, Dandan
    Song, Meijuan
    Zhang, Yu
    Wang, Xiufen
    DIGITAL HEALTH, 2025, 11