Extracting Legal Norm Analysis Categories from German Law Texts with Large Language Models

被引:0
|
作者
Bachinger, Sarah T. [1 ,2 ]
Feddoul, Leila [1 ]
Mauch, Marianne [1 ,2 ]
Koenig-Ries, Birgitta [1 ]
机构
[1] Friedrich Schiller Univ Jena, Heinz Nixdorf Chair Distributed Informat Syst, Jena, Germany
[2] Friedrich Schiller Univ Jena, Competence Ctr Digital Res Zedif, Jena, Germany
来源
PROCEEDINGS OF THE 25TH ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH, DGO 2024 | 2024年
关键词
Named Entity Recognition; Large Language Models; Federal Information Management; Digital Transformation; Public Administration;
D O I
10.1145/3657054.3657277
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The digitization of public services in Germany is always based on a legal basis (e.g., laws). In the digitization process, first relevant entities in law documents (e.g., actors) are detected, then a list of possible process steps of their interactions is derived. The final process is constructed and transformed to a digital service for citizens and companies. Today, the discovery of custom entities in German law documents is still manual high effort work. In our study, we investigate the capabilities of Large Language Models (LLMs) to automate this task, choose five LLMs from 61 evaluated candidates, and perform prompt engineering to create five different prompt variants with differing parts. We examine the automatic annotation by two LLMs (LeoLM and BLOOM CLP German) in detail and find that the inclusion of more information in the prompts as well as an increased number of examples per prompt are beneficial. We report micro F1-scores for the optimal scenario of 0.91 for BLOOM CLP German, and 0.82 for LeoLM, with a higher balanced accuracy for LeoLM. The results indicate that LLMs have a good potential to perform named entity recognition, especially for supporting legal norm analysis in the context of the digitization of public administration.
引用
收藏
页码:481 / 493
页数:13
相关论文
共 50 条
  • [31] Large language models to process, analyze, and synthesize biomedical texts: a scoping review
    Doneva, Simona Emilova
    Qin, Sijing
    Sick, Beate
    Ellendorff, Tilia
    Goldman, Jean-Philippe
    Schneider, Gerold
    Ineichen, Benjamin Victor
    Discover Artificial Intelligence, 2024, 4 (01):
  • [32] How to train your stochastic parrot: large language models for political texts
    Ornstein, Joseph T.
    Blasingame, Elise N.
    Truscott, Jake S.
    POLITICAL SCIENCE RESEARCH AND METHODS, 2025,
  • [33] Extracting Implicit User Preferences in Conversational Recommender Systems Using Large Language Models
    Kim, Woo-Seok
    Lim, Seongho
    Kim, Gun-Woo
    Choi, Sang-Min
    MATHEMATICS, 2025, 13 (02)
  • [34] Evolution and Prospects of Foundation Models: From Large Language Models to Large Multimodal Models
    Chen, Zheyi
    Xu, Liuchang
    Zheng, Hongting
    Chen, Luyao
    Tolba, Amr
    Zhao, Liang
    Yu, Keping
    Feng, Hailin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 1753 - 1808
  • [35] Trend Analysis Through Large Language Models
    Alzapiedi, Lucas
    Bihl, Trevor
    IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE, NAECON 2024, 2024, : 370 - 374
  • [36] Towards Taming Large Language Models with Prompt Templates for Legal GRL Modeling
    de Kinderen, Sybren
    Winter, Karolin
    ENTERPRISE, BUSINESS-PROCESS AND INFORMATION SYSTEMS MODELING, BPMDS 2024, EMMSAD 2024, 2024, 511 : 213 - 228
  • [37] Boosting legal case retrieval by query content selection with large language models
    Zhou, Youchao
    Huang, Heyan
    Wu, Zhijing
    ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL IN THE ASIA PACIFIC REGION, SIGIR-AP 2023, 2023, : 176 - 184
  • [38] Legal aspects of generative artificial intelligence and large language models in examinations and theses
    Maerz, Maren
    Himmelbauer, Monika
    Boldt, Kevin
    Oksche, Alexander
    GMS JOURNAL FOR MEDICAL EDUCATION, 2024, 41 (04):
  • [39] A Framework for Enhancing Statute Law Retrieval Using Large Language Models
    Pham, Trang Ngoc Anh
    Do, Dinh-Truong
    Nguyen, Minh Le
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2024, 2024, 14741 : 247 - 259
  • [40] A comparative evaluation of the effectiveness of document splitters for large language models in legal contexts
    Plonka, Mateusz
    Kocot, Krzysztof
    Holda, Kacper
    Daniec, Krzysztof
    Nawrat, Aleksander
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272