Extracting Legal Norm Analysis Categories from German Law Texts with Large Language Models

被引：0

作者：

Bachinger, Sarah T. ^{[1
,2
]}

Feddoul, Leila ^{[1
]}

Mauch, Marianne ^{[1
,2
]}

Koenig-Ries, Birgitta ^{[1
]}

机构：

[1] Friedrich Schiller Univ Jena, Heinz Nixdorf Chair Distributed Informat Syst, Jena, Germany

[2] Friedrich Schiller Univ Jena, Competence Ctr Digital Res Zedif, Jena, Germany

来源：

PROCEEDINGS OF THE 25TH ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH, DGO 2024 | 2024年

关键词：

Named Entity Recognition; Large Language Models; Federal Information Management; Digital Transformation; Public Administration;

D O I：

10.1145/3657054.3657277

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The digitization of public services in Germany is always based on a legal basis (e.g., laws). In the digitization process, first relevant entities in law documents (e.g., actors) are detected, then a list of possible process steps of their interactions is derived. The final process is constructed and transformed to a digital service for citizens and companies. Today, the discovery of custom entities in German law documents is still manual high effort work. In our study, we investigate the capabilities of Large Language Models (LLMs) to automate this task, choose five LLMs from 61 evaluated candidates, and perform prompt engineering to create five different prompt variants with differing parts. We examine the automatic annotation by two LLMs (LeoLM and BLOOM CLP German) in detail and find that the inclusion of more information in the prompts as well as an increased number of examples per prompt are beneficial. We report micro F1-scores for the optimal scenario of 0.91 for BLOOM CLP German, and 0.82 for LeoLM, with a higher balanced accuracy for LeoLM. The results indicate that LLMs have a good potential to perform named entity recognition, especially for supporting legal norm analysis in the context of the digitization of public administration.

引用

页码：481 / 493

页数：13

共 50 条

[31] Large language models to process, analyze, and synthesize biomedical texts: a scoping review
Doneva, Simona Emilova
Qin, Sijing
Sick, Beate
Ellendorff, Tilia
Goldman, Jean-Philippe
Schneider, Gerold
Ineichen, Benjamin Victor
Discover Artificial Intelligence, 2024, 4 (01):
[32] How to train your stochastic parrot: large language models for political texts
Ornstein, Joseph T.
Blasingame, Elise N.
Truscott, Jake S.
POLITICAL SCIENCE RESEARCH AND METHODS, 2025,
[33] Extracting Implicit User Preferences in Conversational Recommender Systems Using Large Language Models
Kim, Woo-Seok
Lim, Seongho
Kim, Gun-Woo
Choi, Sang-Min
MATHEMATICS, 2025, 13 (02)
[34] Evolution and Prospects of Foundation Models: From Large Language Models to Large Multimodal Models
Chen, Zheyi
Xu, Liuchang
Zheng, Hongting
Chen, Luyao
Tolba, Amr
Zhao, Liang
Yu, Keping
Feng, Hailin
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 1753 - 1808
[35] Trend Analysis Through Large Language Models
Alzapiedi, Lucas
Bihl, Trevor
IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE, NAECON 2024, 2024, : 370 - 374
[36] Towards Taming Large Language Models with Prompt Templates for Legal GRL Modeling
de Kinderen, Sybren
Winter, Karolin
ENTERPRISE, BUSINESS-PROCESS AND INFORMATION SYSTEMS MODELING, BPMDS 2024, EMMSAD 2024, 2024, 511 : 213 - 228
[37] Boosting legal case retrieval by query content selection with large language models
Zhou, Youchao
Huang, Heyan
Wu, Zhijing
ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL IN THE ASIA PACIFIC REGION, SIGIR-AP 2023, 2023, : 176 - 184
[38] Legal aspects of generative artificial intelligence and large language models in examinations and theses
Maerz, Maren
Himmelbauer, Monika
Boldt, Kevin
Oksche, Alexander
GMS JOURNAL FOR MEDICAL EDUCATION, 2024, 41 (04):
[39] A Framework for Enhancing Statute Law Retrieval Using Large Language Models
Pham, Trang Ngoc Anh
Do, Dinh-Truong
Nguyen, Minh Le
NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2024, 2024, 14741 : 247 - 259
[40] A comparative evaluation of the effectiveness of document splitters for large language models in legal contexts
Plonka, Mateusz
Kocot, Krzysztof
Holda, Kacper
Daniec, Krzysztof
Nawrat, Aleksander
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272

← 1 2 3 4 5 →