Identification of Paragraph Regularities in Legal Judgements Through Clustering and Textual Embedding

被引：0

作者：

De Martino, Graziella ^{[1
]}

Pio, Gianvito ^{[1
,2
]}

机构：

[1] Univ Bari Aldo Moro, Dept Comp Sci, Via Orabona 4, I-70125 Bari, Italy

[2] Natl Interuniv Consortium Informat, Big Data Lab, Via Ariosto 25, I-00185 Rome, Italy

来源：

FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2022) | 2022年 / 13515卷

关键词：

Legal information retrieval; Embedding; Clustering; Approximate nearest neighbor search;

D O I：

10.1007/978-3-031-16564-1_8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In an era characterized by fast technological progresses, working in the law field is very difficult if not supported by the right tools. In this paper, we present a novel method, called JPReg, that identifies paragraph regularities in legal case judgments to support legal experts during the preparation of new legal documents (i.e., paragraphs of existing documents that are similar to those of a document under preparation). JPReg adopts a two-step approach that first clusters similar documents, according to their semantic content, and then identifies regularities in the paragraphs for each cluster. Text embedding methods are adopted to represent documents and paragraphs into a numerical feature space, and an Approximated Nearest Neighbor Search method is adopted to efficiently retrieve the most similar paragraphs with respect to those of a target document. Our extensive experimental evaluation, performed on a real-world dataset, shows the effectiveness and the computational efficiency of the proposed method even in presence of noise in the data.

引用

页码：74 / 84

页数：11