A Pragmatic Approach to Semantic Annotation for Search of Legal Texts - An Experiment on GDPR

被引:1
作者
Nazarenko, Adeline [1 ]
Levy, Francois [1 ]
Wyner, Adam [2 ]
机构
[1] Univ Sorbonne Paris Nord, LIPN, Villetaneuse, France
[2] Swansea Univ, Dept Comp Sci, Swansea, W Glam, Wales
来源
LEGAL KNOWLEDGE AND INFORMATION SYSTEMS | 2021年 / 346卷
关键词
Text annotation; Semantic search; Annotation methodology; Semantic markup language; Law;
D O I
10.3233/FAIA210313
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tools must be developed to help draft, consult, and explore textual legal sources. Between statistical information retrieval and the formalization of textual rules for automated legal reasoning, we defend a more pragmatic third way that enriches legal texts with a coarse-grained, interpretation-neutral, semantic annotation layer. The aim is that legal texts can be enriched on a large scale at a reasonable cost, paving the way for new search capabilities that will facilitate mining of legal sources. This new approach is illustrated on a proof-of-concept experiment that consisted in semantically annotating a significant part of the French version of the GDPR. The paper presents the design methodology of the annotation language, a first version of a Core Legal Annotation Language (CLAL), together with its formalization in XML, the gold standard resulting from the annotation of GDPR, and examples of user questions that can be better answered by semantic than by plain text search. This experimentation demonstrates the potential of the proposed approach and provides a basis for further development. All resources developed for that GDPR experiment are language independent and are publicly available.
引用
收藏
页码:23 / 32
页数:10
相关论文
共 17 条
[1]   LegalRuleML: Design Principles and Foundations [J].
Athan, Tara ;
Governatori, Guido ;
Palmirani, Monica ;
Paschke, Adrian ;
Wyner, Adam .
REASONING WEB: WEB LOGIC RULES, 2015, 9203 :151-188
[2]  
Barabucci Gioele, 2010, AI Approaches to the Complexity of Legal Systems. Complex Systems, the Semantic Web, Ontologies, Argumentation, and Dialogue. Revised Selected Papers, P133, DOI 10.1007/978-3-642-16524-5_9
[3]  
Bartolini C, 2015, Language and Semantic Technology for Legal Domain, V8, P8
[4]  
Fort K., 2016, Collaborative Annotation for Reliable Natural Language Processing: Technical and Sociological Aspects
[5]  
Group WOC, 2018, Open Digital Rights Language: Vocabulary & Expression 2.2
[6]  
Hoekstra Rinke, 2007, LOAIT, V321, P43
[7]  
Libal Tomer, 2020, Logic and Argumentation. Third International Conference, CLAR 2020. Proceedings. Lecture Notes in Artificial Intelligence (LNAI 12061), P131, DOI 10.1007/978-3-030-44638-3_9
[8]  
Libal T, 2019, PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND LAW, ICAIL 2019, P262, DOI 10.1145/3322640.3326721
[9]   Concept and Context in Legal Information Retrieval [J].
Maxwell, K. Tamsin ;
Schafer, Burkhard .
LEGAL KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 189 :63-+
[10]   Towards a Methodology for Formalizing Legal Texts in LegalRuleML [J].
Nazarenko, Adeline ;
Levy, Francois ;
Wyner, Adam .
LEGAL KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 294 :149-154