Bringing order into the realm of Transformer-based language models for artificial intelligence and law

被引:10
作者
Greco, Candida M. [1 ]
Tagarelli, Andrea [1 ]
机构
[1] Univ Calabria, Dept Comp Engn Modeling Elect & Syst Engn DIMES, I-87036 Arcavacata Di Rende, CS, Italy
关键词
Language models; BERT; GPT; Legal search; Legal document review; Legal outcome prediction; Retrieval; Entailment; Inference; Caselaw data; Statutory law data; Benchmarks; AI for law;
D O I
10.1007/s10506-023-09374-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer-based language models (TLMs) have widely been recognized to be a cutting-edge technology for the successful development of deep-learning-based solutions to problems and applications that require natural language processing and understanding. Like for other textual domains, TLMs have indeed pushed the state-of-the-art of AI approaches for many tasks of interest in the legal domain. Despite the first Transformer model being proposed about six years ago, there has been a rapid progress of this technology at an unprecedented rate, whereby BERT and related models represent a major reference, also in the legal domain. This article provides the first systematic overview of TLM-based methods for AI-driven problems and tasks in the legal sphere. A major goal is to highlight research advances in this field so as to understand, on the one hand, how the Transformers have contributed to the success of AI in supporting legal processes, and on the other hand, what are the current limitations and opportunities for further research development.
引用
收藏
页码:863 / 1010
页数:148
相关论文
共 323 条
  • [1] Aguiar A, 2021, ANAIS X BRAZILIAN C, P586
  • [2] Ahmad WU, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P4402
  • [3] Ahmad WU, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, P743
  • [4] COLIEE 2020: Legal Information Retrieval and Entailment with Legal Embeddings and Boosting
    Alberts, Houda
    Ipek, Akin
    Lucas, Roderick
    Wozny, Phillip
    [J]. NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2020, 2021, 12758 : 211 - 225
  • [5] Predicting judicial decisions of the European Court of Human Rights: a Natural Language Processing perspective
    Aletras, Nikolaos
    Tsarapatsanis, Dimitrios
    Preotiuc-Pietro, Daniel
    Lampos, Vasileios
    [J]. PEERJ COMPUTER SCIENCE, 2016, PeerJ Inc. (2016):
  • [6] Allan James, 2017, NIST Special Publication, V500-324
  • [7] Privacy Policies over Time: Curation and Analysis of a Million-Document Dataset
    Amos, Ryan
    Acar, Gunes
    Lucherini, Elena
    Kshirsagar, Mihir
    Narayanan, Arvind
    Mayer, Jonathan
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 2165 - 2176
  • [8] [Anonymous], 2017, P 2017 C EMP METH NA, DOI DOI 10.18653/V1/D17-1110
  • [9] [Anonymous], 1998, P 21 ANN INT ACM SIG, DOI 10.1145/290941.291008
  • [10] Antoun W, 2021, Arxiv, DOI arXiv:2003.00104