Lawformer: A pre-trained language model for Chinese legal long documents

被引:140
作者
Xiao, Chaojun [1 ,2 ]
Hu, Xueyu [3 ]
Liu, Zhiyuan [1 ,2 ]
Tu, Cunchao [4 ]
Sun, Maosong [1 ,2 ]
机构
[1] Tsinghua Univ, Inst Artificial Intelligence, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[3] Beihang Univ, Beijing, Peoples R China
[4] Beijing Powerlaw Intelligent Technol Co Ltd, Beijing, Peoples R China
来源
AI OPEN | 2021年 / 2卷
关键词
Pre -trained language model; Legal artificial intelligence;
D O I
10.1016/j.aiopen.2021.06.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Legal artificial intelligence (LegalAI) aims to benefit legal systems with the technology of artificial intelligence, especially natural language processing (NLP). Recently, inspired by the success of pre -trained language models (PLMs) in the generic domain, many LegalAI researchers devote their effort to applying PLMs to legal tasks. However, utilizing PLMs to address legal tasks is still challenging, as the legal documents usually consist of thousands of tokens, which is far longer than the length that mainstream PLMs can process. In this paper, we release the Longformer-based pre -trained language model, named as Lawformer, for Chinese legal long documents understanding. We evaluate Lawformer on a variety of LegalAI tasks, including judgment prediction, similar case retrieval, legal reading comprehension, and legal question answering. The experimental results demonstrate that our model can achieve promising improvement on tasks with long documents as inputs. The code and parameters are available at https://github.com/thunlp/LegalPLMs.
引用
收藏
页码:79 / 84
页数:6
相关论文
共 41 条
[1]  
Alsentzer Emily, 2019, P 2 CLIN NATURAL LAN, P72, DOI [10.18653/v1/W19-1909, DOI 10.18653/V1/W19-1909]
[2]  
Beltagy I, 2020, Arxiv, DOI arXiv:2004.05150
[3]  
Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615
[4]  
Brown TB, 2020, ADV NEUR IN, V33
[5]  
Chalkidis I, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P4317
[6]  
Chalkidis Ilias, 2020, FINDINGS ASS COMPUTA
[7]   A text mining approach to assist the general public in the retrieval of legal documents [J].
Chen, Yen-Liang ;
Liu, Yi-Hung ;
Ho, Wu-Liang .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2013, 64 (02) :280-290
[8]  
Cui YM, 2021, Arxiv, DOI [arXiv:1906.08101, 10.48550/arXiv.1906.08101]
[9]  
Dai ZH, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P2978
[10]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171