A RoBERTa-GlobalPointer-Based Method for Named Entity Recognition of Legal Documents

被引:0
|
作者
Zhang, Xinrui [1 ]
Luo, Xudong [1 ]
Wu, Jiaye [1 ]
机构
[1] Guangxi Normal Univ, Sch Comp Sci & Engn, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
关键词
Information Extraction; Nested Entity Recognition; Legal Intelligence; Deep Learning;
D O I
10.1109/IJCNN54540.2023.10191275
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In legal practice, judicial professionals often need to extract useful information from numerous legal documents. The key to legal information extraction is the Named Entity Recognition (NER) of legal documents. To address three critical issues in NER of legal documents, this paper proposes a RoBERTa-GlobalPointer-based method for NER of legal documents. Specifically, we first use RoBERTa (a variant of the pre-trained language model BERT) to extract char-level feature representations of a legal document, and use the Skip-Gram method to extract its word-level feature representations, and fuse them to better capture the contextual information of entities in the document. Then, according to the concatenated result, we use the GlobalPointer method to calculate the score of each subsequence of the document, to which it is an entity of a certain type. Finally, we employ the balanced softmax function to determine whether or not a subsequence of the document is an entity of a certain type according to its score calculated by GlobalPointer. Our evaluation experiments on the Chinese judicial domain dataset show that the proposed method outperforms the state-of-the-art baseline methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] A Dataset of German Legal Documents for Named Entity Recognition
    Leitner, Elena
    Rehm, Georg
    Moreno-Schneider, Julian
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4478 - 4485
  • [2] Named Entity Recognition Method of Chinese Legal Documents Based on Parallel Instance Query Network
    Lu, Rui
    Li, Linying
    International Journal of Digital Crime and Forensics, 2024, 16 (01)
  • [3] A Named Entity Recognition Method Based on Knowledge Distillation and Efficient GlobalPointer for Chinese Medical Texts
    Zhai, Zhengwei
    Fan, Rongli
    Huang, Jie
    Xiong, Neal N.
    Zhang, Lijuan
    Wan, Jian
    Zhang, Lei
    IEEE ACCESS, 2024, 12 : 83563 - 83574
  • [4] Chinese Fine-Grained Named Entity Recognition Based on BILTAR and GlobalPointer Modules
    Li, Weijun
    Liu, Jintong
    Gao, Yuxiao
    Zhang, Xinyong
    Gu, Jianlai
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [5] Named entity recognition in Vietnamese documents
    Tri Tran, Q.
    Thao Pham, T.X.
    Hung Ngo, Q.
    Dinh, Dien
    Collier, Nigel
    Progress in Informatics, 2007, (04): : 5 - 13
  • [6] On the Assessment of Deep Learning Models for Named Entity Recognition of Brazilian Legal Documents
    Albuquerque, Hidelberg O.
    Souza, Ellen
    Oliveira, Adriano L. I.
    Macedo, David
    Zanchettin, Cleber
    Vitorio, Douglas
    da Silva, Nadia F. F.
    de Carvalho, Andre C. P. L. F.
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT II, 2023, 14116 : 93 - 104
  • [7] Enhancing Legal Named Entity Recognition Using RoBERTa-GCN with CRF: A Nuanced Approach for Fine-Grained Entity Recognition
    Jain, Arihant
    Sharma, Raksha
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT III, 2024, 14610 : 261 - 267
  • [8] Named Entity Recognition for Tamil Biomedical Documents
    Antony, Betina J.
    Mahalakshmi, G. S.
    2014 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT-2014), 2014, : 1571 - 1577
  • [9] Arabic named entity recognition in crime documents
    Asharef, M.
    Omar, N.
    Albared, M.
    Journal of Theoretical and Applied Information Technology, 2012, 44 (01) : 1 - 6
  • [10] Evaluation of Named Entity Recognition in Handwritten Documents
    Villanova-Aparisi, David
    Martinez-Hinarejos, Carlos-D
    Romero, Veronica
    Pastor-Gadea, Moises
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 568 - 582