RETRACTED: Legal Text Recognition Using LSTM-CRF Deep Learning Model (Retracted Article)

被引:9
作者
Xu, Hesheng [1 ]
Hu, Bin [1 ]
机构
[1] Zhejiang Univ City Coll, Dept Law, Hangzhou 310015, Peoples R China
关键词
NAMED ENTITY RECOGNITION;
D O I
10.1155/2022/9933929
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In legal texts, named entity recognition (NER) is researched using deep learning models. First, the bidirectional (Bi)-long short-term memory (LSTM)-conditional random field (CRF) model for studying NER in legal texts is established. Second, different annotation methods are used to compare and analyze the entity recognition effect of the Bi-LSTM-CRF model. Finally, other objective loss functions are set to compare and analyze the entity recognition effect of the Bi-LSTM-CRF model. The research results show that the F1 value of the model trained on the word sequence labeling corpus on the named entity is 88.13%, higher than that of the word sequence labeling corpus. For the two types of entities, place names and organization names, the F1 values obtained by the Bi-LSTM-CRF model using word segmentation are 67.60% and 89.45%, respectively, higher than the F1 values obtained by the model using character segmentation. Therefore, the Bi-LSTM-CRF model using word segmentation is more suitable for recognizing extended entities. The parameter learning result using log-likelihood is better than that using the maximum interval criterion, and it is ideal for the Bi-LSTM-CRF model. This method provides ideas for the research of legal text recognition and has a particular value.
引用
收藏
页数:10
相关论文
共 28 条
  • [1] BE-BLC: BERT-ELMO-Based Deep Neural Network Architecture for English Named Entity Recognition Task
    Affi, Manel
    Latiri, Chiraz
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 168 - 181
  • [2] A set of benchmarks for Handwritten Text Recognition on historical documents
    Andreu Sanchez, Joan
    Romero, Veronica
    Toselli, Alejandro H.
    Villegas, Mauricio
    Vidal, Enrique
    [J]. PATTERN RECOGNITION, 2019, 94 : 122 - 134
  • [3] A neural model for text localization, transcription and named entity recognition in full pages
    Carbonell, Manuel
    Fornes, Alicia
    Villegas, Mauricio
    Llados, Josep
    [J]. PATTERN RECOGNITION LETTERS, 2020, 136 : 219 - 227
  • [4] Combining contextualized word representation and sub-document level analysis through Bi-LSTM plus CRF architecture for clinical de-identification
    Catelli, Rosario
    Casola, Valentina
    De Pietro, Giuseppe
    Fujita, Hamido
    Esposito, Massimo
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 213
  • [5] A comparative study of automated legal text classification using random forests and deep learning
    Chen, Haihua
    Wu, Lei
    Chen, Jiangping
    Lu, Wei
    Ding, Junhua
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (02)
  • [6] Time series forecasting of COVID-19 transmission in Canada using LSTM networks
    Chimmula, Vinay Kumar Reddy
    Zhang, Lei
    [J]. CHAOS SOLITONS & FRACTALS, 2020, 135
  • [7] A Deep Learning Model for Estimating Story Points
    Choetkiertikul, Morakot
    Hoa Khanh Dam
    Truyen Tran
    Trang Pham
    Ghose, Aditya
    Menzies, Tim
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2019, 45 (07) : 637 - 656
  • [8] Stacked bidirectional and unidirectional LSTM recurrent neural network for forecasting network-wide traffic state with missing values
    Cui, Zhiyong
    Ke, Ruimin
    Pu, Ziyuan
    Wang, Yinhai
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 118
  • [9] Interpretable spatio-temporal attention LSTM model for flood forecasting
    Ding, Yukai
    Zhu, Yuelong
    Feng, Jun
    Zhang, Pengcheng
    Cheng, Zirun
    [J]. NEUROCOMPUTING, 2020, 403 : 348 - 359
  • [10] Ghiasvand Omid, 2018, Informatics in Medicine Unlocked, V13, P122, DOI 10.1016/j.imu.2018.10.011