An Ontology-Based and Deep Learning-Driven Method for Extracting Legal Facts from Chinese Legal Texts

被引:5
作者
Ren, Yong [1 ]
Han, Jinfeng [1 ]
Lin, Yingcheng [1 ]
Mei, Xiujiu [1 ]
Zhang, Ling [2 ]
机构
[1] Chongqing Univ, Sch Microelect & Commun Engn, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Sch Publ Policy & Adm, Chongqing 400044, Peoples R China
关键词
information extraction; ontology; BERT; Bi-LSTM; CRF; Chinese legal texts; INFORMATION EXTRACTION;
D O I
10.3390/electronics11121821
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The construction of smart courts promotes the in-deep integration of internet, big data, cloud computing and artificial intelligence with judicial trial work, which can both improve trials and ensure judicial justice with more efficiency. High-quality structured legal facts, obtained by extracting information from unstructured legal texts, are the foundation for the construction of smart courts. Based on the strong normative characteristics of Chinese legal text content and structure composition and the strong text feature learning ability of deep learning, this paper proposes an ontology-based and deep learning-driven method for extracting legal facts from Chinese legal texts. The proposed method utilizes rules and patterns generated in the process of knowledge modeling to extract simple entities, and then extracts complex entities hidden in legal text details with deep learning methods. Finally, the extracted entities are mapped into structured legal facts with clear logical relationships by the Chinese Legal Text Ontology. In the information extraction test of judicial datasets composed of Chinese legal texts on theft, the proposed method effectively extracts up to 38 categories of legal facts from legal texts and the number of categories extracted increases significantly. Among them, the rule-based extractor obtains an F1-score of 99.70%, and the deep learning-driven extractor obtains an F1-score of 91.43%. Compared with existing methods, the proposed method has great advantages in extracting the completeness and accuracy of legal facts.
引用
收藏
页数:18
相关论文
共 34 条
[21]  
Musen Mark A, 2015, AI Matters, V1, P4
[22]   Reference Extraction from Vietnamese Legal Documents [J].
Ngo Xuan Bach ;
Nguyen Thi Thanh Thuy ;
Dang Bao Chien ;
Trieu Khuong Duy ;
To Minh Hien ;
Tu Minh Phuong .
SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, :486-493
[23]  
Bach NX, 2017, INT CONF KNOWL SYS, P154, DOI 10.1109/KSE.2017.8119451
[24]  
Nuranti EQ, 2020, INT C ADV COMP SCI I, P429, DOI [10.1109/icacsis51025.2020.9263157, 10.1109/ICACSIS51025.2020.9263157]
[25]  
Rao X., 2018, ARXIV
[26]  
Solihin F, 2018, INT C ADV COMP SCI I, P349, DOI 10.1109/ICACSIS.2018.8618187
[27]  
Staudemeyer R. C., 2019, arXiv
[28]  
Thomas A., 2017, 6 INT C REC TRENDS I
[29]   Semi-supervised, knowledge-integrated pattern learning approach for fact extraction from judicial text [J].
Thomas, Anu ;
Sangeetha, Sivanesan .
EXPERT SYSTEMS, 2021, 38 (03)
[30]   Knowledge graphs as tools for explainable machine learning: A survey [J].
Tiddi, Ilaria ;
Schlobach, Stefan .
ARTIFICIAL INTELLIGENCE, 2022, 302