Semi-supervised, knowledge-integrated pattern learning approach for fact extraction from judicial text

被引:9
作者
Thomas, Anu [1 ]
Sangeetha, Sivanesan [1 ]
机构
[1] Natl Inst Technol, Dept Comp Applicat, Text Analyt & NLP Lab, Tiruchirappalli, Tamil Nadu, India
关键词
domain adaptability; domain‐ specific fact extraction; e‐ judgements; information extraction; judicial ontology; judicial text; knowledge‐ integrated; natural language processing; semantic processing; semi‐ supervised learning approach; INFORMATION EXTRACTION; FRAMEWORK;
D O I
10.1111/exsy.12656
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tremendous growth in the availability of judicial documents has demanded the rise of information extraction (IE) techniques that support the automatic extraction of relevant concepts or data from judicial texts. Among various approaches available for IE, ontology-based IE has proven to be the most appropriate for extracting domain-specific information from natural language text. Through this article, we propose a knowledge-driven, semi-supervised pattern-based learning (bootstrapping) approach to extract domain-specific facts from judicial text, starting with a small set of seed facts. In the semantic analysis of legal text, fact extraction is the next step to entity identification, which involves the identification of roles played by each entity in the judicial text. The proposed methodology learns extraction patterns for 12 classes of facts from the judicial text through the integration of the domain ontology called judicial case ontology (JCO). The experimental results were evaluated by human judges and found to be quite promising. One main feature of the proposed methodology is its portability across various domains (such as medical, banking, insurance, etc.) which in turn helps build expert systems in various sectors.
引用
收藏
页数:20
相关论文
共 57 条
[1]  
Andrew JJ, 2018, NAMED ENTITIES, P1
[2]  
[Anonymous], 2006, P 2 WORKSH ONT LEARN
[3]  
[Anonymous], 2010, P 3 ACM INT C WEB SE, DOI 10.1145/ 1718487.1718501
[4]  
Ashley KD, 1997, INT JOINT CONF ARTIF, P335
[5]  
Banko M, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2670
[6]  
Bird S., 2004, P ACL 02 WORKSH EFF, P214, DOI [DOI 10.3115/1225403.1225421, 10.3115/1118108.1118117, DOI 10.3115/1118108.1118117]
[7]  
Blok HE, 1997, THESIS
[8]  
Brin S, 1999, LECT NOTES COMPUT SC, V1590, P172
[9]  
Buey M. G., 2016, P ICAART 16, P438
[10]  
Cafarella M. J., 2009, 4 BIENN C INN DAT SY