Effective deep learning approaches for summarization of legal texts

被引:38
作者
Anand, Deepa [1 ]
Wagh, Rupali [2 ]
机构
[1] CMR Inst Technol, Bangalore 560037, Karnataka, India
[2] JAIN Deemed Univ, Bangalore 560004, Karnataka, India
关键词
Legal text summarization; Natural language processing; Deep learning; Sentence embeddings;
D O I
10.1016/j.jksuci.2019.11.015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The availability of legal judgment documents in digital form offers numerous opportunities for information extraction and application. Automatic summarization of these legal texts is a crucial and a challenging task due to the unusual structure and high complexity of these documents. Previous approaches in this direction have relied on huge labelled datasets, using hand engineered features, leveraging on domain knowledge and focussed their attention on a narrow sub-domain for increased effectiveness. In this paper, we propose simple generic techniques using neural network for the summarization task for Indian legal judgment documents. We explore two neural network architectures for this task utilizing the word and sentence embeddings for capturing the semantics. The main advantage of the proposed approaches is that they do not rely on hand crafted features, or domain specific knowledge, nor is their application restricted to a particular sub-domain thus making them suitable to be extended to other domains as well. We tackle the problem of unavailability of labelled data for the task by assigning classes/scores to sentences in the training set, based on their match with reference summary produced by humans. The experimental evaluations establish the effectiveness of our proposed approaches as compared with other baselines. (C) 2019 The Authors. Published by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:2141 / 2150
页数:10
相关论文
共 28 条
  • [1] Alguliev RM, 2007, APPL COMPUT MATH-BAK, V6, P278
  • [2] Allahyari M, 2017, Arxiv, DOI arXiv:1707.02268
  • [3] [Anonymous], FULL PYTHON IMPLEMEN
  • [4] [Anonymous], Understanding LSTM Networks
  • [5] [Anonymous], The Python Deep Learning library
  • [6] [Anonymous], GEN PURPOSE UNSUPERV
  • [7] [Anonymous], LONG SHORT TERM MEMO
  • [8] [Anonymous], TOKENIZER SPACY API
  • [9] [Anonymous], INFERSENT SENTENCE E
  • [10] A literature review on question answering techniques, paradigms and systems
    Calijorne Soares, Marco Antonio
    Parreiras, Fernando Silva
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2020, 32 (06) : 635 - 646