Summarizing judicial documents: a hybrid extractive- abstractive model with legal domain knowledge

被引:0
作者
Gao, Yan [1 ]
Wu, Jie [1 ]
Liu, Zhengtao [1 ]
Li, Juan [2 ]
机构
[1] Cent South Univ, Sch Automat, Changsha 410006, Peoples R China
[2] Cent South Univ, Law Sch, Changsha 410006, Peoples R China
关键词
Extractive summarization; Abstractive summarization; Rhetorical role; Contrastive learning; Prior knowledge;
D O I
10.1007/s10506-025-09435-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The automatic summarization of judgment documents is a challenging task due to their length and the dispersed nature of the important information they contain. The prevailing approach to tackling the summarization of lengthy documents involves the integration of both extractive and abstractive summarization models. However, current extractive models face challenges in capturing all essential details due to the scattered distribution of pertinent information within judgment documents. Additionally, the existing abstractive models still grapple with the problem of "hallucinations" which leads to generating inaccurate information. In our work, we proposed a novel hybrid legal summarization method that incorporates legal domain knowledge into both the extractive model and abstractive model. The method consists of two parts: (1) The rhetorical role of sentences is identified by the sentence-level sequence labeling method, and the rhetorical information is integrated into the extractive model based on WoBERT through the conditional normalization to ensure that the identification of key sentences is both precise and complete. (2) The pre-trained model RoFormer is combined with Seq2Seq to construct a long text summarization model, and the prior knowledge in the external resources and the document itself is introduced into the decoding process to improve the faithfulness and coherence of the composed summary. In addition, the contrastive learning strategy is employed during the training process to enhance the robustness of the abstractive model. Experimental results on the CAIL2020 dataset show that the proposed model is superior to the baseline methods. Furthermore, our method outperforms GPT and other LLMs in processing judgment documents.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] LegoNet - classification and extractive summarization of Indian legal judgments with Capsule Networks and Sentence Embeddings
    Acharya, Harshith R.
    Bhat, Aditya D.
    Avinash, K.
    Srinath, Ramamoorthy
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 2037 - 2046
  • [2] Bhattacharya Paheli, 2021, ICAIL '21: Proceedings of the Eighteenth International Conference on Artificial Intelligence and Law, P22, DOI 10.1145/3462757.3466092
  • [3] Bhattacharya P, 2022, P 2 C AS PAC CHAPT A, V1, P1048
  • [4] Cheng JP, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P484
  • [5] Cui J, 2023, Chatlaw: A multi-agent collaborative legal assistant with knowledge graph enhanced mixture-of-experts large language model abs/2306.16092
  • [6] Dan J, 2023, Artificial Intelligence and Law, P1
  • [7] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [8] Dongwook Lee, 2019, arXiv
  • [9] LexRank: Graph-based lexical centrality as salience in text summarization
    Erkan, G
    Radev, DR
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 : 457 - 479
  • [10] Improving abstractive summarization of legal rulings through textual entailment
    Feijo, Diego de Vargas
    Moreira, Viviane P.
    [J]. ARTIFICIAL INTELLIGENCE AND LAW, 2023, 31 (01) : 91 - 113