Towards Understanding Contracts Grammar: A Large Language Model-based Extractive Question-Answering Approach

Authors
Rejithkumar, Gokul [1]
Anish, Preethu Rose [1]
Ghaisas, Smita [1]
Affiliations
[1] TCS Res, Pune, India
Source
32ND IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE, RE 2024 | 2024
Keywords
text extraction; deep learning; natural language processing; large language models; question-answering; token classification; text-to-text generation; prompting; empirical research
DOI
10.1109/RE59067.2024.00037
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Software Engineering (SE) contracts play a pivotal role in Information Technology Outsourcing (ITO) projects. The obligations in SE contracts are known to be a useful source for deriving software requirements, and so contribute to the overall Software Development Life Cycle (SDLC). Making sense of contractual obligations is an important first step in executing software projects successfully: it supports building compliant systems, meeting delivery deadlines, avoiding heavy penalties, and steering clear of expensive litigation. In this work, we present an approach to capture the essence of a contractual clause by extracting its Contracts Grammar. Through an exploratory study, we first identify the constituents of Contracts Grammar. We then experiment with multiple approaches for the automated extraction of these constituents: extractive question-answering, token classification, text-to-text generation, prompting, and regular expressions. The question-answering-based approach performed best, with a high average ROUGE-L score of 0.81 and faster inference times. The work presented in this paper is part of the Contracts Governance System (CGS) and is in the process of being deployed within a large IT vendor organization.
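For readers unfamiliar with the metric quoted in the abstract, the following is a minimal sketch of how a ROUGE-L score can be computed for one extracted span against a reference span. It is not the authors' evaluation code. It assumes lowercased whitespace tokenization and the balanced F1 variant of ROUGE-L; the function names and the example clause are hypothetical.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(reference, candidate):
    """ROUGE-L F1 between a reference string and a candidate string.

    Assumption: tokens are lowercased and split on whitespace.
    """
    ref = reference.lower().split()
    cand = candidate.lower().split()
    if not ref or not cand:
        return 0.0
    lcs = lcs_length(ref, cand)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)   # share of candidate tokens in the LCS
    recall = lcs / len(ref)       # share of reference tokens in the LCS
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: gold obligation span vs. a shorter extracted span.
gold = "deliver the software within thirty days"
extracted = "deliver the software"
score = rouge_l_f1(gold, extracted)
```

In this example the extracted span is fully contained in the gold span, so precision is 1 while recall is 3/6, which is why a truncated extraction is penalized even though every extracted token is correct.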
Pages: 310-320
Number of pages: 11
Related papers
50 in total
  • [21] MiniMedGPT: Efficient Large Vision-Language Model for medical Visual Question Answering
    Alsabbagh, Abdel Rahman
    Mansour, Tariq
    Al-Kharabsheh, Mohammad
    Ebdah, Abdel Salam
    Al-Emaryeen, Roa'a
    Al-Nahhas, Sara
    Mahafza, Waleed
    Al-Kadi, Omar
    PATTERN RECOGNITION LETTERS, 2025, 189 : 8 - 16
  • [22] Knowledge Graphs Enhanced Large Language Model Prompt for Electric Power Question Answering
    Wang, Chen
    Hua, Min
    Song, Jiale
    Tang, Xuesong
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 24 - 29
  • [23] CPEQA: A Large Language Model Based Knowledge Base Retrieval System for Chinese Confidentiality Knowledge Question Answering
    Cao, Jian
    Cao, Jiuxin
    ELECTRONICS, 2024, 13 (21)
  • [24] A Question-Answering Model Based on Knowledge Graphs for the General Provisions of Equipment Purchase Orders for Steel Plants Maintenance
    Lee, Sang-Hyuk
    Choi, So-Won
    Lee, Eul-Bum
    ELECTRONICS, 2023, 12 (11)
  • [25] Natural Language Processing Based Question Answering Using Vector Space Model
    Jayashree, R.
    Niveditha, N.
    PROCEEDINGS OF SIXTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2016, VOL 2, 2017, 547 : 368 - 375
  • [26] Question Answering with Character-Level LSTM Encoders and Model-Based Data Augmentation
    Wang, Run-Ze
    Zhan, Chen-Di
    Ling, Zhen-Hua
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2017, 2017, 10565 : 295 - 305
  • [27] A question understanding model based on knowledge points for Chinese question answering service in e-learning
    Wu, Zheng-Hong
    Li, Ming
    Feng, Huamin
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 493 - +
  • [28] A large language model-based manufacturing process planning approach under industry 5.0
    Ni, Mingzhe
    Wang, Tao
    Leng, Jiewu
    Chen, Chong
    Cheng, Lianglun
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2025,
  • [29] A Pre-trained Language Model for Medical Question Answering Based on Domain Adaption
    Liu, Lang
    Ren, Junxiang
    Wu, Yuejiao
    Song, Ruilin
    Cheng, Zhen
    Wang, Sibo
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 216 - 227
  • [30] A question answering system for assembly process of wind turbines based on multi-modal knowledge graph and large language model
    Hu, Zhiqiang
    Li, Xinyu
    Pan, Xinyu
    Wen, Sijie
    Bao, Jinsong
    JOURNAL OF ENGINEERING DESIGN, 2023,