A support system for the detection of abusive clauses in B2C contracts

Cited by: 1
Authors
Dadas, Slawomir [1]
Kozlowski, Marek [1]
Poswiata, Rafal [1]
Perelkiewicz, Michal [1]
Bialas, Marcin [1]
Grebowiec, Malgorzata [1]
Affiliations
[1] National Information Processing Institute, Al. Niepodleglosci 188b, Warsaw, Poland
Keywords
B2C contracts; Natural language processing; Machine learning; Neural networks; TERMS;
DOI
10.1007/s10506-024-09408-8
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Many countries employ systemic methods of protecting consumers from unfair business practices. One such practice is the use of abusive clauses in business-to-consumer (B2C) contracts, which unfairly impose additional obligations on consumers or deprive them of their due rights. This article presents an information system that uses artificial intelligence methods to automate contract analysis and detect abusive clauses. The goal of the system is to support the entire administrative process, from contract acquisition, through text extraction and the recommendation of potentially abusive clauses, to the generation of official administrative documents that can be sent to court or to the owners of firms. This article focuses on the components that use machine learning methods. The first is an intelligent crawler that automatically detects contract templates on websites and retrieves them into the system. The second is a document analysis module that implements a clause recommendation algorithm. The algorithm employs transformer-based language models and information retrieval methods to identify abusive passages in text. Our solution achieved first place in a competition on the automatic analysis of B2C contracts organized by the Polish Office of Competition and Consumer Protection (UOKiK), and has since been implemented as an official tool to support the contract analysis process in Poland.
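The retrieval side of clause recommendation can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract mentions transformer-based language models, whereas this example substitutes plain bag-of-words cosine similarity so that it runs with the standard library alone. The function names (`tokenize`, `cosine`, `recommend`), the `threshold` value, and the idea of comparing passages against a list of previously known abusive clauses are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(passages, known_clauses, threshold=0.5):
    """Return (passage, closest known clause, score) for passages whose
    best match reaches the threshold, sorted by descending score.

    Hypothetical helper: bag-of-words vectors stand in for the
    transformer embeddings a real system would use.
    """
    known_vecs = [Counter(tokenize(c)) for c in known_clauses]
    if not known_vecs:
        return []
    hits = []
    for passage in passages:
        vec = Counter(tokenize(passage))
        # Pick the known clause with the highest similarity to this passage.
        best_score, best_idx = max(
            (cosine(vec, kv), i) for i, kv in enumerate(known_vecs)
        )
        if best_score >= threshold:
            hits.append((passage, known_clauses[best_idx], best_score))
    return sorted(hits, key=lambda h: h[2], reverse=True)
```

In a setup closer to the one the abstract describes, the `Counter` vectors would presumably be replaced by sentence embeddings from a language model, with the ranking and thresholding steps left broadly the same.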
Pages: 39