Improving Zero-Shot Text Matching for Financial Auditing with Large Language Models

被引:3
作者
Hillebrand, Lars [1 ]
Berger, Armin [1 ]
Deusser, Tobias [1 ,3 ]
Dilmaghani, Tim [2 ]
Khaled, Mohamed [2 ]
Kliem, Bernd [2 ]
Loitz, Ruediger [2 ]
Pielka, Maren [1 ]
Leonhard, David [1 ]
Bauckhage, Christian [1 ,3 ]
Sifa, Rafet [1 ,3 ]
机构
[1] Fraunhofer IAIS, St Augustin, Germany
[2] PricewaterhouseCoopers GmbH, Dusseldorf, Germany
[3] Univ Bonn, Bonn, Germany
来源
PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023 | 2023年
关键词
Large Language Models; Recommender System; Text Matching;
D O I
10.1145/3573128.3609344
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Auditing financial documents is a very tedious and time-consuming process. As of today, it can already be simplified by employing AI-based solutions to recommend relevant text passages from a report for each legal requirement of rigorous accounting standards. However, these methods need to be fine-tuned regularly, and they require abundant annotated data, which is often lacking in industrial environments. Hence, we present ZeroShotALI, a novel recommender system that leverages a state-of-the-art large language model (LLM) in conjunction with a domain-specifically optimized transformer-based text-matching solution. We find that a two-step approach of first retrieving a number of best matching document sections per legal requirement with a custom BERT-based model and second filtering these selections using an LLM yields significant performance improvements over existing approaches.
引用
收藏
页数:4
相关论文
共 18 条
  • [1] Achiam OJ, 2023, Arxiv, DOI [arXiv:2303.08774, 10.48550/arXiv.2303.08774, 10.48550/ARXIV.2303.08774]
  • [2] Ali Syed Musharraf, 2023, P NLDL
  • [3] ANN-Benchmarks: A benchmarking tool for approximate nearest neighbor algorithms
    Aumuller, Martin
    Bernhardsson, Erik
    Faithfull, Alexander
    [J]. INFORMATION SYSTEMS, 2020, 87
  • [4] Biesner David, 2022, P ICML A
  • [5] Bridging the gap - the impact of ChatGPT on financial research
    Cao, Yi
    Zhai, Jia
    [J]. JOURNAL OF CHINESE ECONOMIC AND BUSINESS STUDIES, 2023, 21 (02) : 177 - 191
  • [6] Deusser Tobias, 2022, P ICMLA
  • [7] Deusser Tobias., 2023, P NLDL
  • [8] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [9] Hillebrand L., 2022, P ICPR
  • [10] Hillebrand Lars., 2022, P BIGDATA