Question Answering System to Answer Questions About Technical Documentation

Cited by: 0

Authors:

Olewniczak, Szymon [1]

Maciszka, Michal [1]

Paluszewski, Kamil [1]

Pozorski, Grzegorz [1]

Rosenthal, Wojciech [1]

Zaleski, Lukasz [1]

Affiliations:

[1] Gdansk Univ Technol, Dept Comp Architecture, Fac Elect Telecommun & Informat, Gdansk, Poland

Source:

ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2024, PART I | 2024 / Vol. 2165

Keywords:

Question Answering; Information Retrieval; AI; Chatbot; Natural Language Processing; Documentation;

DOI:

10.1007/978-3-031-70248-8_15

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This article examines specialized AI systems for question answering about programming languages, using Rust as the case study. We use BERT, a leading model in natural language processing, to assess how well it interprets and answers complex, domain-specific queries. We developed a new dataset derived from Rust's detailed documentation, which exceeds the usual input size of language models. The dataset serves as a basis for evaluating BERT in a domain-specific context and provides a new resource for testing question-answering systems, shedding light on their strengths and limitations when processing specialized technical information. We propose a solution based on a retrieval-reader architecture, fine-tune a RoBERTa model on this dataset, and run the typical tests for this problem. The results show that domain-specific question answering remains a challenging problem.
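
The retrieval-reader idea named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the window sizes, the stopword list, the TF-IDF scoring, and every function name below are assumptions made for illustration. In particular, `read_answer` is a stand-in that picks the sentence with the largest term overlap; in the paper the reader is a fine-tuned RoBERTa model. The chunking step reflects the abstract's point that the documentation exceeds a language model's usual input size.

```python
import math
import re
from collections import Counter

# Assumption: a tiny hand-picked stopword list, only for this illustration.
STOPWORDS = {"a", "an", "the", "is", "are", "what", "how", "does", "do", "of", "in", "to"}


def content_terms(text):
    """Lowercase, split on non-alphanumerics, and drop stopwords."""
    return [t for t in re.findall(r"[a-z0-9_]+", text.lower()) if t not in STOPWORDS]


def chunk_document(text, max_tokens=50, stride=25):
    """Split a long document into overlapping windows of whitespace-separated words."""
    words = text.split()
    if len(words) <= max_tokens:
        return [text]
    chunks = []
    for start in range(0, len(words), stride):
        chunks.append(" ".join(words[start:start + max_tokens]))
        if start + max_tokens >= len(words):
            break  # the last window already reaches the end of the document
    return chunks


def retrieve(question, chunks, top_k=1):
    """Retriever: rank chunks by a smoothed TF-IDF overlap with the question."""
    n = len(chunks)
    tokenized = [content_terms(c) for c in chunks]
    doc_freq = Counter(t for toks in tokenized for t in set(toks))
    q_terms = set(content_terms(question))
    scored = []
    for idx, toks in enumerate(tokenized):
        tf = Counter(toks)
        score = sum(
            tf[t] * (math.log((n + 1) / (doc_freq[t] + 1)) + 1.0)
            for t in q_terms
            if t in tf
        )
        scored.append((score, idx))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [chunks[i] for _, i in scored[:top_k]]


def read_answer(question, passage):
    """Stand-in reader (NOT RoBERTa): return the sentence with the most question terms."""
    q_terms = set(content_terms(question))
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", passage) if s.strip()]
    return max(sentences, key=lambda s: len(q_terms & set(content_terms(s))), default="")


def answer(question, documents):
    """Chunk all documents, retrieve the best chunk, then extract an answer from it."""
    chunks = [c for doc in documents for c in chunk_document(doc)]
    best_chunk = retrieve(question, chunks, top_k=1)[0]
    return read_answer(question, best_chunk)


if __name__ == "__main__":
    # Toy passages for illustration only; they are not taken from the paper's dataset.
    docs = [
        "Ownership is a set of rules that govern how a Rust program manages memory. "
        "Each value in Rust has an owner.",
        "A trait defines functionality a particular type has and can share with other types. "
        "Traits are similar to interfaces in other languages.",
    ]
    print(answer("What is a trait?", docs))
```

Swapping the stand-in reader for an extractive question-answering model would leave the retrieval and chunking steps unchanged, which is the main appeal of separating the two stages.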


Pages: 193-205

Page count: 13
