Question Answering System to Answer Questions About Technical Documentation

被引:0
作者
Olewniczak, Szymon [1 ]
Maciszka, Michal [1 ]
Paluszewski, Kamil [1 ]
Pozorski, Grzegorz [1 ]
Rosenthal, Wojciech [1 ]
Zaleski, Lukasz [1 ]
机构
[1] Gdansk Univ Technol, Dept Comp Architecture, Fac Elect Telecommun & Informat, Gdansk, Poland
来源
ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2024, PART I | 2024年 / 2165卷
关键词
Question Answering; Information Retrieval; AI; Chatbot; Natural Language Processing; Documentation;
D O I
10.1007/978-3-031-70248-8_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article ventures into the realm of specialized AI systems for question answering, with a specific focus on programming languages, using Rust as the case study. Our research harnesses the capabilities of BERT, a leading model in natural language processing, to explore its effectiveness in interpreting and responding to complex, domain-specific queries. We have developed a novel dataset, derived from Rust's detailed documentation, which surpasses the usual input size for language models. This dataset serves as a foundation for evaluating BERT's performance in a domain-specific context, providing a new resource for testing question-answering systems and shedding light on their strengths and limitations in processing specialized technical information. In this paper, we proposed a solution based on retrieval-reader architecture, the fine-tuned RoBERTa model with the usage of the mentioned dataset, and conducted typical tests for said problem. It is shown, that domain-specific question-answering remains a challenging problem.
引用
收藏
页码:193 / 205
页数:13
相关论文
共 19 条
[1]   Reading comprehension based question answering system in Bangla language with transformer-based learning [J].
Aurpa, Tanjim Taharat ;
Rifat, Richita Khandakar ;
Ahmed, Md Shoaib ;
Anwar, Md Musfique ;
Ali, A. B. M. Shawkat .
HELIYON, 2022, 8 (10)
[2]  
Brown TB, 2020, ADV NEUR IN, V33
[3]  
Chen Danqi, 2017, arXiv
[4]   Survey on evaluation methods for dialogue systems [J].
Deriu, Jan ;
Rodrigo, Alvaro ;
Otegi, Arantxa ;
Echegoyen, Guillermo ;
Rosset, Sophie ;
Agirre, Eneko ;
Cieliebak, Mark .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) :755-810
[5]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]   OpenAPI Bot: A Chatbot to Help You Understand REST APIs [J].
Ed-Douibi, Hamza ;
Daniel, Gwendal ;
Cabot, Jordi .
WEB ENGINEERING, ICWE 2020, 2020, 12128 :538-542
[7]  
Es Shahul, 2023, Ragas: automated evaluation of retrieval augmented generation
[8]  
He Y, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P4604
[9]  
Liu Yinhan, 2019, Roberta: A robustly optimized bert pretraining ap- proach.
[10]   Prediction of the high-cost normalised discounted cumulative gain (nDCG) measure in information retrieval evaluation [J].
Muwanei, Sinyinda ;
Ravana, Sri Devi ;
Hoo, Wai Lam ;
Kunda, Douglas .
INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 2022, 27 (02)