Towards an Open Domain Arabic Question Answering System: Assessment of the Bert Approach

被引:0
作者
Azroumahli, Chaimae [1 ]
El Younoussi, Yacine [2 ]
Badir, Hassan [3 ]
机构
[1] Moroccan Sch Engn Sci EMSI, Lab Intelligent Syst & Applicat LSIA, Tangier, Morocco
[2] Abdelmalek Essaadi Univ, SIGL, ENSA Tetuan, Tetouan, Morocco
[3] Abdelmalek Essaadi Univ, IDS Team, ENSA Tangier, Tanger, Morocco
来源
ADVANCES IN MODEL AND DATA ENGINEERING IN THE DIGITALIZATION ERA, MEDI 2023 SHORT AND WORKSHOP PAPERS | 2024年 / 2071卷
关键词
Arabic NLP; Contextualized word representations; Transformers; Question Answering; Bert;
D O I
10.1007/978-3-031-55729-3_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, deep learning-based contextualized word representations have made substantial advancements in enhancing the efficiency of various natural language processing (NLP) applications. However, only limited efforts have been dedicated to employing these representations for the development of Arabic opendomain question-answering (QA) systems, which are an indispensable component of conversational agents such as ChatGPT. In this study, we address this gap by delving into the Bert architecture to create a pre-trained Arabic Bert model. Furthermore, we assess the performance of this model in constructing a QA system by comparing its performance with that of a multilingual Bert model. The experimental results show that our AraQA_Bert_SL model, fine-tuned on the weights of a single-language pre-trained model, outperforms existing systems, boasting an F1 score of 90.6% and a pRR score of 93.7%. This achievement surpasses the performance of the AraQA_Bert_ML model, which relies on a multilingual pre-trained model. Notably, our approach significantly reduces the computational costs associated with the process of Bert fine-tuning.
引用
收藏
页码:33 / 46
页数:14
相关论文
共 27 条
[1]  
Ahmed W., 2016, Answer extraction for howandwhy questions in question answering systems, P18
[2]  
Akour Mohammed, 2011, American Journal of Applied Sciences, V8, P652, DOI 10.3844/ajassp.2011.652.661
[3]   A comprehensive survey of techniques for developing an Arabic question answering system [J].
Alkhurayyif, Yazeed ;
Sait, Abdul Rahaman Wahab .
PEERJ COMPUTER SCIENCE, 2023, 9
[4]   A hybrid semantic query expansion approach for Arabic information retrieval [J].
ALMarwi, Hiba ;
Ghurab, Mossa ;
Al-Baltah, Ibrahim .
JOURNAL OF BIG DATA, 2020, 7 (01)
[5]  
Antoun W, 2020, P 4 WORKSH OP SOURC, P9
[6]  
Azroumahli Chaimae, 2020, International Journal of Computer Information Systems and Industrial Management Applications, V12, P349
[7]  
Azroumahli C., 2018, SoCPaR 2017, V737, P130, DOI [10.1007/978-3-319-76357-613, DOI 10.1007/978-3-319-76357-613]
[8]  
Brini W, 2009, IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, P417
[9]  
Chaimae A., 2020, 2020 INT S ADV EL CO, P1
[10]  
Chaimae A., 2019, Int. J. Rough Sets Data Anal., V6, P18, DOI [10.4018/IJRSDA.2019070102, DOI 10.4018/IJRSDA.2019070102]