Developing an Open Domain Arabic Question Answering System Using a Deep Learning Technique

Cited by: 2
Authors
Alkhurayyif, Yazeed [1]
Sait, Abdul Rahaman Wahab [2]
Affiliations
[1] Shaqra Univ, Al Quwayiyah Coll Sci & Humanities, Dept Comp Sci, Shaqra 11911, Saudi Arabia
[2] King Faisal Univ, Ctr Documents & Adm Commun, Dept Documents & Arch, Al Hufuf 31982, Saudi Arabia
Keywords
Open-domain QAS; question answering system; name entity relationship; deep learning; Arabic question-answering model;
DOI
10.1109/ACCESS.2023.3292190
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
A question-answering system (QAS) retrieves a relevant response to a user query. Existing QASs often fall short of satisfying the user's intent. Researchers have recently focused on developing Arabic QASs, but a significant number of these systems are restricted to specific domains. This study therefore develops an open-domain Arabic QAS using a deep learning technique. The proposed QAS comprises three phases: data pre-processing, name entity relationship, and response retrieval. In the pre-processing phase, the researchers apply de-diacritization, reduction of orthographic ambiguity, tokenization, and morphological analysis to extract the key terms from the Arabic content, which helps the QAS overcome the challenges of understanding Arabic text. The Multinomial Naive Bayes algorithm is applied to uncover the relationships among the Arabic terms. In addition, the authors employ the Embeddings from Language Models (ELMo) approach with a quaternion long short-term memory neural network (QLSTM) to construct the QAS with limited resources. The Arabic reading comprehension dataset (ARCD) and TyDiQA are used to evaluate performance. The experimental results show that the proposed QAS achieves accuracy, precision, recall, F1-score, MCC, and Kappa of 96.23, 97, 96.95, 97, 95.98, and 95.7 on ARCD and 95.35, 94.8, 94.68, 94.73, 92.98, and 93.6 on TyDiQA, respectively. The structure of the proposed QAS is lightweight and can be implemented in real-world applications.
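The pre-processing steps named in the abstract (de-diacritization, reduction of orthographic ambiguity, tokenization) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the specific normalization rules (unifying alef variants, mapping alef maqsura to yeh and teh marbuta to heh), and the regex-based tokenizer are assumptions chosen because they are common in Arabic NLP pipelines. Morphological analysis is omitted because it requires an external analyzer.

```python
import re

# Assumed diacritic set: tashkeel marks U+064B..U+0652, superscript alef U+0670,
# and the tatweel (elongation) character U+0640.
_DIACRITICS = re.compile("[\u064B-\u0652\u0670\u0640]")

# Assumed alef variants: alef with madda, hamza above, hamza below.
_ALEF_VARIANTS = re.compile("[\u0622\u0623\u0625]")

# Runs of basic Arabic letters (hamza U+0621 through yeh U+064A).
_ARABIC_WORD = re.compile("[\u0621-\u064A]+")


def strip_diacritics(text: str) -> str:
    """Remove short-vowel marks and tatweel (de-diacritization)."""
    return _DIACRITICS.sub("", text)


def normalize_orthography(text: str) -> str:
    """Collapse common spelling variants (illustrative rules, lossy by design)."""
    text = _ALEF_VARIANTS.sub("\u0627", text)  # alef variants -> bare alef
    text = text.replace("\u0649", "\u064A")    # alef maqsura -> yeh
    text = text.replace("\u0629", "\u0647")    # teh marbuta -> heh
    return text


def tokenize(text: str) -> list[str]:
    """Split into Arabic-letter tokens, dropping punctuation and digits."""
    return _ARABIC_WORD.findall(text)


def preprocess(text: str) -> list[str]:
    """Hypothetical pipeline: de-diacritize, normalize, then tokenize."""
    return tokenize(normalize_orthography(strip_diacritics(text)))
```

Normalization of this kind trades precision for recall: mapping teh marbuta to heh, for example, merges some distinct words, so a production system might keep the original surface form alongside the normalized key.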
Pages: 69131-69143
Number of pages: 13