Enabling deep learning for large scale question answering in Italian

被引:2
作者
Croce, Danilo [1 ]
Zelenanska, Alexandra [1 ]
Basili, Roberto [1 ]
机构
[1] Univ Roma Tor Vergata, Dept Enterprise Engn, Rome, Italy
关键词
Question answering in Italian; deep learning; recurrent neural network with attention;
D O I
10.3233/IA-190018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent breakthroughs in the field of deep learning led to state-of-the-art results in several NLP tasks, such as Question Answering (QA). Unfortunately, the requirements of such neural QA systems are very strict due to the size of the involved training datasets. In cross-linguistic settings these requirements are not satisfied as training datasets for QA over non-English texts are often not available. This represents the major barrier for a wide-spread adoption of neural QA methods in NLP applications. In this paper, the acquisition of a large scale dataset for an open-domain factoid question answering system in Italian is discussed. It is obtained by automatic translation and linguistic elicitation of an existing English dataset, i.e. the SQUAD question-answer pair corpus. Even though the quality of the resulting corpus for Italian might not be completely satisfying, our work allowed to generate more than 60 thousand question-answer pairs. In the paper the impact of this resource on the QA process over the Italian Wikipedia is studied, according to different training conditions and architectural constraints. A comparative evaluation against the English version, in line with standards in the SQUAD literature, is carried out. The outcomes show that the results achievable for Italian are below the state-of-the-art for English, but the ability of learning not to respond (i.e. the adoption of techniques for detecting question whose answers are simply not available, i.e. EMPTY set of answers) allows the system to pursue reasonable levels of precision. This make it already usable within realistic application scenarios. Finally, an error analysis is presented that suggests possible future research directions on still critical but highly beneficial enhancements, in view of concrete QA applications in Italian.
引用
收藏
页码:49 / 61
页数:13
相关论文
共 50 条
  • [41] Personalized Course Resource Recommendation Algorithm Based on Deep Learning in the Intelligent Question Answering Robot Environment
    Sun, Peng
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH, 2023, 16 (03)
  • [42] BERT+vnKG: Using Deep Learning and Knowledge Graph to Improve Vietnamese Question Answering System
    Phan, Truong H., V
    Phuc Do
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (07) : 480 - 487
  • [43] Deep neural network approach for arabic community question answering
    Almiman, Ali
    Osman, Nada
    Torki, Marwan
    ALEXANDRIA ENGINEERING JOURNAL, 2020, 59 (06) : 4427 - 4434
  • [44] Deep Neural Network Models for Question Classification in Community Question-Answering Forums
    Upadhya, Akshay B.
    Udupa, Swastik
    Kamath, Sowmya S.
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [45] Integrating deep learning for visual question answering in Agricultural Disease Diagnostics: Case Study of Wheat Rust
    Nanavaty, Akash
    Sharma, Rishikesh
    Pandita, Bhuman
    Goyal, Ojasva
    Rallapalli, Srinivas
    Mandal, Murari
    Singh, Vaibhav Kumar
    Narang, Pratik
    Chamola, Vinay
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [46] Large-scale Pollen Recognition with Deep Learning
    de Geus, Andre R.
    Barcelos, Celia A. Z.
    Batista, Marcos A.
    da Silva, Sergio F.
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [47] Deep Learning on Large-scale Muticore Clusters
    Sakiyama, Kazumasa
    Kato, Shinpei
    Ishikawa, Yutaka
    Hori, Atsushi
    Monrroy, Abraham
    2018 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2018), 2018, : 314 - 321
  • [48] Enabling ImageNet-Scale Deep Learning on MCUs for Accurate and Efficient Inference
    Sadiq, Sulaiman
    Hare, Jonathon
    Craske, Simon
    Maji, Partha
    Merrett, Geoff
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (07) : 11471 - 11479
  • [49] Category Prediction of Questions Posted in Community-Based Question Answering Services Using Deep Learning Methods
    Ma, Qing
    Kato, Reo
    Murata, Masaki
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT V, 2017, 10638 : 699 - 709
  • [50] Visual question answering model for fruit tree disease decision-making based on multimodal deep learning
    Lan, Yubin
    Guo, Yaqi
    Chen, Qizhen
    Lin, Shaoming
    Chen, Yuntong
    Deng, Xiaoling
    FRONTIERS IN PLANT SCIENCE, 2023, 13