SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions

被引:49
|
作者
Sarrouti, Mourad [1 ]
Ouatik El Alaoui, Said [2 ,3 ]
机构
[1] US Natl Inst Hlth, Lister Hill Natl Ctr Biomed Commun, US Natl Lib Med, Bethesda, MD 20892 USA
[2] Ibn Tofail Univ, Natl Sch Appl Sci, Kenitra, Morocco
[3] Sidi Mohammed Ben Abdellah Univ, Lab Informat & Modeling, Fes, Morocco
关键词
Biomedical question answering; Information retrieval; Passage retrieval; Natural language processing; Machine leaming; Biomedical informatics; BioASQ; CLINICAL QUESTIONS; INFORMATION; ONTOLOGY; TEXT; CLASSIFICATION; SIMILARITY; LEXRANK; MEDLINE;
D O I
10.1016/j.artmed.2019.101767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background and objective: Question answering (QA), the identification of short accurate answers to users questions written in natural language expressions, is a longstanding issue widely studied over the last decades in the open-domain. However, it still remains a real challenge in the biomedical domain as the most of the existing systems support a limited amount of question and answer types as well as still require further efforts in order to improve their performance in terms of precision for the supported questions. Here, we present a semantic biomedical QA system named SemBioNLQA which has the ability to handle the kinds of yes/no, factoid, list, and summary natural language questions. Methods: This paper describes the system architecture and an evaluation of the developed end-to-end biomedical QA system named SemBioNLQA, which consists of question classification, document retrieval, passage retrieval and answer extraction modules. It takes natural language questions as input, and outputs both short precise answers and summaries as results. The SemBioNLQA system, dealing with four types of questions, is based on (1) handcrafted lexico-syntactic patterns and a machine learning algorithm for question classification, (2) PubMed search engine and UMLS similarity for document retrieval, (3) the BM25 model, stemmed words and UMLS concepts for passage retrieval, and (4) UMLS metathesaurus, BioPortal synonyms, sentiment analysis and term frequency metric for answer extraction. Results and conclusion: Compared with the current state-of-the-art biomedical QA systems, SemBioNLQA, a fully automated system, has the potential to deal with a large amount of question and answer types. SemBioNLQA retrieves quickly users' information needs by returning exact answers (e.g., "yes", "no", a biomedical entity name, etc.) and ideal answers (i.e., paragraph-sized summaries of relevant information) for yes/no, factoid and list questions, whereas it provides only the ideal answers for summary questions. Moreover, experimental evaluations performed on biomedical questions and answers provided by the BioASQ challenge especially in 2015, 2016 and 2017 (as part of our participation), show that SemBioNLQA achieves good performances compared with the most current state-of-the-art systems and allows a practical and competitive alternative to help information seekers find exact and ideal answers to their biomedical questions.
引用
收藏
页数:16
相关论文
共 8 条
  • [1] A Multiple Ontologies Based System for Answering Natural Language Questions
    El-Ansari, Anas
    Beni-Hssane, Abderrahim
    Saadi, Mostafa
    EUROPE AND MENA COOPERATION ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGIES, 2017, 520 : 177 - 186
  • [2] Semantic Understanding of Natural Language Stories for Near Human Question Answering
    Jamil, Hasan M.
    Oduro-Afriyie, Joel
    FLEXIBLE QUERY ANSWERING SYSTEMS, 2019, 11529 : 215 - 227
  • [3] JOSN: JAVA']JAVA Oriented Question-Answering System Combining Semantic Web and Natural language Processing techniques
    Garg, Shally
    Kumar, Suresh
    2016 1ST INDIA INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (IICIP), 2016,
  • [4] Natural language neural network and its application to question-answering system
    Sagara, Tsukasa
    Hagiwara, Masafumi
    NEUROCOMPUTING, 2014, 142 : 201 - 208
  • [5] Question-Answering System Design in Teaching and Learning, Based on Natural Language Processing
    Wang Ming
    Yuan Dachao
    PROCEEDINGS OF THE FOURTH NORTHEAST ASIA INTERNATIONAL SYMPOSIUM ON LANGUAGE, LITERATURE AND TRANSLATION, 2015, 2015, : 132 - 137
  • [6] Research on Computer Natural Language Processing Intelligent Question Answering System Based on Knowledge Graph
    Zhan, Liuchun
    Huang, Changjiang
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 70 - 74
  • [7] Gathering information from a relational lexical-conceptual database: A natural language question-answering system
    Marrafa, P
    Ribeiro, C
    Santos, R
    Correia, J
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING, 2004, : 260 - 265
  • [8] A Machine Learning Based Natural Language Question and Answering System for Healthcare Data Search using Complex Queries
    Yeo, Hangu
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2467 - 2474