Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach

被引:7
作者
Raza, Shaina [1 ,2 ]
Schwartz, Brian [1 ,2 ]
机构
[1] Publ Hlth Ontario PHO, Toronto, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
关键词
Natural language processing; Data cohort; COVID-19; Named entity; Relation extraction; Transfer learning; Artificial intelligence; RECOGNITION;
D O I
10.1186/s12911-023-02117-3
中图分类号
R-058 [];
学科分类号
摘要
BackgroundExtracting relevant information about infectious diseases is an essential task. However, a significant obstacle in supporting public health research is the lack of methods for effectively mining large amounts of health data.ObjectiveThis study aims to use natural language processing (NLP) to extract the key information (clinical factors, social determinants of health) from published cases in the literature.MethodsThe proposed framework integrates a data layer for preparing a data cohort from clinical case reports; an NLP layer to find the clinical and demographic-named entities and relations in the texts; and an evaluation layer for benchmarking performance and analysis. The focus of this study is to extract valuable information from COVID-19 case reports.ResultsThe named entity recognition implementation in the NLP layer achieves a performance gain of about 1-3% compared to benchmark methods. Furthermore, even without extensive data labeling, the relation extraction method outperforms benchmark methods in terms of accuracy (by 1-8% better). A thorough examination reveals the disease's presence and symptoms prevalence in patients.ConclusionsA similar approach can be generalized to other infectious diseases. It is worthwhile to use prior knowledge acquired through transfer learning when researching other infectious diseases.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Ischemic stroke and COVID-19 infection - a review of clinical case reports
    Malempati, M.
    Patel, M.
    Patel, J.
    EGYPTIAN JOURNAL OF INTERNAL MEDICINE, 2024, 36 (01)
  • [32] Validation of a Natural Language Processing Algorithm for the Extraction of the Sleep Parameters from the Polysomnography Reports
    Rahman, Mahbubur
    Nowakowski, Sara
    Agrawal, Ritwick
    Naik, Aanand
    Sharafkhaneh, Amir
    Razjouyan, Javad
    HEALTHCARE, 2022, 10 (10)
  • [33] A Study of the Effects of the COVID-19 Pandemic on the Experience of Back Pain Reported on Twitter(R) in the United States: A Natural Language Processing Approach
    Fiok, Krzysztof
    Karwowski, Waldemar
    Gutierrez, Edgar
    Saeidi, Maham
    Aljuaid, Awad M.
    Davahli, Mohammad Reza
    Taiar, Redha
    Marek, Tadeusz
    Sawyer, Ben D.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (09)
  • [34] Impact of the COVID-19 Pandemic on the Epidemiological Situation of Pulmonary Tuberculosis-Using Natural Language Processing
    Morena, Diego
    Campos, Carolina
    Castillo, Maria
    Alonso, Miguel
    Benavent, Maria
    Izquierdo, Jose Luis
    JOURNAL OF PERSONALIZED MEDICINE, 2023, 13 (12):
  • [35] Toward Using Twitter for Tracking COVID-19: A Natural Language Processing Pipeline and Exploratory Data Set
    Klein, Ari Z.
    Magge, Arjun
    O'Connor, Karen
    Amaro, Jesus Ivan Flores
    Weissenbacher, Davy
    Hernandez, Graciela Gonzalez
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (01)
  • [36] Classification of the Disposition of Patients Hospitalized with COVID-19: Reading Discharge Summaries Using Natural Language Processing
    Fernandes, Marta
    Sun, Haoqi
    Jain, Aayushee
    Alabsi, Haitham S.
    Brenner, Laura N.
    Ye, Elissa
    Ge, Wendong
    Collens, Sarah, I
    Leone, Michael J.
    Das, Sudeshna
    Robbins, Gregory K.
    Mukerji, Shibani S.
    Westover, M. Brandon
    JMIR MEDICAL INFORMATICS, 2021, 9 (02)
  • [37] Beyond Natural Language Processing: Building Knowledge Graphs to Assist Scientists Understand COVID-19 Concepts
    Yu, Yishu
    2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 245 - 251
  • [38] Natural language processing in urology: Automated extraction of clinical information from histopathology reports of uro-oncology procedures
    Huang, Honghong
    Lim, Fiona Xin Yi
    Gu, Gary Tianyu
    Han, Matthew Jiangchou
    Fang, Andrew Hao Sen
    Chia, Elian Hui San
    Bei, Eileen Yen Tze
    Tham, Sarah Zhuling
    Ho, Henry Sun Sien
    Yuen, John Shyi Peng
    Sun, Aixin
    Lim, Jay Kheng Sit
    HELIYON, 2023, 9 (04)
  • [39] Epidural pneumorrhachis in COVID-19: a rare clinical entity
    Rao, Shiavax J.
    Lakra, Pallavi
    Chittal, Abhinandan R.
    Aughenbaugh, Michael
    Haas, Christopher J.
    JOURNAL OF COMMUNITY HOSPITAL INTERNAL MEDICINE PERSPECTIVES, 2021, 11 (05): : 719 - 721
  • [40] A Deep Language Model for Symptom Extraction From Clinical Text and its Application to Extract COVID-19 Symptoms From Social Media
    Luo, Xiao
    Gandhi, Priyanka
    Storey, Susan
    Huang, Kun
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (04) : 1737 - 1748