Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach

被引:7
|
作者
Raza, Shaina [1 ,2 ]
Schwartz, Brian [1 ,2 ]
机构
[1] Publ Hlth Ontario PHO, Toronto, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
关键词
Natural language processing; Data cohort; COVID-19; Named entity; Relation extraction; Transfer learning; Artificial intelligence; RECOGNITION;
D O I
10.1186/s12911-023-02117-3
中图分类号
R-058 [];
学科分类号
摘要
BackgroundExtracting relevant information about infectious diseases is an essential task. However, a significant obstacle in supporting public health research is the lack of methods for effectively mining large amounts of health data.ObjectiveThis study aims to use natural language processing (NLP) to extract the key information (clinical factors, social determinants of health) from published cases in the literature.MethodsThe proposed framework integrates a data layer for preparing a data cohort from clinical case reports; an NLP layer to find the clinical and demographic-named entities and relations in the texts; and an evaluation layer for benchmarking performance and analysis. The focus of this study is to extract valuable information from COVID-19 case reports.ResultsThe named entity recognition implementation in the NLP layer achieves a performance gain of about 1-3% compared to benchmark methods. Furthermore, even without extensive data labeling, the relation extraction method outperforms benchmark methods in terms of accuracy (by 1-8% better). A thorough examination reveals the disease's presence and symptoms prevalence in patients.ConclusionsA similar approach can be generalized to other infectious diseases. It is worthwhile to use prior knowledge acquired through transfer learning when researching other infectious diseases.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach
    Shaina Raza
    Brian Schwartz
    BMC Medical Informatics and Decision Making, 23
  • [2] Natural language processing to convert unstructured COVID-19 chest-CT reports into structured reports
    Fanni, Salvatore Claudio
    Romei, Chiara
    Ferrando, Giovanni
    Volpi, Federica
    D'Amore, Caterina Aida
    Bedini, Claudio
    Ubbiali, Sandro
    Valentino, Salvatore
    Neri, Emanuele
    EUROPEAN JOURNAL OF RADIOLOGY OPEN, 2023, 11
  • [3] Clinical Application of Detecting COVID-19 Risks: A Natural Language Processing Approach
    Bashir, Syed Raza
    Raza, Shaina
    Kocaman, Veysel
    Qamar, Urooj
    VIRUSES-BASEL, 2022, 14 (12):
  • [4] Novel approach by natural language processing for COVID-19 knowledge discovery
    Wang, Li
    Jiang, Lei
    Pan, Dongyan
    Wang, Qinghua
    Yin, Zeyu
    Kang, Zijian
    Tian, Haoran
    Geng, Xuqiang
    Shao, Jinsong
    Pan, Wenjie
    Yin, Jian
    Fang, Li
    Wang, Yue
    Zhang, Weide
    Li, Zhixiu
    Zheng, Jun
    Hu, Wenxin
    Pan, Yunbao
    Yu, Dong
    Guo, Shicheng
    Lu, Wei
    Li, Qiang
    Zhou, Yunyun
    Xu, Huji
    BIOMEDICAL JOURNAL, 2022, 45 (03) : 472 - 481
  • [5] Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing
    Chen, Qingyu
    Leaman, Robert
    Allot, Alexis
    Luo, Ling
    Wei, Chih-Hsuan
    Yan, Shankai
    Lu, Zhiyong
    ANNUAL REVIEW OF BIOMEDICAL DATA SCIENCE, VOL 4, 2021, 4 : 313 - 339
  • [6] Scientific Landscape of Publications in Natural Language Processing in the ASEAN Region on COVID-19: A Bibliometric Approach
    Roxas, Rachel Edita
    Tobias, Rogelio Ruzcko
    Minglana, Johanna
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 379 - 384
  • [7] Incivility in COVID-19 Vaccine Mandate Discourse and Moral Foundations: Natural Language Processing Approach
    Tin, Jason
    Stevens, Hannah
    Rasul, Muhammad Ehab
    Taylor, Laramie D.
    JMIR FORMATIVE RESEARCH, 2023, 7
  • [8] An approach to the issues around COVID-19 Application of natural language processing techniques on comments from digital news readers
    Rosati, German
    Chazarreta, Adriana
    Domenech, Laia
    Maguire, Tomas
    PAPELES DE TRABAJO, 2021, 15 (28): : 64 - 91
  • [9] Unsupervised natural language processing in the identification of patients with suspected COVID-19 infection
    da Silva, Rildo Pinto
    Pollettini, Juliana Tarossi
    Pazin Filho, Antonio
    CADERNOS DE SAUDE PUBLICA, 2023, 39 (11):
  • [10] Texas Public Agencies' Tweets and Public Engagement During the COVID-19 Pandemic: Natural Language Processing Approach
    Tang, Lu
    Liu, Wenlin
    Thomas, Benjamin
    Hong Thoai Nga Tran
    Zou, Wenxue
    Zhang, Xueying
    Zhi, Degui
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2021, 7 (04):