Natural Language Processing in Dutch Free Text Radiology Reports: Challenges in a Small Language Area Staging Pulmonary Oncology

Cited by: 17
Authors
Nobel, J. Martijn [1 ,2 ]
Puts, Sander [3 ]
Bakers, Frans C. H. [1 ]
Robben, Simon G. F. [1 ,2 ]
Dekker, Andre L. A. J. [3 ]
Affiliations
[1] Maastricht Univ, Med Ctr, Dept Radiol & Nucl Med, Postbox 5800, NL-6202 AZ Maastricht, Netherlands
[2] Maastricht Univ, Sch Hlth Profess Educ, Maastricht, Netherlands
[3] Maastricht Univ, Med Ctr, GROW Sch Oncol & Dev Biol, Dept Radiat Oncol MAASTRO, Maastricht, Netherlands
Keywords
Radiology; Reporting; Natural language processing; Free text; Classification system; Machine learning; CLASSIFICATION;
DOI
10.1007/s10278-020-00327-z
Chinese Library Classification
R8 [Special medicine]; R445 [Diagnostic imaging];
Discipline classification codes
1002 ; 100207 ; 1009 ;
Abstract
Reports are the standard way of communication between the radiologist and the referring clinician. Efforts are made to improve this communication by, for instance, introducing standardization and structured reporting. Natural Language Processing (NLP) is another promising tool which can improve and enhance the radiological report by processing free text. NLP as such adds structure to the report and exposes the information, which in turn can be used for further analysis. This paper describes pre-processing and processing steps and highlights important challenges to overcome in order to successfully implement a free text mining algorithm using NLP tools and machine learning in a small language area, like Dutch. A rule-based algorithm was constructed to classify T-stage of pulmonary oncology from the original free text radiological report, based on the items tumor size, presence and involvement according to the 8th TNM classification system. PyContextNLP, spaCy and regular expressions were used as tools to extract the correct information and process the free text. Overall accuracy of the algorithm for evaluating T-stage was 0.83 in the training set and 0.87 in the validation set, which shows that the approach in this pilot study is promising. Future research with larger datasets and external validation is needed to be able to introduce more machine learning approaches and perhaps to reduce required input efforts of domain-specific knowledge. However, a hybrid NLP approach will probably achieve the best results.
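To make the rule-based idea in the abstract concrete, the following is a minimal sketch, not the authors' algorithm. It assumes that tumor size is the only input, that sizes appear as a number followed by "mm" or "cm" (with a Dutch decimal comma or a dot), and that the largest reported size decides the stage. The size thresholds follow the 8th TNM edition (T1 up to 3 cm, T2 up to 5 cm, T3 up to 7 cm, T4 above 7 cm). The names `SIZE_PATTERN`, `extract_sizes_cm`, `t_stage_from_size` and `classify_report` are hypothetical, and the presence, involvement and negation handling that the paper covers with PyContextNLP and spaCy is deliberately left out.

```python
import re

# Hypothetical pattern: a number (Dutch decimal comma or dot) followed by mm or cm.
SIZE_PATTERN = re.compile(r"(\d+(?:[.,]\d+)?)\s*(mm|cm)\b", re.IGNORECASE)


def extract_sizes_cm(text):
    """Return every size mentioned in the text, converted to centimetres."""
    sizes = []
    for value, unit in SIZE_PATTERN.findall(text):
        number = float(value.replace(",", "."))  # "4,2" -> 4.2
        sizes.append(number / 10 if unit.lower() == "mm" else number)
    return sizes


def t_stage_from_size(size_cm):
    """Map a tumor size in cm to a T-stage using 8th TNM size cut-offs only."""
    if size_cm <= 3:
        return "T1"
    if size_cm <= 5:
        return "T2"
    if size_cm <= 7:
        return "T3"
    return "T4"


def classify_report(text):
    """Assumption: the largest size in the report is the tumor size."""
    sizes = extract_sizes_cm(text)
    if not sizes:
        return None  # no size found; a real system would need other rules here
    return t_stage_from_size(max(sizes))


print(classify_report("Massa in de rechter bovenkwab van 4,2 cm."))  # T2
```

A sketch like this misclassifies reports where the largest measurement belongs to something other than the tumor (a lymph node, for example) or where the finding is negated, which is one reason context-aware tooling is needed on top of regular expressions.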
Pages: 1002-1008
Page count: 7
Related papers
50 in total
  • [1] Natural Language Processing in Dutch Free Text Radiology Reports: Challenges in a Small Language Area Staging Pulmonary Oncology
    J. Martijn Nobel
    Sander Puts
    Frans C. H. Bakers
    Simon G. F. Robben
    André L. A. J. Dekker
    Journal of Digital Imaging, 2020, 33 : 1002 - 1008
  • [2] How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation
    Puts, Sander
    Nobel, Martijn
    Zegers, Catharina
    Bermejo, Inigo
    Robben, Simon
    Dekker, Andre
    JMIR FORMATIVE RESEARCH, 2023, 7
  • [3] Natural Language Processing Algorithm Used for Staging Pulmonary Oncology from Free-Text Radiological Reports: "Including PET-CT and Validation Towards Clinical Use"
    Nobel, J. Martijn
    Puts, Sander
    Krdzalic, Jasenko
    Zegers, Karen M. L.
    Lobbes, Marc B. I.
    Robben, Simon G. F.
    Dekker, Andre L. A. J.
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (01): : 3 - 12
  • [4] T-staging pulmonary oncology from radiological reports using natural language processing: translating into a multi-language setting
    J. Martijn Nobel
    Sander Puts
    Jakob Weiss
    Hugo J. W. L. Aerts
    Raymond H. Mak
    Simon G. F. Robben
    André L. A. J. Dekker
    Insights into Imaging, 12
  • [5] T-staging pulmonary oncology from radiological reports using natural language processing: translating into a multi-language setting
    Nobel, J. Martijn
    Puts, Sander
    Weiss, Jakob
    Aerts, Hugo J. W. L.
    Mak, Raymond H.
    Robben, Simon G. F.
    Dekker, Andre L. A. J.
    INSIGHTS INTO IMAGING, 2021, 12 (01)
  • [6] A Natural Language Processing Pipeline of Chinese Free-Text Radiology Reports for Liver Cancer Diagnosis
    Liu, Honglei
    Xu, Yan
    Zhang, Zhiqiang
    Wang, Ni
    Huang, Yanqun
    Hu, Yanjun
    Yang, Zhenghan
    Jiang, Rui
    Chen, Hui
    IEEE ACCESS, 2020, 8 : 159110 - 159119
  • [7] A systematic review of natural language processing applied to radiology reports
    Casey, Arlene
    Davidson, Emma
    Poon, Michael
    Dong, Hang
    Duma, Daniel
    Grivas, Andreas
    Grover, Claire
    Suarez-Paniagua, Victor
    Tobin, Richard
    Whiteley, William
    Wu, Honghan
    Alex, Beatrice
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [8] Application of Natural Language Processing and Machine Learning to Radiology Reports
    Jeon, Seoungdeok
    Colburn, Zachary
    Sakai, Joshua
    Hung, Ling-Hong
    Yeung, Ka Yee
    12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [9] Incidental pulmonary nodules: Natural language processing analysis of radiology reports
    Grolleau, Emmanuel
    Couraud, Sebastien
    Delevaux, Emilien Jupin
    Piegay, Celine
    Mansuy, Adeline
    de Bermont, Julie
    Cotton, Francois
    Pialat, Jean-Baptiste
    Talbot, Francois
    Boussel, Loic
    RESPIRATORY MEDICINE AND RESEARCH, 2024, 86
  • [10] Natural Language Processing for Identification of Incidental Pulmonary Nodules in Radiology Reports
    Kang, Stella K.
    Garry, Kira
    Chung, Ryan
    Moore, William H.
    Iturrate, Eduardo
    Swartz, Jordan L.
    Kim, Danny C.
    Horwitz, Leora, I
    Blecker, Saul
    JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2019, 16 (11) : 1587 - 1594