Extracting cancer concepts from clinical notes using natural language processing: a systematic review

被引:13
|
作者
Gholipour, Maryam [1 ]
Khajouei, Reza [2 ]
Amiri, Parastoo [1 ]
Gohari, Sadrieh Hajesmaeel [3 ]
Ahmadian, Leila [2 ]
机构
[1] Kerman Univ Med Sci, Student Res Comm, Kerman, Iran
[2] Kerman Univ Med Sci, Fac Management & Med Informat Sci, Dept Hlth Informat Sci, Kerman, Iran
[3] Kerman Univ Med Sci, Inst Futures Studies Hlth, Med Informat Res Ctr, Kerman, Iran
关键词
Neoplasms; Natural language processing; NLP; Machine learning; Terminology; Information system; Systematic review; RADIOLOGY REPORTS; CLASSIFICATION; RETRIEVAL; RECORDS;
D O I
10.1186/s12859-023-05480-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundExtracting information from free texts using natural language processing (NLP) can save time and reduce the hassle of manually extracting large quantities of data from incredibly complex clinical notes of cancer patients. This study aimed to systematically review studies that used NLP methods to identify cancer concepts from clinical notes automatically.MethodsPubMed, Scopus, Web of Science, and Embase were searched for English language papers using a combination of the terms concerning "Cancer", "NLP", "Coding", and "Registries" until June 29, 2021. Two reviewers independently assessed the eligibility of papers for inclusion in the review.ResultsMost of the software programs used for concept extraction reported were developed by the researchers (n = 7). Rule-based algorithms were the most frequently used algorithms for developing these programs. In most articles, the criteria of accuracy (n = 14) and sensitivity (n = 12) were used to evaluate the algorithms. In addition, Systematized Nomenclature of Medicine-Clinical Terms (SNOMED-CT) and Unified Medical Language System (UMLS) were the most commonly used terminologies to identify concepts. Most studies focused on breast cancer (n = 4, 19%) and lung cancer (n = 4, 19%).ConclusionThe use of NLP for extracting the concepts and symptoms of cancer has increased in recent years. The rule-based algorithms are well-liked algorithms by developers. Due to these algorithms' high accuracy and sensitivity in identifying and extracting cancer concepts, we suggested that future studies use these algorithms to extract the concepts of other diseases as well.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Screening for Depression Using Natural Language Processing:Literature Review
    Teferra, Bazen Gashaw
    Rueda, Alice
    Pang, Hilary
    Valenzano, Richard
    Samavi, Reza
    Krishnan, Sridhar
    Bhat, Venkat
    INTERACTIVE JOURNAL OF MEDICAL RESEARCH, 2024, 13
  • [32] Development of a predictive model for retention in HIV care using natural language processing of clinical notes
    Oliwa, Tomasz
    Furner, Brian
    Schmitt, Jessica
    Schneider, John
    Ridgway, Jessica P.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (01) : 104 - 112
  • [33] Identifying Type II workplace violence from clinical notes using natural language processing
    Byon, Ha Do
    Harris, Catherine
    Crandall, Mary
    Song, Jiyoun
    Topaz, Maxim
    WORKPLACE HEALTH & SAFETY, 2023, 71 (10) : 484 - 490
  • [34] Natural Language Processing of Nursing Notes An Integrative Review
    Mitha, Shazia
    Schwartz, Jessica
    Hobensack, Mollie
    Cato, Kenrick
    Woo, Kyungmi
    Smaldone, Arlene
    Topaz, Maxim
    CIN-COMPUTERS INFORMATICS NURSING, 2023, 41 (06) : 377 - 384
  • [35] Natural language processing in clinical neuroscience and psychiatry: A review
    Crema, Claudio
    Attardi, Giuseppe
    Sartiano, Daniele
    Redolfi, Alberto
    FRONTIERS IN PSYCHIATRY, 2022, 13
  • [36] Using Clinical Notes and Natural Language Processing for Automated HIV Risk Assessment
    Feller, Daniel J.
    Zucker, Jason
    Yin, Michael T.
    Gordon, Peter
    Elhadad, Noemie
    JAIDS-JOURNAL OF ACQUIRED IMMUNE DEFICIENCY SYNDROMES, 2018, 77 (02) : 160 - 166
  • [37] Extraction of clinical phenotypes for Alzheimer's disease dementia from clinical notes using natural language processing
    Oh, Inez Y.
    Schindler, Suzanne E.
    Ghoshal, Nupur
    Lai, Albert M.
    Payne, Philip R. O.
    Gupta, Aditi
    JAMIA OPEN, 2023, 6 (01)
  • [38] Applying Natural Language Processing to Textual Data From Clinical Data Warehouses: Systematic Review
    Bazoge, Adrien
    Morin, Emmanuel
    Daille, Beatrice
    Gourraud, Pierre -Antoine
    JMIR MEDICAL INFORMATICS, 2023, 11
  • [39] Natural language processing pipeline to extract prostate cancer-related information from clinical notes
    Nakai, Hirotsugu
    Suman, Garima
    Adamo, Daniel A.
    Navin, Patrick J.
    Bookwalter, Candice A.
    LeGout, Jordan D.
    Chen, Frank K.
    Wellnitz, Clinton V.
    Silva, Alvin C.
    Thomas, John V.
    Kawashima, Akira
    Fan, Jungwei W.
    Froemming, Adam T.
    Lomas, Derek J.
    Humphreys, Mitchell R.
    Dora, Chandler
    Korfiatis, Panagiotis
    Takahashi, Naoki
    EUROPEAN RADIOLOGY, 2024, 34 (12) : 7878 - 7891
  • [40] Identifying stigmatizing and positive/preferred language in obstetric clinical notes using natural language processing
    Scroggins, Jihye Kim
    Hulchafo, Ismael I.
    Harkins, Sarah
    Scharp, Danielle
    Moen, Hans
    Davoudi, Anahita
    Cato, Kenrick
    Tadiello, Michele
    Topaz, Maxim
    Barcelona, Veronica
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, : 308 - 317