Evaluation of algorithms using administrative health and structured electronic medical record data to determine breast and colorectal cancer recurrence in a Canadian province Using algorithms to determine breast and colorectal cancer recurrence

被引:3
作者
Lambert, Pascal [1 ,2 ]
Pitz, Marshall [1 ,3 ,4 ,5 ]
Singh, Harminder [1 ,4 ,5 ]
Decker, Kathleen [1 ,2 ,5 ]
机构
[1] CancerCare Manitoba Res Inst, 675 McDermot Ave, Winnipeg, MB R3E 0V9, Canada
[2] CancerCare Manitoba, Dept Epidemiol & Canc Registry, 675 McDermot Ave, Winnipeg, MB R3E 0V9, Canada
[3] CancerCare Manitoba, Dept Med Oncol, 675 McDermot Ave, Winnipeg, MB R3E 0V9, Canada
[4] Univ Manitoba, Dept Internal Med, 820 Sherbrook St, Winnipeg, MB R3A 1R9, Canada
[5] Univ Manitoba, Dept Community Hlth Sci, 750 Bannatyne Ave, Winnipeg, MB R3E 0W3, Canada
关键词
Breast cancer; Colorectal cancer; Recurrence; Algorithms; Validation studies; Canada; DISEASE; MODELS; CARE;
D O I
10.1186/s12885-021-08526-9
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background Algorithms that use administrative health and electronic medical record (EMR) data to determine cancer recurrence have the potential to replace chart reviews. This study evaluated algorithms to determine breast and colorectal cancer recurrence in a Canadian province with a universal health care system. Methods Individuals diagnosed with stage I-III breast or colorectal cancer diagnosed from 2004 to 2012 in Manitoba, Canada were included. Pre-specified and conditional inference tree algorithms using administrative health and structured EMR data were developed. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) correct classification, and scaled Brier scores were measured. Results The weighted pre-specified variable algorithm for the breast cancer validation cohort (N = 1181, 167 recurrences) demonstrated 81.1% sensitivity, 93.2% specificity, 61.4% PPV, 97.4% NPV, 91.8% correct classification, and scaled Brier score of 0.21. The weighted conditional inference tree algorithm demonstrated 68.5% sensitivity, 97.0% specificity, 75.4% PPV, 95.8% NPV, 93.6% correct classification, and scaled Brier score of 0.39. The weighted pre-specified variable algorithm for the colorectal validation cohort (N = 693, 136 recurrences) demonstrated 77.7% sensitivity, 92.8% specificity, 70.7% PPV, 94.9% NPV, 90.1% correct classification, and scaled Brier score of 0.33. The conditional inference tree algorithm demonstrated 62.6% sensitivity, 97.8% specificity, 86.4% PPV, 92.2% NPV, 91.4% correct classification, and scaled Brier score of 0.42. Conclusions Algorithms developed in this study using administrative health and structured EMR data to determine breast and colorectal cancer recurrence had moderate sensitivity and PPV, high specificity, NPV, and correct classification, but low accuracy. The accuracy is similar to other algorithms developed to classify recurrence only (i.e., distinguished from second primary) and inferior to algorithms that do not make this distinction. The accuracy of algorithms for determining cancer recurrence only must improve before replacing chart reviews.
引用
收藏
页数:10
相关论文
共 31 条
  • [1] Pregnancy outcome in women before and after cervical conisation: population based cohort study
    Albrechtsen, Susanne
    Rasmussen, Svein
    Thoresen, Steinar
    Irgens, Lorentz M.
    Iversen, Ole Erik
    [J]. BRITISH MEDICAL JOURNAL, 2008, 337 (7673): : 803 - 805
  • [2] Age, Breast Cancer Subtype Approximation, and Local Recurrence After Breast-Conserving Therapy
    Arvold, Nils D.
    Taghian, Alphonse G.
    Niemierko, Andrzej
    Raad, Rita F. Abi
    Sreedhara, Meera
    Nguyen, Paul L.
    Bellon, Jennifer R.
    Wong, Julia S.
    Smith, Barbara L.
    Harris, Jay R.
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2011, 29 (29) : 3885 - 3891
  • [3] Banerjee Imon, 2019, JCO Clin Cancer Inform, V3, P1, DOI 10.1200/CCI.19.00034
  • [4] Validation in Alberta of an administrative data algorithm to identify cancer recurrence
    Cairncross, Z. F.
    Nelson, G.
    Shack, L.
    Metcalfe, A.
    [J]. CURRENT ONCOLOGY, 2020, 27 (03) : E343 - E346
  • [5] Using Natural Language Processing to Improve Efficiency of Manual Chart Abstraction in Research: The Case of Breast Cancer Recurrence
    Carrell, David S.
    Halgrim, Scott
    Diem-Thy Tran
    Buist, Diana S. M.
    Chubak, Jessica
    Chapman, Wendy W.
    Savova, Guergana
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2014, 179 (06) : 749 - 758
  • [6] MODEL UNCERTAINTY, DATA MINING AND STATISTICAL-INFERENCE
    CHATFIELD, C
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1995, 158 : 419 - 466
  • [7] Administrative Data Algorithms to Identify Second Breast Cancer Events Following Early-Stage Invasive Breast Cancer
    Chubak, Jessica
    Yu, Onchee
    Pocobelli, Gaia
    Lamerato, Lois
    Webster, Joe
    Prout, Marianne N.
    Yood, Marianne Ulcickas
    Barlow, William E.
    Buist, Diana S. M.
    [J]. JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2012, 104 (12): : 931 - 940
  • [9] Harrell FE, 1996, STAT MED, V15, P361, DOI 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO
  • [10] 2-4