Incorporating Breast Cancer Recurrence Events Into Population-Based Cancer Registries Using Medical Claims: Cohort Study

被引:10
作者
A'mar, Teresa [1 ]
Beatty, J. David [2 ]
Fedorenko, Catherine [1 ]
Markowitz, Daniel [2 ]
Corey, Thomas [1 ]
Lange, Jane [1 ]
Schwartz, Stephen M. [1 ]
Huang, Bin [3 ]
Chubak, Jessica [4 ]
Etzioni, Ruth [1 ]
机构
[1] Fred Hutchinson Canc Res Ctr, Publ Hlth Sci, 1100 Fairview Ave North, Seattle, WA 98104 USA
[2] Swedish Canc Inst, Seattle, WA USA
[3] Univ Kentucky, Coll Med, Lexington, KY USA
[4] Kaiser Permanente, Washington Hlth Res Inst, Seattle, WA USA
基金
美国国家卫生研究院;
关键词
cancer registries; medical claims; cancer recurrence event; statistical learning; breast cancer; medical informatics; data mining;
D O I
10.2196/18143
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: There is a need for automated approaches to incorporate information on cancer recurrence events into population-based cancer registries. Objective: The aim of this study is to determine the accuracy of a novel data mining algorithm to extract information from linked registry and medical claims data on the occurrence and timing of second breast cancer events (SBCE). Methods: We used supervised data from 3092 stage I and II breast cancer cases (with 394 recurrences), diagnosed between 1993 and 2006 inclusive, of patients at Kaiser Permanente Washington and cases in the Puget Sound Cancer Surveillance System. Our goal was to classify each month after primary treatment as pre- versus post-SBCE. The prediction feature set for a given month consisted of registry variables on disease and patient characteristics related to the primary breast cancer event, as well as features based on monthly counts of diagnosis and procedure codes for the current, prior, and future months. A month was classified as post-SBCE if the predicted probability exceeded a probability threshold (PT); the predicted time of the SBCE was taken to be the month of maximum increase in the predicted probability between adjacent months. Results: The Kaplan-Meier net probability of SBCE was 0.25 at 14 years. The month-level receiver operating characteristic curve on test data (20% of the data set) had an area under the curve of 0.986. The person-level predictions (at a monthly PT of 0.5) had a sensitivity of 0.89, a specificity of 0.98, a positive predictive value of 0.85, and a negative predictive value of 0.98. The corresponding median difference between the observed and predicted months of recurrence was 0 and the mean difference was 0.04 months. Conclusions: Data mining of medical claims holds promise for the streamlining of cancer registry operations to feasibly collect information about second breast cancer events.
引用
收藏
页数:10
相关论文
共 9 条
[1]   Performance of Cancer Recurrence Algorithms After Coding Scheme Switch From International Classification of Diseases 9th Revision to International Classification of Diseases 10th Revision [J].
Carroll, Nikki M. ;
Ritzwoller, Debra P. ;
Banegas, Matthew P. ;
O'Keeffe-Rosetti, Maureen ;
Cronin, Angel M. ;
Uno, Hajime ;
Hornbrook, Mark C. ;
Hassett, Michael J. .
JCO CLINICAL CANCER INFORMATICS, 2019, 3 :1-9
[2]  
Chen T, 2019, XGBOOSTXTREME GRADIE
[3]   Administrative Data Algorithms to Identify Second Breast Cancer Events Following Early-Stage Invasive Breast Cancer [J].
Chubak, Jessica ;
Yu, Onchee ;
Pocobelli, Gaia ;
Lamerato, Lois ;
Webster, Joe ;
Prout, Marianne N. ;
Yood, Marianne Ulcickas ;
Barlow, William E. ;
Buist, Diana S. M. .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2012, 104 (12) :931-940
[4]   Tradeoffs between accuracy measures for electronic health care data algorithms [J].
Chubak, Jessica ;
Pocobelli, Gaia ;
Weiss, Noel S. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2012, 65 (03) :343-349
[5]  
Earle CC, 2002, MED CARE, V40, P75
[6]   Validating Billing/Encounter Codes as Indicators of Lung, Colorectal, Breast, and Prostate Cancer Recurrence Using 2 Large Contemporary Cohorts [J].
Hassett, Michael J. ;
Ritzwoller, Debra P. ;
Taback, Nathan ;
Carroll, Nikki ;
Cronin, Angel M. ;
Ting, Gladys V. ;
Schrag, Deb ;
Warren, Joan L. ;
Hornbrook, Mark C. ;
Weeks, Jane C. .
MEDICAL CARE, 2014, 52 (10) :E65-E73
[7]   Measuring disease-free survival and cancer relapse using medicare claims from CALGB breast cancer trial participants (companion to 9344) [J].
Lamont, Elizabeth B. ;
Herndon, James E., II ;
Weeks, Jane C. ;
Henderson, I. Craig ;
Earle, Craig C. ;
Schilsky, Richard L. ;
Christakis, Nicholas A. .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2006, 98 (18) :1335-1338
[8]   Development, Validation, and Dissemination of a Breast Cancer Recurrence Detection and Timing Informatics Algorithm [J].
Ritzwoller, Debra P. ;
Hassett, Michael J. ;
Uno, Hajime ;
Cronin, Angel M. ;
Carroll, Nikki M. ;
Hornbrook, Mark C. ;
Kushi, Lawrence C. .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2018, 110 (03) :273-281
[9]   Challenges and Opportunities in Measuring Cancer Recurrence in the United States [J].
Warren, Joan L. ;
Yabroff, K. Robin .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2015, 107 (08)