Next-Generation Sequencing-Based Testing Among Patients With Advanced or Metastatic Nonsquamous Non-Small Cell Lung Cancer in the United States: Predictive Modeling Using Machine Learning Methods

被引:0
作者
Brnabic, Alan James Michael [1 ]
Lipkovich, Ilya [2 ]
Kadziola, Zbigniew [2 ]
He, Dan [3 ]
Krein, Peter M. [2 ]
Hess, Lisa M. [2 ]
机构
[1] Eli Lilly & Co, Sydney, Australia
[2] Eli Lilly & Co, Indianapolis, IN 46285 USA
[3] Syneos Hlth, Morrisville, NC USA
关键词
lung cancer; NGS testing; next-generation sequencing; real-world data; machine learning; biomarkers; predictive modeling; artificial intelligence; treatment guidelines; tumor biomarker; oncology; BIOMARKER; NSCLC;
D O I
10.2196/64399
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: Next-generation sequencing (NGS) has become a cornerstone of treatment for lung cancer and is recommended in current treatment guidelines for patients with advanced or metastatic disease. Objective: This study was designed to use machine learning methods to determine demographic and clinical characteristics of patients with advanced or metastatic non-small cell lung cancer (NSCLC) that may predict likelihood of receiving NGS-based testing (ever vs never NGS-tested) as well as likelihood of timing of testing (early vs late NGS-tested). Methods: Deidentified patient-level data were analyzed in this study from a real-world cohort of patients with advanced or metastatic NSCLC in the United States. Patients with nonsquamous disease, who received systemic therapy for NSCLC, and had at least 3 months of follow-up data for analysis were included in this study. Three strategies, logistic regression models, penalized logistic regression using least absolute shrinkage and selection operator penalty, and extreme gradient boosting with classification trees as base learners, were used to identify predictors of ever versus never and early versus late NGS testing. Data were split into D1(training+validation; 80%) and D2 (testing; 20%) sets; the 3 strategies were evaluated by comparing their performance on multiple m=1000 splits in the training (70%) and validation data (30%) within the D1 set. The final model was selected by evaluating performance using the area under the receiver operating curve while taking into account considerations of simplicity and clinical interpretability. Performance was re-estimated using the test data D2. Results: A total of 13,425 met the criteria for the ever NGS-tested, and 17,982 were included in the never NGS-tested group. Performance metrics showed the area under the receiver operating curve evaluated from validation data was similar across all models (77%-84%). Among those in the ever NGS-tested group, 84.08% (n=11,289) were early NGS-tested, and 15.91% (n=2136) late NGS-tested. Factors associated with both ever having NGS testing as well as early NGS testing included later year of NSCLC diagnosis, no smoking history, and evidence of programmed death ligand 1 testing (all P<.05). Factors associated with a greater chance of never receiving NGS testing included older age, lower performance status, Black race, higher number of single-gene tests, public insurance, and treatment in a geography with Molecular Diagnostics Services Program adoption (all P<.05). Conclusions:Predictors of ever versus never as well as early versus late NGS testing in the setting of advanced or metastatic NSCLC were consistent across machine learning methods in this study, demonstrating the ability of these models to identify factors that may predict NGS-based testing. There is a need to ensure that patients regardless of age, race, insurance status, and geography (factors associated with lower odds of receiving NGS testing in this study) are provided with equitable access to NGS-based testing.
引用
收藏
页数:31
相关论文
共 28 条
[1]  
[Anonymous], 2024, Definition of human subjects research
[2]  
[Anonymous], 2023, NCCN clinical practice guidelines in oncology: breast cancer. Version 5.2023
[3]  
[Anonymous], 2022, Oncology (cancer)/hematologic malignancies approval notification
[4]  
[Anonymous], 2022, NCCN Clinical Practice Guidelines in Oncology (NCCN Guidelines), B-cell Lymphomas, Version 5
[5]  
[Anonymous], Cancer Stat Facts: Lung and Bronchus Cancer
[6]  
[Anonymous], 2022, MolDX Program (Administered by Palmetto GBA)
[7]   Identifying barriers to equitable biomarker testing in underserved patients with NSCLC: A mixed-methods study to inform quality improvement opportunities [J].
Boehmer, Leigh ;
Roy, Upal Kunal Basu ;
Schrag, Janelle ;
Martin, Nikki A. ;
Salinas, Gregory D. ;
Coleman, Brandon ;
Howson, Alexandra ;
Shivakumar, Latha .
JOURNAL OF CLINICAL ONCOLOGY, 2021, 39 (28)
[8]   Biomarker Testing, Targeted Therapy and Clinical Trial Participation by Race Among Patients With Lung Cancer: A Real-World Medicaid Database Study [J].
Bruno, Debora S. ;
Li, Xiaohong ;
Hess, Lisa M. .
JTO CLINICAL AND RESEARCH REPORTS, 2024, 5 (03)
[9]   Disparities in Biomarker Testing and Clinical Trial Enrollment Among Patients With Lung, Breast, or Colorectal Cancers in the United States [J].
Bruno, Debora S. ;
Hess, Lisa M. ;
Li, Xiaohong ;
Su, Eric Wen ;
Patel, Monaliben .
JCO PRECISION ONCOLOGY, 2022, 6
[10]   Implementation Challenges and Disparities in Molecular Testing for Patients With Stage IV NSCLC: Perspectives from an Urban Safety-Net Hospital [J].
Burns, Laura ;
Jani, Chinmay ;
Radwan, Amr ;
Al Omari, Omar ;
Patel, Mohini ;
Oxnard, Geoffrey R. ;
Tapan, Umit .
CLINICAL LUNG CANCER, 2023, 24 (02) :e69-e77