Benign-malignant classification of pulmonary nodules by low-dose spiral computerized tomography and clinical data with machine learning in opportunistic screening

被引:6
|
作者
Zheng, Yansong [1 ,2 ]
Dong, Jing [3 ]
Yang, Xue [1 ,2 ]
Shuai, Ping [4 ]
Li, Yongli [5 ]
Li, Hailin [6 ,7 ]
Dong, Shengyong [1 ,2 ]
Gong, Yan [1 ,2 ]
Liu, Miao [8 ,11 ]
Zeng, Qiang [1 ,2 ,9 ,10 ]
机构
[1] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 2, Dept Hlth Med, Beijing, Peoples R China
[2] Chinese Peoples Liberat Army Gen Hosp, Natl Clin Res Ctr Geriatr Dis, Beijing, Peoples R China
[3] Chinese Peoples Liberat Army Gen Hosp, Res Med Big Data Ctr & Natl Engn Lab Med Big Data, Beijing, Peoples R China
[4] Univ Elect Sci & Technol China, Sichuan Prov Peoples Hosp, Hlth Management Ctr, Chengdu, Peoples R China
[5] Zhengzhou Univ, Dept Hlth Management, Henan Key Lab Chron Dis Management, Henan Prov Peoples Hosp, Zhengzhou, Peoples R China
[6] Beihang Univ, Beijing Adv Innovat Ctr Big Data Based Precis Med, Sch Med & Engn, Beijing, Peoples R China
[7] CAS Key Lab Mol Imaging, Inst Automat, Beijing, Peoples R China
[8] Chinese Peoples Liberat Army Gen Hosp, Grad Sch, Beijing, Peoples R China
[9] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 2, Dept Hlth Med, Beijing 100853, Peoples R China
[10] Chinese Peoples Liberat Army Gen Hosp, Natl Clin Res Ctr Geriatr Dis, Beijing 100853, Peoples R China
[11] Chinese Peoples Liberat Army Gen Hosp, Grad Sch, Beijing 100853, Peoples R China
来源
CANCER MEDICINE | 2023年 / 12卷 / 11期
基金
国家重点研发计划;
关键词
cancer screening; health examination; low-dose computed tomography; lung cancer; opportunistic screening; pulmonary nodules; LUNG-CANCER; CT; RISK; ADENOCARCINOMA; POPULATION; PREDICTION; MORTALITY; HARMS;
D O I
10.1002/cam4.5886
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: Many people were found with pulmonary nodules during physical examinations. It is of great practical significance to discriminate benign and malignant nodules by using data mining technology.Methods: The subjects' demographic data, baseline examination results, and annual follow-up low-dose spiral computerized tomography (LDCT) results were recorded. The findings from annual physical examinations of positive nodules, including highly suspicious nodules and clinically tentative benign nodules, was analyzed. The extreme gradient boosting (XGBoost) model was constructed and the Grid Search CV method was used to select the super parameters. External unit data were used as an external validation set to evaluate the generalization performance of the model.Results: A total of 135,503 physical examinees were enrolled. Baseline testing found that 27,636 (20.40%) participants had clinically tentative benign nodules and 611 (0.45%) participants had highly suspicious nodules. The proportion of highly suspicious nodules in participants with negative baseline was about 0.12%-0.46%, which was lower than the baseline level except the follow-up of >5 years. In the 27,636 participants with clinically tentative benign nodules, only in the first year of LDCT re-examination was the proportion of highly suspicious nodules (1.40%) significantly greater than that of baseline screening (0.45%) (p < 0.001), and the proportion of highly suspicious nodules was not different between the baseline screening and other follow-up years (p > 0.05). Furthermore, 322 cases with benign nodules and 196 patients with malignant nodules confirmed by surgery and pathology were compared. A model and the top 15 most important clinical variables were determined by XGBoost algorithm. The area under the curve (AUC) of the model was 0.76 [95% CI: 0.67-0.84], and the accuracy was 0.75. The sensitivity and specificity of the model under this threshold were 0.78 and 0.73, respectively. In the validation of model using external data, the AUC was 0.87 and the accuracy was 0.80. The sensitivity and specificity were 0.83 and 0.77, respectively.Conclusions: It is important that pulmonary nodules could be more accurately identified at the first LDCT examination. A model with 15 variables which are routinely measured in the clinic could be helpful to distinguish benign and malignant nodules. It could help the radiological team issue a more accurate report; and it may guide the clinical team regarding LDCT follow-up.
引用
收藏
页码:12050 / 12064
页数:15
相关论文
共 35 条
  • [31] Impact of Voxel Normalization on a Machine Learning-Based Method: A Study on Pulmonary Nodule Malignancy Diagnosis Using Low-Dose Computed Tomography (LDCT)
    Hsiao, Chia-Chi
    Peng, Chen-Hao
    Wu, Fu-Zong
    Cheng, Da-Chuan
    DIAGNOSTICS, 2023, 13 (24)
  • [32] Development of machine learning model to predict pulmonary function with low-dose CT-derived parameter response mapping in a community-based chest screening cohort
    Zhou, Xiuxiu
    Pu, Yu
    Zhang, Di
    Guan, Yu
    Lu, Yang
    Zhang, Weidong
    Fu, Chi-Cheng
    Fang, Qu
    Zhang, Hanxiao
    Liu, Shiyuan
    Fan, Li
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2023, 24 (11):
  • [33] Value of circulating tumor cell assisting low-dose computed tomography in screening pulmonary nodules based on existing liquid biopsy techniques: a systematic review with meta-analysis and trial sequential analysis
    Wang, Yixian
    Duan, Yuqing
    Guo, Dingjie
    Lv, Hongbo
    Li, Qiong
    Liu, Xuan
    Qiao, Na
    Meng, Hengyu
    Zhang, Xin
    Lan, Linwei
    Liu, Xiumin
    Liu, Xin
    CLINICAL & TRANSLATIONAL ONCOLOGY, 2024, 26 (12): : 3252 - 3263
  • [34] The solid component within part-solid nodules: 3-dimensional quantification, correlation with the malignant grade of nonmucinous pulmonary adenocarcinomas, and comparisons with 2-dimentional measures and semantic features in low-dose computed tomography
    Liu, Jieke
    Xie, Chaolian
    Li, Yong
    Xu, Hao
    He, Changjiu
    Qing, Haomiao
    Zhou, Peng
    CANCER IMAGING, 2023, 23 (01)
  • [35] The solid component within part-solid nodules: 3-dimensional quantification, correlation with the malignant grade of nonmucinous pulmonary adenocarcinomas, and comparisons with 2-dimentional measures and semantic features in low-dose computed tomography
    Jieke Liu
    Chaolian Xie
    Yong Li
    Hao Xu
    Changjiu He
    Haomiao Qing
    Peng Zhou
    Cancer Imaging, 23