Lung Cancer Risk Prediction Model Trained with Multi-source Data

被引:0
作者
Sun, Shijie [1 ]
Liu, Hanyue [1 ]
Wang, Ye [1 ]
Yu, Hong [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
来源
ROUGH SETS, PT II, IJCRS 2024 | 2024年 / 14840卷
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
lung cancer; risk prediction; multi-source data; cross-modal knowledge distillation;
D O I
10.1007/978-3-031-65668-2_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research about lung cancer risk prediction model require the data for predicting as same as the data for training whether based on single-source data or multi-source data. Both of them either cannot fully use collected multi-source data to train model or need higher data cost to predict. If the model is trained by gathered multi-source data, but still make prediction by single-source data, the cost of the patients will be avoid increasing. In this work, the cross-modal knowledge distillation technique is introduced to train the lung cancer risk prediction model for the purpose. However, present cross-modal knowledge distillation techniques are incapable of dealing with different biases in data sources. To solve this problem, the model performs features extraction on a sample from multiple perspectives. For validating the efficacy, the proposed model is evaluated with eight baselines on the NLST dataset, which includes CT image data as well as questionnaire data. In terms of AUC, the results demonstrate that the proposed model outperforms the vanilla MLP by 10.88% and the best baseline by 2.71%. The proposed model may effectively exploit history data, ensuring not only the accuracy of prediction but also lowering the user's expenditure for data.
引用
收藏
页码:280 / 294
页数:15
相关论文
共 32 条
[1]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[2]   End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography [J].
Ardila, Diego ;
Kiraly, Atilla P. ;
Bharadwaj, Sujeeth ;
Choi, Bokyung ;
Reicher, Joshua J. ;
Peng, Lily ;
Tse, Daniel ;
Etemadi, Mozziyar ;
Ye, Wenxing ;
Corrado, Greg ;
Naidich, David P. ;
Shetty, Shravya .
NATURE MEDICINE, 2019, 25 (06) :954-+
[3]  
Arik SO, 2021, AAAI CONF ARTIF INTE, V35, P6679
[4]   Variations in lung cancer risk among smokers [J].
Bach, PB ;
Kattan, MW ;
Thornquist, MD ;
Kris, MG ;
Tate, RC ;
Barnett, MJ ;
Hsieh, LJ ;
Begg, CB .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2003, 95 (06) :470-478
[5]   EMM-LC Fusion: Enhanced Multimodal Fusion for Lung Cancer Classification [J].
Barrett, James ;
Viana, Thiago .
AI, 2022, 3 (03) :659-682
[6]  
Buckeye A., 2017, Data science bowl 2017
[7]  
Chen T., 2015, R package version 0.4-2, P1, DOI DOI 10.1145/2939672.2939785
[8]   LUCAS: LUng CAncer Screening with Multimodal Biomarkers [J].
Daza, Laura ;
Castillo, Angela ;
Escobar, Maria ;
Valencia, Sergio ;
Pinzon, Bibiana ;
Arbelaez, Pablo .
MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT AND CLINICAL IMAGE-BASED PROCEDURES, ML-CDS 2020, CLIP 2020, 2020, 12445 :115-124
[9]   The National Lung Screening Trial: Overview and Study Design [J].
Gatsonis, Constantine A. .
RADIOLOGY, 2011, 258 (01) :243-253
[10]   Knowledge Distillation: A Survey [J].
Gou, Jianping ;
Yu, Baosheng ;
Maybank, Stephen J. ;
Tao, Dacheng .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (06) :1789-1819