Multi-task learning to incorporate clinical knowledge into deep learning for breast cancer diagnosis

Cited by: 0
Authors
Nebbia, Giacomo [1]
Arefan, Dooman [2]
Zuley, Margarita [2]
Sumkin, Jules [2]
Wu, Shandong [1, 2, 3, 4]
Affiliations
[1] Univ Pittsburgh, Intelligent Syst Program, 3240 Craft Pl, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Dept Radiol, 3240 Craft Pl, Pittsburgh, PA 15213 USA
[3] Univ Pittsburgh, Dept Biomed Informat, 3240 Craft Pl, Pittsburgh, PA 15213 USA
[4] Univ Pittsburgh, Dept Bioengn, 3240 Craft Pl, Pittsburgh, PA 15213 USA
Source
MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS | 2021 / Vol. 11597
Funding
National Institutes of Health (USA); National Science Foundation (USA);
Keywords
Breast cancer; mammography; deep learning; clinical knowledge; multi-task learning;
DOI
10.1117/12.2582285
Chinese Library Classification
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Deep learning models are traditionally trained purely in a data-driven approach; the information for the model training usually only comes from a single source of the training data. In this work, we investigate how to supply additional clinical knowledge that is associated with the training data. Our goal is to train deep learning models for breast cancer diagnosis using mammogram images. Along with the main classification task between clinically proven cancer vs negative/benign cases, we design two auxiliary tasks, each capturing a form of additional knowledge to facilitate the main task. Specifically, one auxiliary task is to classify images according to the radiologist-made BI-RADS diagnosis scores and the other auxiliary task is to classify images in terms of the BI-RADS breast density categories. We customize a Multi-Task Learning model to jointly perform the three tasks (main task and two auxiliary tasks). We test four deep learning architectures: CBR-Tiny, ResNet18, GoogleNet, and DenseNet, and we investigate the benefit of incorporating such knowledge over ImageNet pre-trained models and in the case of randomly initialized models. We run experiments on an internal dataset consisting of screening full field digital mammography images for a total of 1,380 images (341 cancer and 1,039 negative or benign). Our results show that, by adding clinical knowledge conveyed through the two auxiliary tasks to the training process, we can improve the performance of the target task of breast cancer diagnosis, thus highlighting the benefit of incorporating clinical knowledge into data-driven learning to enhance deep learning model training.
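A joint objective of the kind the abstract describes (one main task plus two auxiliary tasks) is often written as a weighted sum of per-task cross-entropy losses. The sketch below illustrates that idea only; the abstract does not state the loss, the task weights, or the number of BI-RADS score classes used, so `TASK_WEIGHTS`, the task names, and the three-class score head are hypothetical. The four-class density head follows the four BI-RADS density categories.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of raw scores."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, label):
    """Negative log-probability assigned to the true class index."""
    return -math.log(softmax(logits)[label])

def multitask_loss(outputs, labels, weights):
    """Weighted sum of per-task cross-entropy losses for one image."""
    return sum(
        weights[task] * cross_entropy(outputs[task], labels[task])
        for task in weights
    )

# Hypothetical weights: the main task dominates, auxiliary tasks regularize.
TASK_WEIGHTS = {"diagnosis": 1.0, "birads_score": 0.5, "density": 0.5}

# One image, with each task head emitting uniform (all-zero) logits.
example_outputs = {
    "diagnosis": [0.0, 0.0],             # cancer vs negative/benign
    "birads_score": [0.0, 0.0, 0.0],     # assumed 3 score groups
    "density": [0.0, 0.0, 0.0, 0.0],     # 4 BI-RADS density categories
}
example_labels = {"diagnosis": 1, "birads_score": 2, "density": 0}

loss = multitask_loss(example_outputs, example_labels, TASK_WEIGHTS)
```

With uniform logits each head contributes the log of its class count, so the result is easy to check by hand. In a real model the three heads would share one backbone, and the auxiliary weights would be tuned on validation data.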
Pages: 6