Semi-supervised training using cooperative labeling of weakly annotated data for nodule detection in chest CT

被引:4
作者
Maynord, Michael [1 ,2 ]
Farhangi, M. Mehdi [2 ]
Fermuller, Cornelia [3 ]
Aloimonos, Yiannis [1 ]
Levine, Gary [4 ]
Petrick, Nicholas [2 ]
Sahiner, Berkman [2 ]
Pezeshk, Aria [2 ,5 ]
机构
[1] Univ Maryland, Dept Comp Sci, Iribe Ctr Comp Sci & Engn, College Pk, MD 20742 USA
[2] FDA, Div Imaging Diagnost & Software Reliabil DIDSR, OSEL, CDRH, Silver Spring, MD 20993 USA
[3] Univ Maryland, Inst Adv Comp Studies, Iribe Ctr Comp Sci & Engn, College Pk, MD 20742 USA
[4] FDA, Div Radiol Imaging Devices & Elect Prod, CDRH, Silver Spring, MD USA
[5] Plato Syst, San Mateo, CA USA
关键词
computer aided detection; pulmonary nodules; semi-supervised learning; FALSE-POSITIVE REDUCTION; LUNG NODULES; AUTOMATIC DETECTION; IMAGES;
D O I
10.1002/mp.16219
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
PurposeMachine learning algorithms are best trained with large quantities of accurately annotated samples. While natural scene images can often be labeled relatively cheaply and at large scale, obtaining accurate annotations for medical images is both time consuming and expensive. In this study, we propose a cooperative labeling method that allows us to make use of weakly annotated medical imaging data for the training of a machine learning algorithm. As most clinically produced data are weakly-annotated - produced for use by humans rather than machines and lacking information machine learning depends upon - this approach allows us to incorporate a wider range of clinical data and thereby increase the training set size. MethodsOur pseudo-labeling method consists of multiple stages. In the first stage, a previously established network is trained using a limited number of samples with high-quality expert-produced annotations. This network is used to generate annotations for a separate larger dataset that contains only weakly annotated scans. In the second stage, by cross-checking the two types of annotations against each other, we obtain higher-fidelity annotations. In the third stage, we extract training data from the weakly annotated scans, and combine it with the fully annotated data, producing a larger training dataset. We use this larger dataset to develop a computer-aided detection (CADe) system for nodule detection in chest CT. ResultsWe evaluated the proposed approach by presenting the network with different numbers of expert-annotated scans in training and then testing the CADe using an independent expert-annotated dataset. We demonstrate that when availability of expert annotations is severely limited, the inclusion of weakly-labeled data leads to a 5% improvement in the competitive performance metric (CPM), defined as the average of sensitivities at different false-positive rates. ConclusionsOur proposed approach can effectively merge a weakly-annotated dataset with a small, well-annotated dataset for algorithm training. This approach can help enlarge limited training data by leveraging the large amount of weakly labeled data typically generated in clinical image interpretation.
引用
收藏
页码:4255 / 4268
页数:14
相关论文
共 26 条
  • [11] Computer-aided, detection of lung nodules: False positive reduction using a 3D gradient field method and 3D ellipsoid fitting
    Ge, ZY
    Sahiner, B
    Chan, HP
    Hadjiiski, LM
    Cascade, PN
    Bogot, N
    Kazerooni, EA
    Wei, J
    Zhou, CA
    [J]. MEDICAL PHYSICS, 2005, 32 (08) : 2443 - 2454
  • [12] High performance lung nodule detection schemes in CT using local and global information
    Guo, Wei
    Li, Qiang
    [J]. MEDICAL PHYSICS, 2012, 39 (08) : 5157 - 5168
  • [13] Automatic lung segmentation in routine imaging is primarily a data diversity problem, not a methodology problem
    Hofmanninger, Johannes
    Prayer, Forian
    Pan, Jeanny
    Roehrich, Sebastian
    Prosch, Helmut
    Langs, Georg
    [J]. EUROPEAN RADIOLOGY EXPERIMENTAL, 2020, 4 (01)
  • [14] A deep 3D residual CNN for false-positive reduction in pulmonary nodule detection
    Jin, Hongsheng
    Li, Zongyao
    Tong, Ruofeng
    Lin, Lanfen
    [J]. MEDICAL PHYSICS, 2018, 45 (05) : 2097 - 2107
  • [15] Integrating Lung Parenchyma Segmentation and Nodule Detection With Deep Multi-Task Learning
    Liu, Weihua
    Liu, Xiabi
    Li, Huiyu
    Li, Mincan
    Zhao, Xinming
    Zhu, Zheng
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (08) : 3073 - 3081
  • [16] Survey on deep learning for pulmonary medical imaging
    Ma, Jiechao
    Song, Yang
    Tian, Xi
    Hua, Yiting
    Zhang, Rongguo
    Wu, Jianlin
    [J]. FRONTIERS OF MEDICINE, 2020, 14 (04) : 450 - 469
  • [17] 3-D Convolutional Neural Networks for Automatic Detection of Pulmonary Nodules in Chest CT
    Pezeshk, Aria
    Hamidian, Sardar
    Petrick, Nicholas
    Sahiner, Berkman
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (05) : 2080 - 2090
  • [18] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [19] Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: The LUNA16 challenge
    Setio, Arnaud Arindra Adiyoso
    Traverso, Alberto
    de Bel, Thomas
    Berens, Moira S. N.
    van den Bogaard, Cas
    Cerello, Piergiorgio
    Chen, Hao
    Dou, Qi
    Evelina Fantacci, Maria
    Geurts, Bram
    van der Gugten, Robbert
    Heng, Pheng Ann
    Jansen, Bart
    de Kaste, Michael M. J.
    Kotov, Valentin
    Lin, Jack Yu-Hung
    Manders, Jeroen T. M. C.
    Sonora-Mengana, Alexander
    Carlos Garcia-Naranjo, Juan
    Papavasileiou, Evgenia
    Prokop, Mathias
    Saletta, Marco
    Schaefer-Prokop, Cornelia M.
    Scholten, Ernst T.
    Scholten, Luuk
    Snoeren, Miranda M.
    Lopez Torres, Ernesto
    Vandemeulebroucke, Jef
    Walasek, Nicole
    Zuidhof, Guido C. A.
    van Ginneken, Bram
    Jacobs, Colin
    [J]. MEDICAL IMAGE ANALYSIS, 2017, 42 : 1 - 13
  • [20] Pulmonary Nodule Detection in CT Images: False Positive Reduction Using Multi-View Convolutional Networks
    Setio, Arnaud Arindra Adiyoso
    Ciompi, Francesco
    Litjens, Geert
    Gerke, Paul
    Jacobs, Colin
    van Riel, Sarah J.
    Wille, Mathilde Marie Winkler
    Naqibullah, Matiullah
    Sanchez, Clara I.
    van Ginneken, Bram
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) : 1160 - 1169