An Evaluation of Consensus Techniques for Diagnostic Interpretation

被引:0
|
作者
Sauter, Jake N. [1 ]
LaBarre, Victoria M. [2 ]
Furst, Jacob D. [3 ]
Raicu, Daniela S. [3 ]
机构
[1] SUNY Coll Oswego, 7060 Route 104, Oswego, NY USA
[2] McLennan Community Coll, 140 Coll Dr, Waco, TX USA
[3] Coll Comp & Digital Media, 243 South Wabash Ave, Chicago, IL USA
来源
MEDICAL IMAGING 2018: COMPUTER-AIDED DIAGNOSIS | 2018年 / 10575卷
基金
美国国家科学基金会;
关键词
Belief Decision Tree; LIDC; Leverage Label Variability; PULMONARY NODULES; LUNG;
D O I
10.1117/12.2293778
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Learning diagnostic labels from image content has been the standard in computer-aided diagnosis. Most computer-aided diagnosis systems use low-level image features extracted directly from image content to train and test machine learning classifiers for diagnostic label prediction. When the ground truth for the diagnostic labels is not available, reference truth is generated from the experts diagnostic interpretations of the image/region of interest. More specifically, when the label is uncertain, e.g. when multiple experts label an image and their interpretations are different, techniques to handle the label variability are necessary. In this paper, we compare three consensus techniques that are typically used to encode the variability in the experts labeling of the medical data: mean, median and mode, and their effects on simple classifiers that can handle deterministic labels (decision trees) and probabilistic vectors of labels (belief decision trees). Given that the NIH/NCI Lung Image Database Consortium (LIDC) data provides interpretations for lung nodules by up to four radiologists, we leverage the LIDC data to evaluate and compare these consensus approaches when creating computer-aided diagnosis systems for lung nodules. First, low-level image features of nodules are extracted and paired with their radiologists semantic ratings (1= most likely benign, 5 = most likely malignant); second, machine learning multi-class classifiers that handle deterministic labels (decision trees) and probabilistic vectors of labels (belief decision trees) are built to predict the lung nodules semantic ratings. We show that the mean-based consensus generates the most robust classifier overall when compared to the median- and mode-based consensus. Lastly, the results of this study show that, when building CAD systems with uncertain diagnostic interpretation, it is important to evaluate different strategies for encoding and predicting the diagnostic label.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Risk evaluation of secondary malignancies after radiotherapy of breast cancer in light of the continuous development of planning techniques
    Cilla, Savino
    Deodato, Francesco
    Romano, Carmela
    Macchia, Gabriella
    Buwenge, Milly
    Boccardi, Mariangela
    Pezzulla, Donato
    Pierro, Antonio
    Zamagni, Alice
    Morganti, Alessio Giuseppe
    MEDICAL DOSIMETRY, 2023, 48 (04) : 279 - 285
  • [32] Diagnostic Evaluation After Lung Cancer Screening in Real-World Practice More Questions Than Answers
    Iaccarino, Jonathan M.
    Wiener, Renda Soylemez
    CHEST, 2020, 157 (02) : 247 - 248
  • [33] Economic Evaluation of a Novel Lung Cancer Diagnostic in a Population of Patients With a Positive Low-Dose Computed Tomography Result
    Morris, Michael J.
    Habib, Sheila A.
    Do Valle, Maggie L.
    Schneider, John E.
    JOURNAL OF HEALTH ECONOMICS AND OUTCOMES RESEARCH, 2024, 11 (02): : 74 - 79
  • [34] Evaluation of the Respiratory Microbiome and the Use of Tracheal Lavage as a Diagnostic Tool in Kemp's Ridley Sea Turtles (Lepidochelys kempii)
    McNally, Kerry L.
    Bowen, Jennifer L.
    Brisson, Jennifer O.
    Kennedy, Adam
    Innis, Charles J.
    ANIMALS, 2021, 11 (10):
  • [35] Comparative Evaluation of 2 Different Percutaneous Techniques of Simultaneous Needle Biopsy With Microwave Ablation of Suspected Malignant Pulmonary Nodules
    Hu, Miaomiao
    Wu, Linlin
    Zhang, Xusheng
    Yuan, Qianqian
    Li, Peishun
    Yang, Sen
    Wang, Baohu
    Zhang, Kaixian
    TECHNOLOGY IN CANCER RESEARCH & TREATMENT, 2023, 22
  • [36] Comparative Evaluation of 2 Different Percutaneous Techniques of Simultaneous Needle Biopsy With Microwave Ablation of Suspected Malignant Pulmonary Nodules
    Hu, Miaomiao
    Wu, Linlin
    Zhang, Xusheng
    Yuan, Qianqian
    Li, Peishun
    Yang, Sen
    Wang, Baohu
    Zhang, Kaixian
    TECHNOLOGY IN CANCER RESEARCH & TREATMENT, 2023, 22
  • [37] MR imaging of lung parenchyma at 0.2 T: evaluation of imaging techniques, comparative study with chest radiography and interobserver analysis
    Abolmaali, ND
    Schmitt, J
    Krauss, S
    Bretz, F
    Deimling, M
    Jacobi, V
    Vogl, TJ
    EUROPEAN RADIOLOGY, 2004, 14 (04) : 703 - 708
  • [38] Comparative evaluation of non-contrast CAIPIRINHA-VIBE 3T-MRI and multidetector CT for detection of pulmonary nodules: In vivo evaluation of diagnostic accuracy and image quality
    Dewes, Patricia
    Frellesen, Claudia
    Al-Butmeh, Firas
    Albrecht, Moritz H.
    Scholtz, Jan-Erik
    Metzger, Sarah C.
    Lehnert, Thomas
    Vogl, Thomas J.
    Wichmann, Julian L.
    EUROPEAN JOURNAL OF RADIOLOGY, 2016, 85 (01) : 193 - 198
  • [39] A comparison of the diagnostic value of 19-gauge histology and 22-gauge cytology needles in bronchoscopy for the evaluation of endobronchial lesions
    Ucar, Elif Yilmazel
    Meral, Mehmet
    Akgun, Metin
    Sipal, Sare
    Kaynar, Hasan
    Saglam, Leyla
    Gorguner, Ali Metin
    TURKISH JOURNAL OF MEDICAL SCIENCES, 2011, 41 (03) : 475 - 481
  • [40] Computer-assisted detection of pulmonary nodules:: performance evaluation of an expert knowledge-based detection system in consensus reading with experienced and inexperienced chest radiologists
    Marten, K
    Seyfarth, T
    Auer, F
    Wiener, E
    Grillhösl, A
    Obenauer, S
    Rummeny, EJ
    Engelke, C
    EUROPEAN RADIOLOGY, 2004, 14 (10) : 1930 - 1938