An Evaluation of Consensus Techniques for Diagnostic Interpretation

被引:0
|
作者
Sauter, Jake N. [1 ]
LaBarre, Victoria M. [2 ]
Furst, Jacob D. [3 ]
Raicu, Daniela S. [3 ]
机构
[1] SUNY Coll Oswego, 7060 Route 104, Oswego, NY USA
[2] McLennan Community Coll, 140 Coll Dr, Waco, TX USA
[3] Coll Comp & Digital Media, 243 South Wabash Ave, Chicago, IL USA
来源
MEDICAL IMAGING 2018: COMPUTER-AIDED DIAGNOSIS | 2018年 / 10575卷
基金
美国国家科学基金会;
关键词
Belief Decision Tree; LIDC; Leverage Label Variability; PULMONARY NODULES; LUNG;
D O I
10.1117/12.2293778
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Learning diagnostic labels from image content has been the standard in computer-aided diagnosis. Most computer-aided diagnosis systems use low-level image features extracted directly from image content to train and test machine learning classifiers for diagnostic label prediction. When the ground truth for the diagnostic labels is not available, reference truth is generated from the experts diagnostic interpretations of the image/region of interest. More specifically, when the label is uncertain, e.g. when multiple experts label an image and their interpretations are different, techniques to handle the label variability are necessary. In this paper, we compare three consensus techniques that are typically used to encode the variability in the experts labeling of the medical data: mean, median and mode, and their effects on simple classifiers that can handle deterministic labels (decision trees) and probabilistic vectors of labels (belief decision trees). Given that the NIH/NCI Lung Image Database Consortium (LIDC) data provides interpretations for lung nodules by up to four radiologists, we leverage the LIDC data to evaluate and compare these consensus approaches when creating computer-aided diagnosis systems for lung nodules. First, low-level image features of nodules are extracted and paired with their radiologists semantic ratings (1= most likely benign, 5 = most likely malignant); second, machine learning multi-class classifiers that handle deterministic labels (decision trees) and probabilistic vectors of labels (belief decision trees) are built to predict the lung nodules semantic ratings. We show that the mean-based consensus generates the most robust classifier overall when compared to the median- and mode-based consensus. Lastly, the results of this study show that, when building CAD systems with uncertain diagnostic interpretation, it is important to evaluate different strategies for encoding and predicting the diagnostic label.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Evaluation and Management of Liver Transplant Candidates With Prior Nonhepatic Cancer: Guidelines From the ILTS/SETH Consensus Conference
    Salcedo, Magdalena
    Vinaixa, Carmen
    Javle, Milind
    Trapero-Marugan, Maria
    Bustamante, Javier
    Line, Pal-Dag
    TRANSPLANTATION, 2022, 106 (01) : E3 - E11
  • [22] Evaluation of reconstruction techniques for lung single photon emission tomography: A Monte Carlo study
    Norberg, Pernilla
    Bakeb, Bjorn
    Jacobsson, Lars
    Carlsson, Gudrun Alm
    Gustafsson, Agnetha
    NUCLEAR MEDICINE COMMUNICATIONS, 2007, 28 (12) : 929 - 936
  • [23] Evaluation of the effect of double reporting on test accuracy in screening and diagnostic imaging studies: A review of the evidence
    Pow, Richard E.
    Mello-Thoms, Claudia
    Brennan, Patrick
    JOURNAL OF MEDICAL IMAGING AND RADIATION ONCOLOGY, 2016, 60 (03) : 306 - 314
  • [24] Evaluation of micro-CT for emphysema assessment in mice: comparison with non-radiological techniques
    Artaechevarria, Xabier
    Blanco, David
    de Biurrun, Gabriel
    Ceresa, Mario
    Perez-Martin, Daniel
    Bastarrika, Gorka
    de Torres, Juan P.
    Zulueta, Javier J.
    Montuenga, Luis M.
    Ortiz-de-Solorzano, Carlos
    Munoz-Barrutia, Arrate
    EUROPEAN RADIOLOGY, 2011, 21 (05) : 954 - 962
  • [25] Evaluation of clinical and laboratory investigation techniques of mammary gland tumors in the female dog: bibliographic study
    Soare, Marian
    Vlagioiu, Constantin
    ROMANIAN BIOTECHNOLOGICAL LETTERS, 2012, 17 (06): : 7796 - 7807
  • [26] Diagnostic imaging in acute interstitial pneumonia in foals: High variability of interpretation of chest radiographs and good conformity between ultrasonographic and post-mortem findings
    Punsmann, Sophia
    Hellige, Maren
    Hoppe, Judith
    Freise, Fritjof
    Venner, Monica
    VETERINARY RADIOLOGY & ULTRASOUND, 2021, 62 (04) : 490 - 497
  • [27] Impact of a bronchial genomic classifier on clinical decision making in patients undergoing diagnostic evaluation for lung cancer
    Ferguson, J. Scott
    Van Wert, Ryan
    Choi, Yoonha
    Rosenbluth, Michael J.
    Smith, Kate Porta
    Huang, Jing
    Spira, Avrum
    BMC PULMONARY MEDICINE, 2016, 16
  • [28] Evaluation of data balancing techniques in 3D CNNs for the classification of pulmonary nodules in CT images
    Barbosa Lima, Thiago Jose
    Duarte de Araiujo, Flavio Henrique
    de Carvalho Filho, Antonio Oseas
    Lira Rabelo, Ricardo de Andrade
    Souza Veras, Rodrigo de Melo
    Mathew, Mano Joseph
    2020 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2020, : 792 - 797
  • [29] Evaluation of air trapping at CT: Comparison of continuous-versus suspended-expiration CT techniques
    Lucidarme, O
    Grenier, PA
    Cadi, M
    Mourey-Gerosa, I
    Benali, K
    Cluzel, P
    RADIOLOGY, 2000, 216 (03) : 768 - 772
  • [30] Evaluation of precision of guidance techniques in image guided fine needle aspiration cytology of thoracic mass lesions
    Kalhan, Shivani
    Sharma, Pankaj
    Sharma, Sonia
    Dudani, Sharmila
    Ramakrishnan, T. S.
    Chowdhry, Anupama
    JOURNAL OF CYTOLOGY, 2012, 29 (01) : 6 - 10