A generalized framework for medical image classification and recognition

Cited by: 19
Authors
Abedini, M. [1]
Codella, N. C. F. [2]
Connell, J. H. [2]
Garnavi, R. [1]
Merler, M. [2]
Pankanti, S. [2]
Smith, J. R. [2]
Syeda-Mahmood, T. [3]
Affiliations
[1] IBM Res Australia, Carlton, Vic 3053, Australia
[2] IBM Res Div, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[3] IBM Res Almaden, San Jose, CA 95120 USA
Keywords
RETRIEVAL; TEXTURE; SYSTEMS;
DOI
10.1147/JRD.2015.2390017
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
In this work, we study the performance of a two-stage ensemble visual machine learning framework for classification of medical images. In the first stage, models are built for subsets of features and data, and in the second stage, models are combined. We demonstrate the performance of this framework in four contexts: 1) the public ImageCLEF (Cross Language Evaluation Forum) 2013 medical modality recognition benchmark, 2) echocardiography view and mode recognition, 3) dermatology disease recognition across two datasets, and 4) a broad medical image dataset, merged from multiple data sources into a collection of 158 categories covering both general and specific medical concepts, including modalities, body regions, views, and disease states. In the first context, the presented system achieves state-of-the-art performance of 82.2% multiclass accuracy. In the second context, the system attains 90.48% multiclass accuracy. In the third, state-of-the-art performance of 90% specificity and 90% sensitivity is obtained on a small standardized dataset of 200 images using a leave-one-out strategy. For a larger dataset of 2,761 images, 95% specificity and 98% sensitivity are obtained on a 20% held-out test set. Finally, in the fourth context, the system achieves sensitivity and specificity of 94.7% and 98.4%, respectively, demonstrating the ability to generalize over domains.
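The two-stage idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say which base learner, combination rule, or features are used, so the nearest-centroid scorer, the score-averaging fusion, the toy four-dimensional feature vectors, and all function names below are assumptions chosen only to keep the example short and self-contained. The sketch builds one model per (feature subset, data subset) pair, averages their scores, and reports sensitivity and specificity as named in the abstract.

```python
from statistics import mean


def train_centroid_model(samples, labels, feature_idx, sample_idx):
    """Stage 1 (assumed learner): nearest-centroid scorer trained on a
    subset of the features and a subset of the training samples."""
    pos = [[samples[i][f] for f in feature_idx] for i in sample_idx if labels[i] == 1]
    neg = [[samples[i][f] for f in feature_idx] for i in sample_idx if labels[i] == 0]
    pos_centroid = [mean(col) for col in zip(*pos)]
    neg_centroid = [mean(col) for col in zip(*neg)]

    def score(sample):
        x = [sample[f] for f in feature_idx]
        d_pos = sum((a - b) ** 2 for a, b in zip(x, pos_centroid))
        d_neg = sum((a - b) ** 2 for a, b in zip(x, neg_centroid))
        # Positive score: the sample is closer to the positive centroid.
        return d_neg - d_pos

    return score


def combine(models, sample):
    """Stage 2 (assumed rule): average the stage-1 scores."""
    return mean(m(sample) for m in models)


def sensitivity_specificity(y_true, y_pred):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


# Toy data (hypothetical): three positive and three negative training samples.
X_train = [
    [1.0, 1.0, 0.9, 1.1],
    [0.9, 1.2, 1.0, 0.8],
    [1.1, 0.8, 1.2, 1.0],
    [0.0, 0.1, 0.2, 0.0],
    [0.1, 0.0, 0.0, 0.2],
    [0.2, 0.2, 0.1, 0.1],
]
y_train = [1, 1, 1, 0, 0, 0]

# Each stage-1 model sees its own feature subset and its own data subset.
models = [
    train_centroid_model(X_train, y_train, feature_idx=[0, 1], sample_idx=[0, 1, 3, 4]),
    train_centroid_model(X_train, y_train, feature_idx=[2, 3], sample_idx=[1, 2, 4, 5]),
]

X_test = [
    [1.0, 0.9, 1.0, 1.0],
    [0.1, 0.1, 0.1, 0.1],
    [0.8, 1.1, 0.9, 0.7],
    [0.0, 0.3, 0.2, 0.0],
]
y_test = [1, 0, 1, 0]

y_pred = [1 if combine(models, x) > 0 else 0 for x in X_test]
sens, spec = sensitivity_specificity(y_test, y_pred)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```

Averaging scores is only one possible second-stage rule; a learned combiner or a model-selection step could take its place without changing the overall two-stage structure.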
Pages: 18