Active Learning Performance in Labeling Radiology Images Is 90% Effective

被引:4
作者
Bangert, Patrick [1 ]
Moon, Hankyu [1 ]
Woo, Jae Oh [1 ]
Didari, Sima [1 ]
Hao, Heng [1 ]
机构
[1] Samsung SDSA, San Jose, CA 95112 USA
来源
FRONTIERS IN RADIOLOGY | 2021年 / 1卷
关键词
artificial intelligence; computer vision; annotation; labeling; active learning; object discovery; SYSTEMS; MODEL;
D O I
10.3389/fradi.2021.748968
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
To train artificial intelligence (AI) systems on radiology images, an image labeling step is necessary. Labeling for radiology images usually involves a human radiologist manually drawing a (polygonal) shape onto the image and attaching a word to it. As datasets are typically large, this task is repetitive, time-consuming, error-prone, and expensive. The AI methodology of active learning (AL) can assist human labelers by continuously sorting the unlabeled images in order of information gain and thus getting the labeler always to label the most informative image next. We find that after about 10%, depending on the dataset, of the images in a realistic dataset are labeled, virtually all the information content has been learnt and the remaining images can be automatically labeled. These images can then be checked by the radiologist, which is far easier and faster to do. In this way, the entire dataset is labeled with much less human effort. We introduce AL in detail and expose the effectiveness using three real-life datasets. We contribute five distinct elements to the standard AL workflow creating an advanced methodology.
引用
收藏
页数:12
相关论文
共 70 条
  • [1] Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions
    Adomavicius, G
    Tuzhilin, A
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (06) : 734 - 749
  • [2] [Anonymous], 2021, GE
  • [3] [Anonymous], 2021, Philips
  • [4] [Anonymous], 2011, European Journal of Scientific Research
  • [5] [Anonymous], 2006, Learning Interpretable Models
  • [6] ATI, 2020, Explaining Decisions Made With AI
  • [7] A completely annotated whole slide image dataset of canine breast cancer to aid human breast cancer research
    Aubreville, Marc
    Bertram, Christof A.
    Donovan, Taryn A.
    Marzahl, Christian
    Maier, Andreas
    Klopfleisch, Robert
    [J]. SCIENTIFIC DATA, 2020, 7 (01)
  • [8] Baccelli F, 2016, IEEE INT SYMP INFO, P695, DOI 10.1109/ISIT.2016.7541388
  • [9] Bangert P., 2021, Machine Learning and Data Science in the Power Generation Industry
  • [10] Bangert Patrick., 2012, OPTIMIZATION IND PRO