An Efficient High-Quality Medical Lesion Image Data Labeling Method Based on Active Learning

被引:6
作者
Zhou, Jiancun [1 ,2 ]
Cao, Rui [2 ]
Kang, Jian [3 ]
Guo, Kehua [2 ]
Xu, Yangting [3 ]
机构
[1] Hunan City Univ, All Solid State Energy Storage Mat & Devices Key, Yiyang 413000, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[3] Cent South Univ, Xiangya Hosp 3, Dept Dermatol, Changsha 410083, Peoples R China
基金
美国国家科学基金会;
关键词
Biomedical imaging; Machine learning; Labeling; Semisupervised learning; Data models; Manuals; Diseases; High-quality data; biomedical engineering; active learning; deep learning; CLASSIFICATION;
D O I
10.1109/ACCESS.2020.3014355
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid development of artificial intelligence has allowed deep learning technology to change our lives and has brought considerable convenience, but deep learning cannot succeed without a sufficient quantity and quality of data. In medical systems, due to the special nature of medical data resources, labeling and screening require professional input from doctors at considerable cost. However, if these data cannot be used effectively, then resources are wasted. To solve this problem, this paper proposes an effective high-quality medical lesion image data labeling method based on active learning, which labels the most representative and high-quality medical image data with artificial assistance. First, we generated subregions for all unlabeled images and predicted their classifications. Second, multifactor calculations were performed on all images. Finally, the values of multiple factors were used to sort all images, and the top-ranked images were selected and labeled with artificial assistance. The above steps were repeated until a suitable number of datasets had been labeled. The experimental results showed that a model trained on the labeled high-quality dataset could achieve the same quality as the model trained on all the data and save a considerable amount of time on manual labeling, which demonstrates the effectiveness of the method. The method ensures that the labeled data are valuable, high quality and rich in information to reduce the labeling workload and avoid wasting data resources.
引用
收藏
页码:144331 / 144342
页数:12
相关论文
共 32 条
[1]  
[Anonymous], 2013, INT CONF ADV COMMUN
[2]  
[Anonymous], 2020, IEEE ACCESS, DOI DOI 10.1109/ACCESS.2020.2997736
[3]  
Antony J, 2016, INT C PATT RECOG, P1195, DOI 10.1109/ICPR.2016.7899799
[4]   Automatic Semantic Modeling of Indoor Scenes from Low-quality RGB-D Data using Contextual Information [J].
Chen, Kang ;
Lai, Yu-Kun ;
Wu, Yu-Xin ;
Martin, Ralph ;
Hu, Shi-Min .
ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (06)
[5]   Dermatologist-level classification of skin cancer with deep neural networks [J].
Esteva, Andre ;
Kuprel, Brett ;
Novoa, Roberto A. ;
Ko, Justin ;
Swetter, Susan M. ;
Blau, Helen M. ;
Thrun, Sebastian .
NATURE, 2017, 542 (7639) :115-+
[6]   Case-Based Statistical Learning: A Non-Parametric Implementation With a Conditional-Error Rate SVM [J].
Gorriz, J. M. ;
Ramirez, J. ;
Suckling, J. ;
Illan, Ignacio Alvarez ;
Ortiz, Andres ;
Martinez-Murcia, F. J. ;
Segovia, Fermin ;
Salas-Gonzalez, D. ;
Wang, Shuihua .
IEEE ACCESS, 2017, 5 :11468-11478
[7]   LCC: Towards efficient label completion and correction for supervised medical image learning in smart diagnosis [J].
Guo, Kehua ;
Cao, Rui ;
Kui, Xiaoyan ;
Ma, Jianhua ;
Kang, Jian ;
Chi, Tao .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2019, 133 :51-59
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]  
Huazhu Fu, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P132, DOI 10.1007/978-3-319-46723-8_16
[10]  
Joshi AJ, 2009, PROC CVPR IEEE, P2364