Melanoma Detection Using XGB Classifier Combined with Feature Extraction and K-Means SMOTE Techniques

被引:21
作者
Chang, Chih-Chi [1 ]
Li, Yu-Zhen [1 ]
Wu, Hui-Ching [2 ]
Tseng, Ming-Hseng [1 ,3 ]
机构
[1] Chung Shan Med Univ, Dept Med Informat, Taichung 402, Taiwan
[2] Chung Shan Med Univ, Dept Med Sociol & Social Work, Taichung 402, Taiwan
[3] Chung Shan Med Univ Hosp, Informat Technol Off, Taichung 402, Taiwan
关键词
melanoma; feature extraction; transfer learning; imbalanced data; oversampling techniques; machine learning; ENSEMBLE; FUSION;
D O I
10.3390/diagnostics12071747
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Melanoma, a very severe form of skin cancer, spreads quickly and has a high mortality rate if not treated early. Recently, machine learning, deep learning, and other related technologies have been successfully applied to computer-aided diagnostic tasks of skin lesions. However, some issues in terms of image feature extraction and imbalanced data need to be addressed. Based on a method for manually annotating image features by dermatologists, we developed a melanoma detection model with four improvement strategies, including applying the transfer learning technique to automatically extract image features, adding gender and age metadata, using an oversampling technique for imbalanced data, and comparing machine learning algorithms. According to the experimental results, the improved strategies proposed in this study have statistically significant performance improvement effects. In particular, our proposed ensemble model can outperform previous related models.
引用
收藏
页数:19
相关论文
共 42 条
[11]   Epidemiology and Risk Factors of Melanoma [J].
Carr, Stephanie ;
Smith, Christy ;
Wernberg, Jessica .
SURGICAL CLINICS OF NORTH AMERICA, 2020, 100 (01) :1-+
[12]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[13]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[14]  
Codella Noel, 2019, arXiv
[15]  
Codella NCF, 2018, I S BIOMED IMAGING, P168, DOI 10.1109/ISBI.2018.8363547
[16]  
Combalia M, 2019, ARXIV
[17]  
DAGHRIR J, 2020, 2020 5 INT C ADV TEC, P1, DOI DOI 10.1109/ATSIP49331.2020.9231544
[18]   Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE [J].
Douzas, Georgios ;
Bacao, Fernando ;
Last, Felix .
INFORMATION SCIENCES, 2018, 465 :1-20
[19]   Deep Learning-Based Methods for Automatic Diagnosis of Skin Lesions [J].
El-Khatib, Hassan ;
Popescu, Dan ;
Ichim, Loretta .
SENSORS, 2020, 20 (06)
[20]   A Transfer Learning Architecture Based on a Support Vector Machine for Histopathology Image Classification [J].
Fan, Jiayi ;
Lee, JangHyeon ;
Lee, YongKeun .
APPLIED SCIENCES-BASEL, 2021, 11 (14)