Practical Training Approaches for Discordant Atopic Dermatitis Severity Datasets: Merging Methods With Soft-Label and Train-Set Pruning

被引:7
作者
Cho, Soo Ick [1 ]
Lee, Dongheon [2 ]
Han, Byeol [3 ]
Lee, Ji Su [1 ]
Hong, Ji Yeon [4 ]
Chung, Jin Ho [1 ]
Lee, Dong Hun [1 ]
Na, Jung-Im [5 ,6 ]
机构
[1] Seoul Natl Univ, Seoul Natl Univ Hosp, Coll Med, Dept Dermatol, Seoul 03080, South Korea
[2] Chungnam Natl Univ, Chungnam Natl Univ Hosp, Coll Med, Dept Biomed Engn, Daejeon 35015, South Korea
[3] Eulji Univ, Uijeongbu Eulji Med Ctr, Sch Med, Dept Dermatol, Uijongbu 11759, South Korea
[4] Chungnam Natl Univ, Sejong Hosp, Dept Dermatol, Sejong 30099, South Korea
[5] Seoul Natl Univ, Bundang Hosp, Dept Dermatol, Seongnam 13620, South Korea
[6] Seoul Natl Univ, Coll Med, Seoul 03080, South Korea
关键词
Training; Merging; Hospitals; Convolutional neural networks; Biological system modeling; Dermatology; Bioinformatics; Atopic dermatitis; convolutional neural networks; discordance; investigator's global assessment; soft-label; INVESTIGATOR GLOBAL ASSESSMENT; RELIABILITY; GUIDELINES; ECZEMA; MANAGEMENT; ADULTS; EASI; CARE; AD;
D O I
10.1109/JBHI.2022.3218166
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective assessment of atopic dermatitis (AD) is essential for choosing proper management strategies. This study investigated the performance of convolutional neural networks (CNN) models in grading the severity of AD. Five board-certified dermatologists independently evaluated the severity of 9,192 AD images. The severity of AD was evaluated based on an Investigator's Global Assessment (IGA) and six signs of AD. For CNN training, we applied three distinct approaches: 1) ensemble vs. integration 2) hard-label vs. soft-label and 3) train-set pruning. For the IGA prediction, the two best models were chosen based on the macro-averaged AUROC and F-1 score. The ensemble-soft-label-pruning model was chosen based on AUROC 0.943, 0.927 for the internal and external validation set respectively, and integration-soft-label-whole dataset model was chosen based on the F1-score 0.750, 0.721 for the internal and external validation set respectively. CNN models trained by multi-evaluator dataset outperformed the models by an individual evaluator dataset, and they performed better to the dataset in which the assessment of dermatologists was concordant. In conclusion, CNN models for AD could be improved by labeled dataset from multiple evaluators, merging methods with soft-label and train-set pruning.
引用
收藏
页码:166 / 175
页数:10
相关论文
共 44 条
[11]  
Elmore JG, 2015, JAMA-J AM MED ASSOC, V313, P1122, DOI 10.1001/jama.2015.1405
[12]  
Esteva A, 2017, ONCOLOGIE, V19, P407
[13]   Update on Atopic Dermatitis: Diagnosis, Severity Assessment, and Treatment Selection [J].
Fishbein, Anna B. ;
Silverberg, Jonathan I. ;
Wilson, Eve J. ;
Ong, Peck Y. .
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY-IN PRACTICE, 2020, 8 (01) :91-101
[14]   A systematic review of Investigator Global Assessment (IGA) in atopic dermatitis (AD) trials: Many options, no standards [J].
Futamura, Masaki ;
Leshem, Yael A. ;
Thomas, Kim S. ;
Nankervis, Helen ;
Williams, Hywel C. ;
Simpson, Eric L. .
JOURNAL OF THE AMERICAN ACADEMY OF DERMATOLOGY, 2016, 74 (02) :288-294
[15]   Atopic Dermatitis in America Study: A Cross-Sectional Study Examining the Prevalence and Disease Burden of Atopic Dermatitis in the US Adult Population [J].
Fuxench, Zelma C. Chiesa ;
Block, Julie K. ;
Boguniewicz, Mark ;
Boyle, John ;
Fonacier, Luz ;
Gelfand, Joel M. ;
Grayson, Mitchell H. ;
Margolis, David J. ;
Mitchell, Lynda ;
Silverberg, Jonathan, I ;
Schwartz, Lawrence ;
Simpson, Eric L. ;
Ong, Peck Y. .
JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2019, 139 (03) :583-590
[16]   SoftSeg: Advantages of soft versus binary training for image segmentation [J].
Gros, Charley ;
Lemay, Andreanne ;
Cohen-Adad, Julien .
MEDICAL IMAGE ANALYSIS, 2021, 71
[17]   Efficacy and Safety of Lebrikizumab, a High-Affinity Interleukin 13 Inhibitor, in Adults With Moderate to Severe Atopic Dermatitis A Phase 2b Randomized Clinical Trial [J].
Guttman-Yassky, Emma ;
Blauvelt, Andrew ;
Eichenfield, Lawrence F. ;
Paller, Amy S. ;
Armstrong, April W. ;
Drew, Janice ;
Gopalan, Ramanan ;
Simpson, Eric L. .
JAMA DERMATOLOGY, 2020, 156 (04) :411-420
[18]   Assessment of deep neural networks for the diagnosis of benign and malignant skin neoplasms in comparison with dermatologists: A retrospective validation study [J].
Han, Seung Seog ;
Moon, Ik Jun ;
Kim, Seong Hwan ;
Na, Jung-Im ;
Kim, Myoung Shin ;
Park, Gyeong Hun ;
Park, Ilwoo ;
Kim, Keewon ;
Lim, Woohyung ;
Lee, Ju Hee ;
Chang, Sung Eun .
PLOS MEDICINE, 2020, 17 (11)
[19]   Classification of the Clinical Images for Benign and Malignant Cutaneous Tumors Using a Deep Learning Algorithm [J].
Han, Seung Seog ;
Kim, Myoung Shin ;
Lim, Woohyung ;
Park, Gyeong Hun ;
Park, Ilwoo ;
Chang, Sung Eun .
JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2018, 138 (07) :1529-1538
[20]   The eczema area and severity index (EASI): assessment of reliability in atopic dermatitis [J].
Hanifin, JM ;
Thurston, M ;
Omoto, M ;
Cherill, R ;
Tofte, SJ ;
Graeber, M .
EXPERIMENTAL DERMATOLOGY, 2001, 10 (01) :11-18