Practical Training Approaches for Discordant Atopic Dermatitis Severity Datasets: Merging Methods With Soft-Label and Train-Set Pruning

Cited by: 6
Authors
Cho, Soo Ick [1]
Lee, Dongheon [2]
Han, Byeol [3]
Lee, Ji Su [1]
Hong, Ji Yeon [4]
Chung, Jin Ho [1]
Lee, Dong Hun [1]
Na, Jung-Im [5, 6]
Affiliations
[1] Seoul Natl Univ, Seoul Natl Univ Hosp, Coll Med, Dept Dermatol, Seoul 03080, South Korea
[2] Chungnam Natl Univ, Chungnam Natl Univ Hosp, Coll Med, Dept Biomed Engn, Daejeon 35015, South Korea
[3] Eulji Univ, Uijeongbu Eulji Med Ctr, Sch Med, Dept Dermatol, Uijongbu 11759, South Korea
[4] Chungnam Natl Univ, Sejong Hosp, Dept Dermatol, Sejong 30099, South Korea
[5] Seoul Natl Univ, Bundang Hosp, Dept Dermatol, Seongnam 13620, South Korea
[6] Seoul Natl Univ, Coll Med, Seoul 03080, South Korea
Keywords
Training; Merging; Hospitals; Convolutional neural networks; Biological system modeling; Dermatology; Bioinformatics; Atopic dermatitis; discordance; investigator's global assessment; soft-label; INVESTIGATOR GLOBAL ASSESSMENT; RELIABILITY; GUIDELINES; ECZEMA; MANAGEMENT; ADULTS; EASI; CARE; AD
DOI
10.1109/JBHI.2022.3218166
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective assessment of atopic dermatitis (AD) is essential for choosing proper management strategies. This study investigated the performance of convolutional neural network (CNN) models in grading the severity of AD. Five board-certified dermatologists independently evaluated the severity of 9,192 AD images. Severity was evaluated using the Investigator's Global Assessment (IGA) and six signs of AD. For CNN training, we applied three distinct approaches: 1) ensemble vs. integration, 2) hard-label vs. soft-label, and 3) train-set pruning. For IGA prediction, the two best models were chosen based on the macro-averaged AUROC and F1-score. The ensemble-soft-label-pruning model was chosen for its AUROC of 0.943 and 0.927 on the internal and external validation sets, respectively, and the integration-soft-label-whole-dataset model was chosen for its F1-score of 0.750 and 0.721 on the internal and external validation sets, respectively. CNN models trained on the multi-evaluator dataset outperformed models trained on a single evaluator's dataset, and they performed better on the dataset in which the dermatologists' assessments were concordant. In conclusion, CNN models for AD could be improved by using datasets labeled by multiple evaluators and by merging methods with soft-label and train-set pruning.
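The hard-label vs. soft-label distinction and the idea of train-set pruning can be illustrated with a minimal Python sketch. It is not the authors' implementation. The 0-4 grade range, the function names, the tie-breaking rule, and the agreement threshold used for pruning are all assumptions made for illustration; the abstract does not specify how the study defined its pruning criterion or its loss function.

```python
import math
from collections import Counter

NUM_GRADES = 5  # assumption: IGA graded on a 0-4 scale


def soft_label(ratings, num_classes=NUM_GRADES):
    """Convert one image's grades from several evaluators into a vote distribution."""
    counts = Counter(ratings)
    return [counts.get(g, 0) / len(ratings) for g in range(num_classes)]


def hard_label(ratings, num_classes=NUM_GRADES):
    """Majority vote over the evaluators; ties resolve to the lowest grade."""
    probs = soft_label(ratings, num_classes)
    return probs.index(max(probs))


def soft_cross_entropy(logits, target):
    """Cross-entropy between a soft target and softmax(logits) for one image."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return -sum(t * (x - log_z) for t, x in zip(target, logits))


def keep_for_training(ratings, min_agreement=0.6):
    """Hypothetical pruning rule: keep an image only if enough evaluators agree."""
    return max(soft_label(ratings)) >= min_agreement


# Five evaluators grading a single image (made-up grades for illustration).
ratings = [2, 2, 3, 2, 3]
print(soft_label(ratings))
print(hard_label(ratings))
print(keep_for_training(ratings))
```

With a soft label, the disagreement between grades 2 and 3 stays visible to the loss, whereas the hard label collapses it to a single grade. The pruning rule shown here simply drops highly discordant images from the training set.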
Pages: 166-175
Page count: 10