A multimodal transformer to fuse images and metadata for skin disease classification

Cited by: 0
Authors
Gan Cai
Yu Zhu
Yue Wu
Xiaoben Jiang
Jiongyao Ye
Dawei Yang
Affiliations
[1] East China University of Science and Technology, School of Information Science and Engineering
[2] Zhongshan Hospital, Department of Pulmonary and Critical Care Medicine
[3] Fudan University
[4] Shanghai Engineering Research Center of Internet of Things for Respiratory Medicine
Source
The Visual Computer | 2023 / Vol. 39
Keywords
Skin disease; Deep learning; Transformer; Multimodal fusion; Attention
DOI
Not available
Abstract
Skin disease is rising in prevalence, and diagnosing it remains a challenging clinical task. Deep learning could help meet this challenge. In this study, a novel neural network is proposed for skin disease classification. Because the datasets consist of skin disease images and clinical metadata, we propose a multimodal Transformer with two encoders, one for images and one for metadata, and one decoder that fuses the multimodal information. A suitable Vision Transformer (ViT) model serves as the backbone to extract deep image features. The metadata are treated as labels, and a new Soft Label Encoder (SLE) is designed to embed them. In the decoder, a novel Mutual Attention (MA) block is proposed to better fuse image features and metadata features. To evaluate the model's effectiveness, extensive experiments were conducted on a private skin disease dataset and the benchmark dataset ISIC 2018. Compared with state-of-the-art methods, the proposed model shows better performance and represents an advancement in skin disease diagnosis.
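The abstract does not describe the internals of the SLE or the MA block, so the following is only an illustrative sketch of the general idea under stated assumptions, not the authors' implementation. It assumes that "soft labels" are probability vectors over each metadata field's categories, embedded through a lookup table, and that "mutual attention" can be approximated by cross-attention running in both directions (image tokens attend to metadata tokens and vice versa). All function names, dimensions, the random weights, and the choice of 7 output classes are hypothetical.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over a single list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng):
    """Stand-in for learned weights (assumption: real weights would be trained)."""
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


def add(a, b):
    """Element-wise sum of two equally shaped matrices (residual connection)."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def mean_pool(tokens):
    """Average a sequence of token vectors into one vector."""
    return [sum(col) / len(tokens) for col in zip(*tokens)]


def cross_attention(queries, context, w_q, w_k, w_v):
    """Scaled dot-product attention where `queries` attend to `context`."""
    q = matmul(queries, w_q)
    k = matmul(context, w_k)
    v = matmul(context, w_v)
    scale = math.sqrt(len(q[0]))
    k_t = [list(col) for col in zip(*k)]          # transpose of k
    scores = matmul(q, k_t)
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v)


rng = random.Random(0)
dim, num_classes = 4, 7                           # hypothetical sizes

# Stand-in for ViT patch features: 6 tokens of width `dim`.
image_tokens = random_matrix(6, dim, rng)

# Assumed soft-label encoding: each row is a distribution over 3 categories
# of one metadata field; multiplying by a table yields one token per field.
soft_labels = [[0.9, 0.1, 0.0], [0.0, 0.5, 0.5]]
embedding_table = random_matrix(3, dim, rng)
meta_tokens = matmul(soft_labels, embedding_table)

# Assumed mutual attention: cross-attention in both directions, with residuals.
img_w = [random_matrix(dim, dim, rng) for _ in range(3)]
meta_w = [random_matrix(dim, dim, rng) for _ in range(3)]
image_side = add(image_tokens, cross_attention(image_tokens, meta_tokens, *img_w))
meta_side = add(meta_tokens, cross_attention(meta_tokens, image_tokens, *meta_w))

# Pool each side, concatenate, and classify with a linear layer.
fused = mean_pool(image_side) + mean_pool(meta_side)   # list concatenation
classifier = random_matrix(2 * dim, num_classes, rng)
probs = softmax(matmul([fused], classifier)[0])
```

Here `fused` has length `2 * dim` and `probs` is a distribution over the hypothetical classes. Concatenating the two pooled sides is one simple design choice for fusion; the paper's decoder may combine the streams differently.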
Pages: 2781-2793
Number of pages: 12