Uncertainty Quantification for MLP-Mixer Using Bayesian Deep Learning

被引:4
作者
Abdullah, Abdullah A. [1 ]
Hassan, Masoud M. [1 ]
Mustafa, Yaseen T. [2 ,3 ]
机构
[1] Univ Zakho, Fac Sci, Comp Sci Dept, Duhok 42002, Kurdistan Regio, Iraq
[2] Nawroz Univ, Coll Sci, Comp Sci Dept, Duhok 42001, Kurdistan Regio, Iraq
[3] Univ Zakho, Fac Sci, Environm Sci Dept, Duhok 42002, Kurdistan Regio, Iraq
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 07期
关键词
uncertainty quantification; Bayesian deep learning; MLP-Mixer; variational inference (VI); MC-dropout;
D O I
10.3390/app13074547
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Convolutional neural networks (CNNs) have become a popular choice for various image classification applications. However, the multi-layer perceptron mixer (MLP-Mixer) architecture has been proposed as a promising alternative, particularly for large datasets. Despite its advantages in handling large datasets and models, MLP-Mixer models have limitations when dealing with small datasets. This study aimed to quantify and evaluate the uncertainty associated with MLP-Mixer models for small datasets using Bayesian deep learning (BDL) methods to quantify uncertainty and compare the results to existing CNN models. In particular, we examined the use of variational inference and Monte Carlo dropout methods. The results indicated that BDL can improve the performance of MLP-Mixer models by 9.2 to 17.4% in term of accuracy across different mixer models. On the other hand, the results suggest that CNN models tend to have limited improvement or even decreased performance in some cases when using BDL. These findings suggest that BDL is a promising approach to improve the performance of MLP-Mixer models, especially for small datasets.
引用
收藏
页数:16
相关论文
共 28 条
  • [1] A review of uncertainty quantification in deep learning: Techniques, applications and challenges
    Abdar, Moloud
    Pourpanah, Farhad
    Hussain, Sadiq
    Rezazadegan, Dana
    Liu, Li
    Ghavamzadeh, Mohammad
    Fieguth, Paul
    Cao, Xiaochun
    Khosravi, Abbas
    Acharya, U. Rajendra
    Makarenkov, Vladimir
    Nahavandi, Saeid
    [J]. INFORMATION FUSION, 2021, 76 : 243 - 297
  • [2] A Review on Bayesian Deep Learning in Healthcare: Applications and Challenges
    Abdullah, Abdullah A.
    Hassan, Masoud M.
    Mustafa, Yaseen T.
    [J]. IEEE ACCESS, 2022, 10 : 36538 - 36562
  • [3] Dataset of breast ultrasound images
    Al-Dhabyani, Walid
    Gomaa, Mohammed
    Khaled, Hussien
    Fahmy, Aly
    [J]. DATA IN BRIEF, 2020, 28
  • [4] Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
    Alzubaidi, Laith
    Zhang, Jinglan
    Humaidi, Amjad J.
    Al-Dujaili, Ayad
    Duan, Ye
    Al-Shamma, Omran
    Santamaria, J.
    Fadhel, Mohammed A.
    Al-Amidie, Muthana
    Farhan, Laith
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [5] Aria M., 2021, Acute Lymphoblastic Leukemia (ALL) Image Dataset, DOI [10.34740/KAGGLE/DSV/2175623, DOI 10.34740/KAGGLE/DSV/2175623]
  • [6] BO-ALLCNN: Bayesian-Based Optimized CNN for Acute Lymphoblastic Leukemia Detection in Microscopic Blood Smear Images
    Atteia, Ghada
    Alhussan, Amel A.
    Samee, Nagwan Abdel
    [J]. SENSORS, 2022, 22 (15)
  • [7] CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope
    Bhatt, Dulari
    Patel, Chirag
    Talsania, Hardik
    Patel, Jigar
    Vaghela, Rasmika
    Pandya, Sharnil
    Modi, Kirit
    Ghayvat, Hemant
    [J]. ELECTRONICS, 2021, 10 (20)
  • [8] Bayesian Convolutional Neural Network-based Models for Diagnosis of Blood Cancer
    Billah, Mohammad Ehtasham
    Javed, Farrukh
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [9] Variational Inference: A Review for Statisticians
    Blei, David M.
    Kucukelbir, Alp
    McAuliffe, Jon D.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 859 - 877
  • [10] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]