Uncertainty Quantification for MLP-Mixer Using Bayesian Deep Learning

被引:7
作者
Abdullah, Abdullah A. [1 ]
Hassan, Masoud M. [1 ]
Mustafa, Yaseen T. [2 ,3 ]
机构
[1] Univ Zakho, Fac Sci, Comp Sci Dept, Duhok 42002, Kurdistan Regio, Iraq
[2] Nawroz Univ, Coll Sci, Comp Sci Dept, Duhok 42001, Kurdistan Regio, Iraq
[3] Univ Zakho, Fac Sci, Environm Sci Dept, Duhok 42002, Kurdistan Regio, Iraq
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 07期
关键词
uncertainty quantification; Bayesian deep learning; MLP-Mixer; variational inference (VI); MC-dropout;
D O I
10.3390/app13074547
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Convolutional neural networks (CNNs) have become a popular choice for various image classification applications. However, the multi-layer perceptron mixer (MLP-Mixer) architecture has been proposed as a promising alternative, particularly for large datasets. Despite its advantages in handling large datasets and models, MLP-Mixer models have limitations when dealing with small datasets. This study aimed to quantify and evaluate the uncertainty associated with MLP-Mixer models for small datasets using Bayesian deep learning (BDL) methods to quantify uncertainty and compare the results to existing CNN models. In particular, we examined the use of variational inference and Monte Carlo dropout methods. The results indicated that BDL can improve the performance of MLP-Mixer models by 9.2 to 17.4% in term of accuracy across different mixer models. On the other hand, the results suggest that CNN models tend to have limited improvement or even decreased performance in some cases when using BDL. These findings suggest that BDL is a promising approach to improve the performance of MLP-Mixer models, especially for small datasets.
引用
收藏
页数:16
相关论文
共 28 条
[1]   A review of uncertainty quantification in deep learning: Techniques, applications and challenges [J].
Abdar, Moloud ;
Pourpanah, Farhad ;
Hussain, Sadiq ;
Rezazadegan, Dana ;
Liu, Li ;
Ghavamzadeh, Mohammad ;
Fieguth, Paul ;
Cao, Xiaochun ;
Khosravi, Abbas ;
Acharya, U. Rajendra ;
Makarenkov, Vladimir ;
Nahavandi, Saeid .
INFORMATION FUSION, 2021, 76 :243-297
[2]   A Review on Bayesian Deep Learning in Healthcare: Applications and Challenges [J].
Abdullah, Abdullah A. ;
Hassan, Masoud M. ;
Mustafa, Yaseen T. .
IEEE ACCESS, 2022, 10 :36538-36562
[3]   Dataset of breast ultrasound images [J].
Al-Dhabyani, Walid ;
Gomaa, Mohammed ;
Khaled, Hussien ;
Fahmy, Aly .
DATA IN BRIEF, 2020, 28
[4]   Review of deep learning: concepts, CNN architectures, challenges, applications, future directions [J].
Alzubaidi, Laith ;
Zhang, Jinglan ;
Humaidi, Amjad J. ;
Al-Dujaili, Ayad ;
Duan, Ye ;
Al-Shamma, Omran ;
Santamaria, J. ;
Fadhel, Mohammed A. ;
Al-Amidie, Muthana ;
Farhan, Laith .
JOURNAL OF BIG DATA, 2021, 8 (01)
[5]  
Aria M., 2021, Acute lymphoblastic leukemia (all) image dataset, DOI [10.34740/KAGGLE/DSV/2175623, DOI 10.34740/KAGGLE/DSV/2175623]
[6]   BO-ALLCNN: Bayesian-Based Optimized CNN for Acute Lymphoblastic Leukemia Detection in Microscopic Blood Smear Images [J].
Atteia, Ghada ;
Alhussan, Amel A. ;
Samee, Nagwan Abdel .
SENSORS, 2022, 22 (15)
[7]   CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope [J].
Bhatt, Dulari ;
Patel, Chirag ;
Talsania, Hardik ;
Patel, Jigar ;
Vaghela, Rasmika ;
Pandya, Sharnil ;
Modi, Kirit ;
Ghayvat, Hemant .
ELECTRONICS, 2021, 10 (20)
[8]   Bayesian Convolutional Neural Network-based Models for Diagnosis of Blood Cancer [J].
Billah, Mohammad Ehtasham ;
Javed, Farrukh .
APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
[9]   Variational Inference: A Review for Statisticians [J].
Blei, David M. ;
Kucukelbir, Alp ;
McAuliffe, Jon D. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) :859-877
[10]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929