Dual-level Deep Evidential Fusion: Integrating multimodal information for enhanced reliable decision-making in deep learning

被引：21

作者：

Shao, Zhimin ^{[1
]}

Dou, Weibei ^{[1
,3
]}

Pan, Yu ^{[2
,3
]}

机构：

[1] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol BNRist, Dept Elect Engn, Beijing 100084, Peoples R China

[2] Beijing Tsinghua Changgung Hosp, Dept Rehabil Med, Beijing 102218, Peoples R China

[3] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China

来源：

INFORMATION FUSION | 2024年 / 103卷

基金：

中国国家自然科学基金;

关键词：

Evidential deep learning; Basic belief assignment; Multimodal fusion; Uncertainty estimation; Dempster-Shafer theory; NETWORKS;

D O I：

10.1016/j.inffus.2023.102113

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multimodal learning has gained significant attention in recent years for combining information from different modalities using Deep Neural Networks (DNNs). However, existing approaches often overlook the varying importance of modalities and neglect uncertainty estimation, leading to limited generalization and unreliable predictions. In this paper, we propose a novel algorithm, Dual-level Deep Evidential Fusion (DDEF), to address these challenges by integrating multimodal information at both the Basic Belief Assignment (BBA) level and multimodal level, for enhancing accuracy, robustness, and reliability. The proposed DDEF approach utilizes the Dirichlet framework and BBA methods to connect neural network outputs with Dirichlet distribution parameters, enabling effective uncertainty estimation, and the Dempster-Shafer Theory (DST) is used for dual-level fusion, facilitating the fusion of evidence from two BBA methods and multiple modalities. It has been validated by two experiments on synthetic digit classification, and real-world medical prognosis after brain- computer interface (BCI) treatment, and by demonstrating superior performance compared to existing methods. Our findings emphasize the importance of considering multimodal integration and uncertainty estimation for reliable decision-making in deep learning.

引用

页数：9

共 56 条

[1] Early, intermediate and late fusion strategies for robust deep learning-based multimodal action recognition [J].

Boulahia, Said Yacine ;

Amamra, Abdenour ;

Madi, Mohamed Ridha ;

Daikh, Said .

MACHINE VISION AND APPLICATIONS, 2021, 32 (06)

[2]

Charpentier Bertrand, 2020, ADV NEURAL INFORM PR, V33

[3] Modern views of machine learning for precision psychiatry [J].

Chen, Zhe Sage ;

Kulkarni, Prathamesh Param ;

Galatzer-Levy, Isaac R. ;

Bigio, Benedetta ;

Nasca, Carla ;

Zhang, Yu .

PATTERNS, 2022, 3 (11)

[4]

Cohen G, 2017, IEEE IJCNN, P2921, DOI 10.1109/IJCNN.2017.7966217

[5] Analysis of multimodal data fusion from an information theory perspective [J].

Dai, Yinglong ;

Yan, Zheng ;

Cheng, Jiangchang ;

Duan, Xiaojun ;

Wang, Guojun .

INFORMATION SCIENCES, 2023, 623 :164-183

[6]

de Campos TE, 2009, VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, P273

[7]

Dempster AP, 2008, STUD FUZZ SOFT COMP, V219, P57

[8] A neural network classifier based on Dempster-Shafer theory [J].

Denoeux, T .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (02) :131-150

[9]

DENOEUX T, 1994, MACH INTELL PATT REC, V16, P13

[10]

Gal Y, 2016, PR MACH LEARN RES, V48

← 1 2 3 4 5 6 →