Extracting Explanations, Justification, and Uncertainty from Black-Box Deep Neural Networks

被引：0

作者：

Ardis, Paul ^{[1
]}

Flenner, Arjuna ^{[2
]}

机构：

[1] GE Aerosp Res, 1 Res Circle, Niskayuna, NY 12309 USA

[2] GE Aerosp, 3290 Patterson Ave SE, Grand Rapids, MI 49512 USA

来源：

ASSURANCE AND SECURITY FOR AI-ENABLED SYSTEMS | 2024年 / 13054卷

关键词：

D O I：

10.1117/12.3012765

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep Neural Networks (DNNs) do not inherently compute or exhibit empirically-justified task confidence. In mission critical applications, it is important to both understand associated DNN reasoning and its supporting evidence. In this paper, we propose a novel Bayesian approach to extract explanations, justifications, and uncertainty estimates from DNNs. Our approach is efficient both in terms of memory and computation, and can be applied to any black box DNN without any retraining, including applications to anomaly detection and out-of-distribution detection tasks. We validate our approach on the CIFAR-10 dataset, and show that it can significantly improve the interpretability and reliability of DNNs.

引用

页数：8

共 50 条

[31] DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks [J].

Aghababaeyan, Zohreh ;

Abdellatif, Manel ;

Dadkhah, Mahboubeh ;

Briand, Lionel .

ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (06)

[32] NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks [J].

Li, Yandong ;

Li, Lijun ;

Wang, Liqiang ;

Zhang, Tong ;

Gong, Boqing .

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97

[33] Spectral Privacy Detection on Black-box Graph Neural Networks [J].

Yang, Yining ;

Lu, Jialiang .

2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,

[34] Neural networks in antenna engineering - Beyond black-box modeling [J].

Patnaik, A ;

Anagnostou, D ;

Christodoulou, CG .

2005 IEEE/ACES INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND APPLIED COMPUTATIONAL ELECTROMAGNETICS, 2005, :598-601

[35] BET: Black-Box Efficient Testing for Convolutional Neural Networks [J].

Wang, Jialai ;

Qiu, Han ;

Rong, Yi ;

Ye, Hengkai ;

Li, Qi ;

Li, Zongpeng ;

Zhang, Chao .

PROCEEDINGS OF THE 31ST ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2022, 2022, :164-175

[36] A Unique Identification-Oriented Black-Box Watermarking Scheme for Deep Classification Neural Networks [J].

Mo, Mouke ;

Wang, Chuntao ;

Bian, Shan .

SYMMETRY-BASEL, 2024, 16 (03)

[37] A black-box backdoor attack against quantum neural networks [J].

Zhao, Jiayu ;

Yan, Lili ;

Tan, Dong ;

Chang, Yan ;

Zhang, Shibin .

QUANTUM SCIENCE AND TECHNOLOGY, 2025, 10 (03)

[38] Black-box Adversarial Attack and Defense on Graph Neural Networks [J].

Li, Haoyang ;

Di, Shimin ;

Li, Zijian ;

Chen, Lei ;

Cao, Jiannong .

2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, :1017-1030

[39] Revisiting Black-box Ownership Verification for Graph Neural Networks [J].

Zhou, Ruikai ;

Yang, Kang ;

Wang, Xiuling ;

Wang, Wendy Hui ;

Xu, Jun .

45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP 2024, 2024, :2478-2496

[40] Feature Importance Explanations for Temporal Black-Box Models [J].

Sood, Akshay ;

Craven, Mark .

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, :8351-8360

← 1 2 3 4 5 →