A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability

Cited by: 230
Authors
Huang, Xiaowei [1 ]
Kroening, Daniel [2 ]
Ruan, Wenjie [3 ]
Sharp, James [4 ]
Sun, Youcheng [5 ]
Thamo, Emese [1 ]
Wu, Min [2 ]
Yi, Xinping [1 ]
Affiliations
[1] Univ Liverpool, Liverpool, Merseyside, England
[2] Univ Oxford, Oxford, England
[3] Univ Lancaster, Lancaster, England
[4] Def Sci & Technol Lab Dstl, Porton Down, Salisbury, England
[5] Queens Univ Belfast, Belfast, Antrim, Northern Ireland
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
Abstraction-refinement; Robustness; Extraction
DOI
10.1016/j.cosrev.2020.100270
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology]
Discipline Classification Code
0812
Abstract
In the past few years, significant progress has been made on deep neural networks (DNNs) in achieving human-level performance on several long-standing tasks. With the broader deployment of DNNs in various applications, concerns over their safety and trustworthiness have been raised publicly, especially after widely reported fatal incidents involving self-driving cars. Research to address these concerns is particularly active, with a significant number of papers released in the past few years. This survey paper reviews the current research effort into making DNNs safe and trustworthy, focusing on four aspects: verification, testing, adversarial attack and defence, and interpretability. In total, we survey 202 papers, most of which were published after 2017. (c) 2020 Elsevier Inc. All rights reserved.
Pages: 35
Related Papers
50 records in total
  • [1] Interpretability Analysis of Deep Neural Networks With Adversarial Examples
    Dong Y.-P.
    Su H.
    Zhu J.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (01): 75-86
  • [2] Adversarial Watermarking to Attack Deep Neural Networks
    Wang, Gengxing
    Chen, Xinyuan
    Xu, Chang
    2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019: 1962-1966
  • [3] Safety Verification of Deep Neural Networks
    Huang, Xiaowei
    Kwiatkowska, Marta
    Wang, Sen
    Wu, Min
    Computer Aided Verification (CAV 2017), Part I, 2017, 10426: 3-29
  • [4] Cocktail Universal Adversarial Attack on Deep Neural Networks
    Li, Shaoxin
    Li, Xiaofeng
    Che, Xin
    Li, Xintong
    Zhang, Yong
    Chu, Lingyang
    Computer Vision - ECCV 2024, Part LXV, 2025, 15123: 396-412
  • [5] Diversity Adversarial Training against Adversarial Attack on Deep Neural Networks
    Kwon, Hyun
    Lee, Jun
    Symmetry-Basel, 2021, 13 (03)
  • [6] Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing Their Input Gradients
    Ross, Andrew Slavin
    Doshi-Velez, Finale
    Thirty-Second AAAI Conference on Artificial Intelligence / Thirtieth Innovative Applications of Artificial Intelligence Conference / Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, 2018: 1660-1669
  • [7] ADMM Attack: An Enhanced Adversarial Attack for Deep Neural Networks with Undetectable Distortions
    Zhao, Pu
    Xu, Kaidi
    Liu, Sijia
    Wang, Yanzhi
    Lin, Xue
    24th Asia and South Pacific Design Automation Conference (ASP-DAC 2019), 2019: 499-505
  • [8] AdvAttackVis: An Adversarial Attack Visualization System for Deep Neural Networks
    Ding Wei-jie
    Shen Xuchen
    Yuan Ying
    Mao Ting-yun
    Sun Guo-dao
    Chen Li-li
    Chen Bing-ting
    International Journal of Advanced Computer Science and Applications, 2024, 15 (05): 383-391
  • [9] Adversarial attack model based on deep neural network interpretability and artificial fish swarm algorithm
    Li, Yamin
    International Journal of Electronic Security and Digital Forensics, 2024, 16 (05): 614-632
  • [10] Survey on Testing of Deep Neural Networks
    Wang Z.
    Yan M.
    Liu S.
    Chen J.-J.
    Zhang D.-D.
    Wu Z.
    Chen X.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (05): 1255-1275