Black-box Attacks Against Neural Binary Function Detection

被引：0

作者：

Bundt, Joshua ^{[1
,2
]}

Davinroy, Michael ^{[1
]}

Agadakos, Ioannis ^{[1
,3
]}

Oprea, Alina ^{[1
]}

Robertson, William ^{[1
]}

机构：

[1] Northeastern Univ, Boston, MA 02115 USA

[2] Army Cyber Inst, West Point, NY 10996 USA

[3] Amazon, Seattle, WA USA

来源：

PROCEEDINGS OF THE 26TH INTERNATIONAL SYMPOSIUM ON RESEARCH IN ATTACKS, INTRUSIONS AND DEFENSES, RAID 2023 | 2023年

基金：

美国国家科学基金会;

关键词：

binary analysis; disassembly; deep neural network; function boundary detection; CODE;

D O I：

10.1145/3607199.3607200

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Binary analyses based on deep neural networks (DNNs), or neural binary analyses (NBAs), have become a hotly researched topic in recent years. DNNs have been wildly successful at pushing the performance and accuracy envelopes in the natural language and image processing domains. Thus, DNNs are highly promising for solving binary analysis problems that are hard due to a lack of complete information resulting from the lossy compilation process. Despite this promise, it is unclear that the prevailing strategy of repurposing embeddings and model architectures originally developed for other problem domains is sound given the adversarial contexts under which binary analysis often operates. In this paper, we empirically demonstrate that the current state of the art in neural function boundary detection is vulnerable to both inadvertent and deliberate adversarial attacks. We proceed from the insight that current generation NBAs are built upon embeddings and model architectures intended to solve syntactic problems. We devise a simple, reproducible, and scalable black-box methodology for exploring the space of inadvertent attacks - instruction sequences that could be emitted by common compiler toolchains and configurations - that exploits this syntactic design focus. We then show that these inadvertent misclassifications can be exploited by an attacker, serving as the basis for a highly effective black-box adversarial example generation process. We evaluate this methodology against two state-of-the-art neural function boundary detectors: XDA and DeepDi. We conclude with an analysis of the evaluation data and recommendations for how future research might avoid succumbing to similar attacks.

引用

页码：1 / 16

页数：16

共 78 条

[1] Control-Flow Integrity Principles, Implementations, and Applications
Abadi, Martin
Budiu, Mihai
Erlingsson, Ulfar
Ligatti, Jay
[J]. ACM TRANSACTIONS ON INFORMATION AND SYSTEM SECURITY, 2009, 13 (01)
[2] Nibbler: Debloating Binary Shared Libraries
Agadakos, Ioannis
Jin, Di
Williams-King, David
Kemerlis, Vasileios P.
Portokalidis, Georgios
[J]. 35TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE (ACSA), 2019, : 70 - 83
[3] Adversarial Deep Learning for Robust Detection of Binary Encoded Malware
Al-Dujaili, Abdullah
Huang, Alex
Hemberg, Erik
O'reilly, Una-May
[J]. 2018 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2018), 2018, : 76 - 82
[4] Compiler-Agnostic Function Detection in Binaries
Andriesse, Dennis
Slowinska, Asia
Bos, Herbert
[J]. 2017 IEEE EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY (EUROS&P), 2017, : 177 - 189
[5] Andriesse D, 2016, PROCEEDINGS OF THE 25TH USENIX SECURITY SYMPOSIUM, P583
[6] Athalye A, 2018, PR MACH LEARN RES, V80
[7] Axelsson Stefan, 2000, The Base-Rate Fallacy and the Difficulty of Intrusion Detection, V3, P20
[8] Brown TB, 2020, Arxiv, DOI arXiv:2005.14165
[9] Bao T, 2014, PROCEEDINGS OF THE 23RD USENIX SECURITY SYMPOSIUM, P845
[10] Superset Disassembly: Statically Rewriting x86 Binaries Without Heuristics
Bauman, Erick
Lin, Zhiqiang
Hamlen, Kevin W.
[J]. 25TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2018), 2018,

← 1 2 3 4 5 6 7 8 →