Reconstructing Test Labels From Noisy Loss Functions

被引：0

作者：

Aggarwal, Abhinav ^{[1
]}

Kasiviswanathan, Shiva Prasad ^{[1
]}

Xu, Zekun ^{[1
]}

Feyisetan, Oluwaseyi ^{[1
]}

Teissier, Nathanael ^{[1
]}

机构：

[1] Amazon, Seattle, WA 98109 USA

来源：

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151 | 2022年 / 151卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Machine learning classifiers rely on loss functions for performance evaluation, often on a private (hidden) dataset. In a recent line of research, label inference was introduced as the problem of reconstructing the ground truth labels of this private dataset from just the (possibly perturbed) cross-entropy loss function values evaluated at chosen prediction vectors (without any other access to the hidden dataset). In this paper, we formally study the necessary and sufficient conditions under which label inference is possible from any (noisy) loss function value. Using tools from analytical number theory, we show that a broad class of commonly used loss functions, including general Bregman divergence-based losses and multiclass cross-entropy with common activation functions like sigmoid and soft-max, it is possible to design label inference attacks that succeed even for arbitrary noise levels and using only a single query from the adversary. We formally study the computational complexity of label inference and show that while in general, designing adversarial prediction vectors for these attacks is co-NP-hard, once we have these vectors, the attacks can also be carried out through a lightweight augmentation to any neural network model, making them look benign and hard to detect. The observations in this paper provide a deeper understanding of the vulnerabilities inherent in modern machine learning and could be used for designing future trustworthy ML.

引用

页数：22

共 14 条

[1]

Aggarwal A, 2021, PR MACH LEARN RES, V139

[2]

Arora S, 2009, COMPUTATIONAL COMPLEXITY: A MODERN APPROACH, P1, DOI 10.1017/CBO9780511804090

[3] ON THE UNIQUE SATISFIABILITY PROBLEM [J].

BLASS, A ;

GUREVICH, Y .

INFORMATION AND CONTROL, 1982, 55 (1-3) :80-88

[4] DETECTING SQUAREFREE NUMBERS [J].

Booker, Andrew R. ;

Hiary, Ghaith A. ;

Keating, Jon P. .

DUKE MATHEMATICAL JOURNAL, 2015, 164 (02) :235-275

[5]

Bregman L., 1967, USSR Computational Mathematics and Mathematical Physics, V7, P200, DOI [10.1016/0041-5553(67)90040-7, DOI 10.1016/0041-5553(67)90040-7]

[6]

Brent RichardP., 2010, Modern Computer Arithmetic, V18

[7]

Dinur I., 2003, PODS, DOI DOI 10.1145/773153.773173

[8] Calibrating noise to sensitivity in private data analysis [J].

Dwork, Cynthia ;

McSherry, Frank ;

Nissim, Kobbi ;

Smith, Adam .

THEORY OF CRYPTOGRAPHY, PROCEEDINGS, 2006, 3876 :265-284

[9]

Knuth DonaldE., 2014, ART COMPUTER PROGRAM, V2

[10]

Liu Chaoyue., 2016, P ADV NEUR INF PROC, P2351

← 1 2 →