The widespread adoption of smart speakers such as Amazon Echo and Google Home has embedded them deeply into everyday life. These devices offer advanced features, including emergency services and smart home capabilities, through environmental sound source detection. However, they also face cybersecurity risks such as backdoor and adversarial attacks, which exploit server-side vulnerabilities. We first conducted a user survey (n=97) to gauge user perspectives on security concerns related to voice assistants. To counter these threats, our study investigates a secure multi-party homomorphic neural network for audio classification that operates directly on encrypted audio data. We collected data from 20 homes, extracted time-series features, and encrypted them as input to the Homomorphic Neural Network (HNN), which produces encrypted predictions. The model achieved an accuracy of 93.18% in identifying 11 types of audio objects. This research not only assesses accuracy but also contrasts the multi-party homomorphic method with conventional neural networks across various performance indicators, highlighting the efficiencies and potential challenges of using encrypted models.
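To illustrate the encrypted-inference workflow described above, the sketch below shows a minimal encrypted forward pass, assuming the CKKS scheme via the TenSEAL library as a stand-in for the paper's multi-party homomorphic setup; the feature dimension, layer sizes, square activation, and encryption parameters are illustrative assumptions, not the architecture or configuration reported in the study.

```python
# Minimal sketch of encrypted audio-feature inference (assumptions: TenSEAL/CKKS,
# a single hidden layer with a square activation, random weights for illustration).
import numpy as np
import tenseal as ts

# Client side: create a CKKS context; the secret key stays with the data owner.
context = ts.context(
    ts.SCHEME_TYPE.CKKS,
    poly_modulus_degree=16384,
    coeff_mod_bit_sizes=[60, 40, 40, 40, 60],  # enough levels for 3 multiplications
)
context.global_scale = 2 ** 40
context.generate_galois_keys()

features = np.random.randn(16)                       # stand-in time-series features from one clip
enc_features = ts.ckks_vector(context, features.tolist())

# Server side: plaintext model weights applied to the encrypted feature vector.
W1 = (np.random.randn(16, 8) * 0.1).tolist()         # hidden layer weights (illustrative)
b1 = np.zeros(8).tolist()
W2 = (np.random.randn(8, 11) * 0.1).tolist()         # 11 output classes, as in the study
b2 = np.zeros(11).tolist()

hidden = enc_features.matmul(W1) + b1                 # encrypted linear layer
hidden = hidden * hidden                              # square activation (HE-friendly)
enc_logits = hidden.matmul(W2) + b2                   # encrypted output layer

# Client side: only the secret-key holder can decrypt the prediction.
logits = enc_logits.decrypt()
predicted_class = int(np.argmax(logits))
print("predicted class index:", predicted_class)
```

In this setup the server never sees raw audio features or the decrypted class label, which is the property the study leverages to defend against server-side attacks; the multi-party variant additionally splits decryption authority across parties, a detail omitted from this single-key sketch.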