The Feasibility of Injecting Inaudible Voice Commands to Voice Assistants

被引:46
作者
Yan, Chen [1 ]
Zhang, Guoming [1 ]
Ji, Xiaoyu [1 ]
Zhang, Tianchen [1 ]
Zhang, Taimin [1 ]
Xu, Wenyuan [1 ]
机构
[1] Zhejiang Univ, Dept Elect Engn, Hangzhou 310027, Zhejiang, Peoples R China
关键词
Speech recognition; Microphones; Hardware; Security; Google; Software; Micromechanical devices; Inaudible voice commands; voice assistants; speech recognition; microphones; security analysis; defense;
D O I
10.1109/TDSC.2019.2906165
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Voice assistants (VAs) such as Siri and Google Now have become an increasingly popular human-machine interaction method and have made various systems voice controllable. Prior work on attacking voice assistants shows that the hidden voice commands that are incomprehensible to people can control the VAs. Hidden voice commands, though 'hidden', are nonetheless audible. In this work, we design a completely inaudible attack, DolphinAttack, that modulates voice commands on ultrasonic carriers to achieve inaudibility. By leveraging the nonlinearity of the microphone circuits, the modulated low-frequency audio commands can be successfully demodulated, recovered, and more importantly interpreted by the voice assistants. We validate DolphinAttack on popular voice assistants, including Siri, Google Now, S Voice, HiVoice, Cortana, Alexa, etc. By injecting a sequence of inaudible voice commands, we show a few proof-of-concept attacks, which include activating Siri to initiate a FaceTime call on iPhone, activating Google Now to turn on the airplane mode, and even manipulating the navigation system in an Audi automobile. We propose hardware and software defense solutions. We validate that it is feasible to detect DolphinAttack by classifying the audios using supported vector machine (SVM), and suggest to re-design voice assistants to be resilient to inaudible voice command attacks.
引用
收藏
页码:1108 / 1124
页数:17
相关论文
共 61 条
[1]  
A. E. Society, 2013, AES5 2008 R2013 AES
[2]  
Acapela Group, 2017, AC TEXT SPEECH DEM
[3]  
Analog Devices, 2011, ADMP401 OMN MICR BOT
[4]  
[Anonymous], 2018, BUSINESS INSIDER
[5]  
[Anonymous], 2010, P 19 USENIX C SEC US
[6]  
[Anonymous], 2015, 9 USENIX WORKSH OFF
[7]  
[Anonymous], INT C COMP GRAPH SIM
[8]  
Avisoft Bioacoustics, 2017, ULTR DYN SPEAK VIF
[9]  
Aviv AJ, 2012, 28TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE (ACSAC 2012), P41