Privacy-Preserving Liveness Detection for Securing Smart Voice Interfaces

被引:0
作者
Meng, Yan [1 ]
Li, Jiachun [1 ]
Zhu, Haojin [1 ]
Tian, Yuan [2 ,3 ]
Chen, Jiming [4 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[2] Univ Calif Los Angeles, Elect & Comp Engn Dept, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Inst Technol Law & Policy ITLP, Los Angeles, CA 90095 USA
[4] Zhejiang Univ, State Key Lab Ind Control Technol, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Microphone arrays; Loudspeakers; Training; Privacy; Fingerprint recognition; Arrays; Mouth; Liveness detection; voice interface; smart speakers; multi-channel audio;
D O I
10.1109/TDSC.2023.3319833
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Smart speakers are widely used as the primary user interface in intelligent systems, including smart homes and industrial IoT. However, they are vulnerable to voice spoofing attacks which result in malicious command execution or privacy information leakage. Passive liveness detection, which thwarts voice spoofing via analyzing the collected audio rather than deploying sensors to distinguish between live-human and spoofing voices, has drawn increasing attention. But existing schemes either face performance degradation under environmental factor changes or require the user to keep fixed gestures, which limit their deployment in real-world scenarios. Besides, the space distributed property of smart speakers causes building a universal classifier for all involved users to be cumbersome and increases privacy leakage issues. To address the challenges mentioned above, we propose LiveArray, an efficient, lightweight, and privacy-preserving passive liveness detection system. LiveArray exploits a novel liveness feature, array fingerprint, which utilizes the microphone array inherently adopted by the smart speaker to improve the accuracy of liveness detection. LiveArray's further employs the federated learning-based architecture to reduce the dataset collection overhead during classifier building and eliminate the potential privacy leakage during data transmission. Experimental results show that LiveArray achieves an accuracy of 99.16%, which is superior to existing passive schemes.
引用
收藏
页码:2900 / 2916
页数:17
相关论文
共 54 条
  • [1] Abdullah H, 2021, P IEEE S SECUR PRIV, P712, DOI 10.1109/SP40001.2021.00009
  • [2] Abdullah H, 2021, P IEEE S SECUR PRIV, P730, DOI 10.1109/SP40001.2021.00014
  • [3] Ahmed ME, 2020, PROCEEDINGS OF THE 29TH USENIX SECURITY SYMPOSIUM, P2685
  • [4] Monkey Says, Monkey Does: Security and Privacy on Voice Assistants
    Alepis, Efthimios
    Patsakis, Constantinos
    [J]. IEEE ACCESS, 2017, 5 : 17841 - 17851
  • [5] 2MA: Verifying Voice Commands via Two Microphone Authentication
    Blue, Logan
    Abdullah, Hadi
    Vargas, Luis
    Traynor, Patrick
    [J]. PROCEEDINGS OF THE 2018 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIACCS'18), 2018, : 89 - 100
  • [6] Hello, Is It Me You're Looking For? Differentiating Between Human and Electronic Speakers for Voice Interface Security
    Blue, Logan
    Vargas, Luis
    Traynor, Patrick
    [J]. WISEC'18: PROCEEDINGS OF THE 11TH ACM CONFERENCE ON SECURITY & PRIVACY IN WIRELESS AND MOBILE NETWORKS, 2018, : 123 - 133
  • [7] Carlini N, 2016, PROCEEDINGS OF THE 25TH USENIX SECURITY SYMPOSIUM, P513
  • [8] Chen G., 2019, P IEEE S SEC PRIV, DOI DOI 10.1109/SP40001
  • [9] You Can Hear But You Cannot Steal: Defending against Voice Impersonation Attacks on Smartphones
    Chen, Si
    Ren, Kui
    Piao, Sixu
    Wang, Cong
    Wang, Qian
    Weng, Jian
    Su, Lu
    Mohaisen, Aziz
    [J]. 2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 183 - 195
  • [10] CMUSphinx, 2022, CMUSphinx open source speech recognition