Fighting AI with AI: Fake Speech Detection using Deep Learning

被引:0
作者
Malik, Hafiz [1 ]
Changalvala, Raghavendar [1 ]
机构
[1] Univ Michigan, Dept Elect & Comp Engn, Informat Syst Secur & Forens Lab, Dearborn, MI 48128 USA
来源
2019 AES INTERNATIONAL CONFERENCE ON AUDIO FORENSICS | 2019年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice cloning technologies have found applications in a variety of areas ranging from personalized speech interfaces to advertisement, video gaming, and so on. Existing voice cloning systems are capable of learning speaker characteristics from few samples and generating perceptually indistinguishable speech. These advances pose new security and privacy threats to voice-driven interfaces. This paper presents a deep learning-based framework for learning cloned speech synthesis models and the bona-fide speech production processes. To this end, a convolutional neural network is trained and tested on spectrogram estimated from input audio recordings. Performance of the proposed method is evaluated on cloned and bona-fide audios. Experimental results indicate that the proposed method is capable of detecting bona-fide and cloned audios with a close to perfect accuracy.
引用
收藏
页数:9
相关论文
共 26 条
  • [1] [Anonymous], 2017, CORR
  • [2] [Anonymous], 2017, CORR
  • [3] [Anonymous], 2016, CORR
  • [4] [Anonymous], 2017, CORR
  • [5] [Anonymous], 2019, MICROSOFT CORTANA
  • [6] [Anonymous], 2019, ANAZOM ALEXA
  • [7] [Anonymous], 2019, GOOGLE HOME
  • [8] [Anonymous], 2018, CHASE INTRODUCES VOI
  • [9] [Anonymous], 1993, LINGUISTIC DATA CONS, DOI DOI 10.35111/17GK-BN40
  • [10] Apple, 2020, APPL DES