HUMAN AND MACHINE SPEAKER RECOGNITION BASED ON SHORT TRIVIAL EVENTS

被引:0
|
作者
Zhang, Miao [1 ,2 ]
Kang, Xiaofei [1 ,3 ]
Wang, Yanqing [1 ,2 ]
Li, Lantian [1 ]
Tang, Zhiyuan [1 ]
Dai, Haisheng [4 ]
Wang, Dong [1 ]
机构
[1] Tsinghua Univ, Ctr Speech & Language Technol, Beijing, Peoples R China
[2] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[3] Peking Univ, Beijing, Peoples R China
[4] JD AI Res, Beijing, Peoples R China
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年
基金
中国国家自然科学基金;
关键词
speaker recognition; speech perception; deep neural network; speaker feature learning;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Human speech often has events that we will call trivial events, e.g., cough, laugh and sniff. Compared to regular speech, these trivial events are usually short and variable, thus generally regarded as not speaker discriminative and so are largely ignored by present speaker recognition research. However, these trivial events are highly valuable in some particular circumstances such as forensic examination, as they are less subjected to intentional change, so can be used to discover the genuine speaker from disguised speech. In this paper, we collect a trivial event speech database that involves 75 speakers and 6 types of events, and report preliminary speaker recognition results on this database, by both human listeners and machines. Particularly, the deep feature learning technique recently proposed by our group is utilized to analyze and recognize the trivial events, leading to acceptable equal error rates (EERs) ranging from 5% to 15% despite the extremely short durations (0.2-0.5 seconds) of these events. Comparing different types of events, 'hmm' seems more speaker discriminative.
引用
收藏
页码:5009 / 5013
页数:5
相关论文
共 50 条
  • [1] Automatic Speaker Recognition System based on Machine Learning Algorithms
    Mokgonyane, Tumisho Billson
    Sefara, Tshephisho Joseph
    Modipa, Thipe Isaiah
    Mogale, Mercy Mosibudi
    Manamela, Madimetja Jonas
    Manamela, Phuti John
    2019 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), 2019, : 141 - 146
  • [2] Automatic Speaker Recognition System based on Optimised Machine Learning Algorithms
    Mokgonyane, Tumisho Billson
    Sefara, Tshephisho Joseph
    Modipa, Thipe Isaiah
    Manamela, Madimetja Jonas
    2019 IEEE AFRICON, 2019,
  • [3] RESTRICTED BOLTZMANN MACHINE SUPERVECTORS FOR SPEAKER RECOGNITION
    Ghahabi, Omid
    Hernando, Javier
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4804 - 4808
  • [4] Speaker recognition based on short utterance compensation method of generative adversarial networks
    Hu, Zhangfang
    Fu, Yaqin
    Luo, Yuan
    Xu, Xuan
    Xia, Zhiguang
    Zhang, Hongwei
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02) : 443 - 450
  • [5] SELF-SUPERVISED SPEAKER RECOGNITION TRAINING USING HUMAN-MACHINE DIALOGUES
    Cekic, Metehan
    Li, Ruirui
    Chen, Zeya
    Yang, Yuguang
    Stolcke, Andreas
    Madhow, Upamanyu
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6132 - 6136
  • [6] Speaker recognition based on short utterance compensation method of generative adversarial networks
    Zhangfang Hu
    Yaqin Fu
    Yuan Luo
    Xuan Xu
    Zhiguang Xia
    Hongwei Zhang
    International Journal of Speech Technology, 2020, 23 : 443 - 450
  • [7] Speaker recognition based on telephone quality short Polish sequences with removed silence
    Marciniak, Tomasz
    Krzykowska, Agnieszka
    Weychan, Radoslaw
    PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (06): : 42 - 46
  • [8] Speaker Cluster based GMM Tokenization for Speaker Recognition
    Ma, Bin
    Zhu, Donglai
    Tong, Rong
    Li, Haizhou
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 505 - 508
  • [9] SHORT-TIMED SPEECH DYNAMICS FOR SPEAKER RECOGNITION
    LI, H
    HATON, JP
    SU, J
    ELECTRONICS LETTERS, 1995, 31 (17) : 1416 - 1418
  • [10] Machine learning-based self-powered acoustic sensor for speaker recognition
    Han, Jae Hyun
    Bae, Kang Min
    Hong, Seong Kwang
    Park, Hyunsin
    Kwak, Jun-Hyuk
    Wang, Hee Seung
    Joe, Daniel Juhyung
    Park, Jung Hwan
    Jung, Young Hoon
    Hur, Shin
    Yoo, Chang D.
    Lee, Keon Jae
    NANO ENERGY, 2018, 53 : 658 - 665