ASSESSING THE SPEAKER RECOGNITION PERFORMANCE OF NAIVE LISTENERS USING MECHANICAL TURK

被引:0
|
作者
Shen, Wade [1 ]
Campbell, Joseph [1 ]
Straub, Derek [1 ]
Schwartz, Reva [2 ]
机构
[1] MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02476 USA
[2] United States Secret Service, Washington, DC 20006 USA
来源
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年
关键词
speaker recognition; human perception; human assisted speaker recognition; IDENTIFICATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we attempt to quantify the ability of naive listeners to perform speaker recognition in the context of the NIST evaluation task. We describe our protocol: a series of listening experiments using large numbers of naive listeners (432) on Amazon's Mechanical Turk that attempts to measure the ability of the average human listener to perform speaker recognition. Our goal was to compare the performance of the average human listener to both forensic experts and state-of-the-art automatic systems. We show that naive listeners vary substantially in their performance, but that an aggregation of listener responses can achieve performance similar to that of expert forensic examiners.
引用
收藏
页码:5916 / 5919
页数:4
相关论文
共 50 条
  • [1] Speaker verification by human listeners: Experiments comparing human and machine performance using the NIST 1998 speaker evaluation data
    Schmidt-Nielsen, A
    Crystal, TH
    DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 249 - 266
  • [2] Performance Comparison of Speaker and Emotion Recognition
    Revathy, A.
    Shanmugapriya, P.
    Mohan, V.
    2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
  • [3] Standoff Speaker Recognition: Effects of Recording Distance Mismatch on Speaker Recognition System Performance
    Fowler, Mike
    McCurry, Mark
    Bramsen, Jonathan
    Dunsin, Kehinde
    Remus, Jeremiah
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3680 - 3683
  • [4] Speaker Recognition Using Dl
    Dhole, Avinash
    Kadroli, Vijaylaxmi
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, MACHINE LEARNING AND APPLICATIONS, VOL 1, ICDSMLA 2023, 2025, 1273 : 993 - 1010
  • [5] An efficient speaker recognition using quantum neural network
    Kaur, Rupinderdeep
    Sharma, R. K.
    Kumar, Parteek
    MODERN PHYSICS LETTERS B, 2018, 32 (31):
  • [6] Towards improving the performance of speaker recognition systems
    Johnson, Neethu
    George, Kuruvachan K.
    Kumar, Santhosh C.
    Raj, Reghu P. C.
    2014 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND COMMUNICATIONS (ICCSC), 2014, : 38 - 41
  • [7] Boosting Speaker Recognition Performance with Compact Representations
    Yaman, Sibel
    Pelecanos, Jason
    Omar, Mohamed K.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 388 - 391
  • [8] Speaker recognition using decision fusion
    BIOSIGNALS 2008: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON BIO-INSPIRED SYSTEMS AND SIGNAL PROCESSING, VOL 1, 2008, : 267 - 272
  • [9] Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance
    Rahman, Md Hafizur
    Himawan, Ivan
    Mclaren, Mitchell
    Fookes, Clinton
    Sridharan, Sridha
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3593 - 3597
  • [10] IMPROVING SPEAKER RECOGNITION PERFORMANCE IN THE DOMAIN ADAPTATION CHALLENGE USING DEEP NEURAL NETWORKS
    Garcia-Romero, Daniel
    Zhang, Xiaohui
    McCree, Alan
    Povey, Daniel
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 378 - 383