ASSESSING THE SPEAKER RECOGNITION PERFORMANCE OF NAIVE LISTENERS USING MECHANICAL TURK

被引:0
|
作者
Shen, Wade [1 ]
Campbell, Joseph [1 ]
Straub, Derek [1 ]
Schwartz, Reva [2 ]
机构
[1] MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02476 USA
[2] United States Secret Service, Washington, DC 20006 USA
来源
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年
关键词
speaker recognition; human perception; human assisted speaker recognition; IDENTIFICATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we attempt to quantify the ability of naive listeners to perform speaker recognition in the context of the NIST evaluation task. We describe our protocol: a series of listening experiments using large numbers of naive listeners (432) on Amazon's Mechanical Turk that attempts to measure the ability of the average human listener to perform speaker recognition. Our goal was to compare the performance of the average human listener to both forensic experts and state-of-the-art automatic systems. We show that naive listeners vary substantially in their performance, but that an aggregation of listener responses can achieve performance similar to that of expert forensic examiners.
引用
收藏
页码:5916 / 5919
页数:4
相关论文
共 50 条
  • [41] Using MAP Estimation of Feature Transformation for Speaker Recognition
    Zhu, Donglai
    Ma, Bin
    Li, Haizhou
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 849 - 852
  • [42] TEXT DEPENDENT SPEAKER RECOGNITION USING SHIFTED MFCC
    Mukherjee, Rishiraj
    Islam, Tanmoy
    Sankar, Ravi
    2013 PROCEEDINGS OF IEEE SOUTHEASTCON, 2013,
  • [43] TEXT DEPENDENT SPEAKER RECOGNITION USING SHIFTED MFCC
    Mukherjee, Rishiraj
    Islam, Tanmoy
    Sankar, Ravi
    2012 PROCEEDINGS OF IEEE SOUTHEASTCON, 2012,
  • [44] Multimedia document retrieval using speech and speaker recognition
    Viswanathan M.
    Beigi H.S.M.
    Dharanipragada S.
    Maali F.
    Tritschler A.
    International Journal on Document Analysis and Recognition, 2000, 2 (04) : 147 - 162
  • [45] SPEAKER RECOGNITION USING A KIND OF NOVEL PHONOTACTIC INFORMATION
    Zhang, Xiang
    Xiao, Xiang
    Wang, Haipeng
    Suo, Hongbin
    Zhao, Qingwei
    Yan, Yonghong
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 330 - 333
  • [46] AN APPLICATION OF SPEAKER RECOGNITION USING ARTIFICIAL NEURAL NETWORKS
    Caner, Murat
    Ustun, Seydi Vakkas
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2006, 12 (02): : 279 - 284
  • [47] Eliciting and evaluating likelihood ratios for speaker recognition by human listeners under forensically realistic channel-mismatched conditions
    Hughes, Vincent
    Llamas, Carmen
    Kettig, Thomas
    INTERSPEECH 2022, 2022, : 5238 - 5242
  • [48] Initial Analysis of the Impact of Emotional Speech on the Performance of Speaker Recognition on New Serbian Emotional Database
    Mandaric, Igor
    Vujovic, Mia
    Suzic, Sinisa
    Nosek, Tijana
    Simic, Nikola
    Delic, Vlado
    2021 29TH TELECOMMUNICATIONS FORUM (TELFOR), 2021,
  • [49] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
    Kang, Woo Hyun
    Cho, Won Ik
    Jang, Se Young
    Lee, Hyeon Seung
    Kim, Nam Soo
    IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87
  • [50] Evaluating the effects of continuous pitch and speech tempo modifications on perceptual speaker verification performance by familiar and unfamiliar listeners
    O'Brien, Benjamin
    Meunier, Christine
    Ghio, Alain
    SPEECH COMMUNICATION, 2024, 165