ASSESSING THE SPEAKER RECOGNITION PERFORMANCE OF NAIVE LISTENERS USING MECHANICAL TURK

被引：0

作者：

Shen, Wade ^{[1
]}

Campbell, Joseph ^{[1
]}

Straub, Derek ^{[1
]}

Schwartz, Reva ^{[2
]}

机构：

[1] MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02476 USA

[2] United States Secret Service, Washington, DC 20006 USA

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年

关键词：

speaker recognition; human perception; human assisted speaker recognition; IDENTIFICATION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper we attempt to quantify the ability of naive listeners to perform speaker recognition in the context of the NIST evaluation task. We describe our protocol: a series of listening experiments using large numbers of naive listeners (432) on Amazon's Mechanical Turk that attempts to measure the ability of the average human listener to perform speaker recognition. Our goal was to compare the performance of the average human listener to both forensic experts and state-of-the-art automatic systems. We show that naive listeners vary substantially in their performance, but that an aggregation of listener responses can achieve performance similar to that of expert forensic examiners.

引用

页码：5916 / 5919

页数：4

共 50 条

[41] Using MAP Estimation of Feature Transformation for Speaker Recognition
Zhu, Donglai
Ma, Bin
Li, Haizhou
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 849 - 852
[42] TEXT DEPENDENT SPEAKER RECOGNITION USING SHIFTED MFCC
Mukherjee, Rishiraj
Islam, Tanmoy
Sankar, Ravi
2013 PROCEEDINGS OF IEEE SOUTHEASTCON, 2013,
[43] TEXT DEPENDENT SPEAKER RECOGNITION USING SHIFTED MFCC
Mukherjee, Rishiraj
Islam, Tanmoy
Sankar, Ravi
2012 PROCEEDINGS OF IEEE SOUTHEASTCON, 2012,
[44] Multimedia document retrieval using speech and speaker recognition
Viswanathan M.
Beigi H.S.M.
Dharanipragada S.
Maali F.
Tritschler A.
International Journal on Document Analysis and Recognition, 2000, 2 (04) : 147 - 162
[45] SPEAKER RECOGNITION USING A KIND OF NOVEL PHONOTACTIC INFORMATION
Zhang, Xiang
Xiao, Xiang
Wang, Haipeng
Suo, Hongbin
Zhao, Qingwei
Yan, Yonghong
2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 330 - 333
[46] AN APPLICATION OF SPEAKER RECOGNITION USING ARTIFICIAL NEURAL NETWORKS
Caner, Murat
Ustun, Seydi Vakkas
PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2006, 12 (02): : 279 - 284
[47] Eliciting and evaluating likelihood ratios for speaker recognition by human listeners under forensically realistic channel-mismatched conditions
Hughes, Vincent
Llamas, Carmen
Kettig, Thomas
INTERSPEECH 2022, 2022, : 5238 - 5242
[48] Initial Analysis of the Impact of Emotional Speech on the Performance of Speaker Recognition on New Serbian Emotional Database
Mandaric, Igor
Vujovic, Mia
Suzic, Sinisa
Nosek, Tijana
Simic, Nikola
Delic, Vlado
2021 29TH TELECOMMUNICATIONS FORUM (TELFOR), 2021,
[49] I-Vector Extraction Using Speaker Relevancy for Short Duration Speaker Recognition
Kang, Woo Hyun
Cho, Won Ik
Jang, Se Young
Lee, Hyeon Seung
Kim, Nam Soo
IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 79 - 87
[50] Evaluating the effects of continuous pitch and speech tempo modifications on perceptual speaker verification performance by familiar and unfamiliar listeners
O'Brien, Benjamin
Meunier, Christine
Ghio, Alain
SPEECH COMMUNICATION, 2024, 165

← 1 2 3 4 5 →