ASSESSING THE SPEAKER RECOGNITION PERFORMANCE OF NAIVE LISTENERS USING MECHANICAL TURK

Cited by: 0

Authors:

Shen, Wade [1]

Campbell, Joseph [1]

Straub, Derek [1]

Schwartz, Reva [2]

Affiliations:

[1] MIT, Lincoln Lab, 244 Wood St, Lexington, MA 02476 USA

[2] United States Secret Service, Washington, DC 20006 USA

Source:

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011

Keywords:

speaker recognition; human perception; human assisted speaker recognition; IDENTIFICATION;

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics];

Subject classification codes:

070206; 082403

Abstract:

In this paper we attempt to quantify the ability of naive listeners to perform speaker recognition in the context of the NIST evaluation task. We describe our protocol: a series of listening experiments using large numbers of naive listeners (432) on Amazon's Mechanical Turk that attempts to measure the ability of the average human listener to perform speaker recognition. Our goal was to compare the performance of the average human listener to both forensic experts and state-of-the-art automatic systems. We show that naive listeners vary substantially in their performance, but that an aggregation of listener responses can achieve performance similar to that of expert forensic examiners.


Pages: 5916-5919

Page count: 4
