Creating a Data Collection for Evaluating Rich Speech Retrieval

Cited by: 0

Authors:

Eskevich, Maria [1]

Jones, Gareth J. F. [1]

Larson, Martha [2]

Ordelman, Roeland [3]

Affiliations:

[1] Dublin City Univ, Ctr Digital Video Proc, Sch Comp, Dublin 9, Ireland

[2] Delft Univ Technol, Delft, Netherlands

[3] Univ Twente, Enschede, Netherlands

Source:

LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012

Funding:

Science Foundation Ireland; European Union Seventh Framework Programme

Keywords:

Speech Search; Speech Collection Creation; Speech Retrieval; Crowdsourcing

DOI:

Not available

Chinese Library Classification (CLC) number:

H0 [Linguistics]

Discipline classification codes:

030303; 0501; 050102

Abstract:

We describe the development of a test collection for the investigation of speech retrieval beyond identification of relevant content. This collection focuses on satisfying user information needs for queries associated with specific types of speech acts. The collection is based on an archive of Internet video from the video sharing platform blip.tv, and was provided by the MediaEval benchmarking initiative. A crowdsourcing approach was used to identify segments in the video data that contain speech acts, to create a description of the video containing each act, and to generate search queries designed to refind that speech act. We describe and reflect on our experiences with crowdsourcing this test collection using the Amazon Mechanical Turk platform. We highlight the challenges of constructing this dataset, including the selection of the data source, the design of the crowdsourcing task, and the specification of queries and relevant items.


Pages: 1736-1743

Page count: 8

