Creating a Data Collection for Evaluating Rich Speech Retrieval

被引:0
|
作者
Eskevich, Maria [1 ]
Jones, Gareth J. F. [1 ]
Larson, Martha [2 ]
Ordelman, Roeland [3 ]
机构
[1] Dublin City Univ, Ctr Digital Video Proc, Sch Comp, Dublin 9, Ireland
[2] Delft Univ Technol, Delft, Netherlands
[3] Univ Twente, Enschede, Netherlands
来源
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012年
基金
爱尔兰科学基金会; 欧盟第七框架计划;
关键词
Speech Search; Speech Collection Creation; Speech Retrieval; Crowdsourcing;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
We describe the development of a test collection for the investigation of speech retrieval beyond identification of relevant content. This collection focuses on satisfying user information needs for queries associated with specific types of speech acts. The collection is based on an archive of the Internet video from Internet video sharing platform (blip.tv), and was provided by the MediaEval benchmarking initiative. A crowdsourcing approach was used to identify segments in the video data which contain speech acts, to create a description of the video containing the act and to generate search queries designed to refind this speech act. We describe and reflect on our experiences with crowdsourcing this test collection using the Amazon Mechanical Turk platform. We highlight the challenges of constructing this dataset, including the selection of the data source, design of the crowdsouring task and the specification of queries and relevant items.
引用
收藏
页码:1736 / 1743
页数:8
相关论文
共 50 条
  • [31] A matching algorithm between arbitrary sections of two speech data sets for speech retrieval
    Itoh, Y
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 593 - 596
  • [32] Perceptual bias in speech error data collection:: Insights from Spanish speech errors
    Perez, Elvira
    Santiago, Julio
    Palma, Alfonso
    O'Seaghdha, Padraig G.
    JOURNAL OF PSYCHOLINGUISTIC RESEARCH, 2007, 36 (03) : 207 - 235
  • [33] Perceptual Bias in Speech Error Data Collection: Insights from Spanish Speech Errors
    Elvira Pérez
    Julio Santiago
    Alfonso Palma
    Padraig G. O’Seaghdha
    Journal of Psycholinguistic Research, 2007, 36 : 207 - 235
  • [34] Experiences in data collection for the training of an automatic speech recognizer in Sepedi
    Manamela, MJD
    Botha, EC
    2002 IEEE AFRICON, VOLS 1 AND 2: ELECTROTECHNOLOGICAL SERVICES FOR AFRICA, 2002, : 377 - 381
  • [35] Data collection of Japanese dialects and its influence into speech recognition
    Kudo, I
    Nakama, T
    Watanabe, T
    Kameyama, R
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2021 - 2024
  • [36] A Speech-Based Data Collection Interface for Contact Tracing
    Babaian, Tamara
    HCI INTERNATIONAL 2021 - LATE BREAKING POSTERS, HCII 2021, PT II, 2021, 1499 : 134 - 138
  • [37] THE ROLE OF AUTOMATED SPEECH RECOGNITION IN ENDOSCOPIC DATA-COLLECTION
    JOHANNES, RS
    CARRLOCKE, DL
    ENDOSCOPY, 1992, 24 : 493 - 498
  • [38] Building a Domain-Specific Document Collection for Evaluating Metadata Effects on Information Retrieval
    Magdy, Walid
    Min, Jinming
    Leveling, Johannes
    Jones, Gareth J. F.
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1368 - 1373
  • [39] CREATING ALARYNGEAL SPEECH
    SMITH, RJH
    EAR NOSE & THROAT JOURNAL, 1982, 61 (04): : 193 - 199
  • [40] Quick Rich Transcriptions of Arabic Broadcast News Speech Data
    Bendahman, Chomicha
    Glenn, Meghan
    Mostefa, Djamel
    Paulsson, Niklas
    Strassel, Stephanie
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 3605 - 3608