Creating a Data Collection for Evaluating Rich Speech Retrieval

被引:0
|
作者
Eskevich, Maria [1 ]
Jones, Gareth J. F. [1 ]
Larson, Martha [2 ]
Ordelman, Roeland [3 ]
机构
[1] Dublin City Univ, Ctr Digital Video Proc, Sch Comp, Dublin 9, Ireland
[2] Delft Univ Technol, Delft, Netherlands
[3] Univ Twente, Enschede, Netherlands
来源
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012年
基金
爱尔兰科学基金会; 欧盟第七框架计划;
关键词
Speech Search; Speech Collection Creation; Speech Retrieval; Crowdsourcing;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
We describe the development of a test collection for the investigation of speech retrieval beyond identification of relevant content. This collection focuses on satisfying user information needs for queries associated with specific types of speech acts. The collection is based on an archive of the Internet video from Internet video sharing platform (blip.tv), and was provided by the MediaEval benchmarking initiative. A crowdsourcing approach was used to identify segments in the video data which contain speech acts, to create a description of the video containing the act and to generate search queries designed to refind this speech act. We describe and reflect on our experiences with crowdsourcing this test collection using the Amazon Mechanical Turk platform. We highlight the challenges of constructing this dataset, including the selection of the data source, design of the crowdsouring task and the specification of queries and relevant items.
引用
收藏
页码:1736 / 1743
页数:8
相关论文
共 50 条
  • [41] Digitising data collection to improve haemophilia care: Creating a bespoke, interdisciplinary data management system
    Davis, Richard
    Emmines, Rick
    Dempsey, Stephen
    Morjaria, Pankaj
    McLaughlin, Paul
    Aradom, Elsa
    Chowdary, Pratima
    HAEMOPHILIA, 2022, 28 : 36 - 37
  • [42] LABORATORY DATA-COLLECTION, ANALYSIS AND RETRIEVAL IN A SYSTEM WITH DISTRIBUTED INTELLIGENCE
    JEZL, BA
    HANAFEY, MK
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1985, 190 (SEP): : 3 - COP
  • [43] Evaluating Feasibility of Personal Diabetes Device Data Collection for Research
    Faulds, Eileen R.
    Militello, Lisa K.
    Tubbs-Cooley, Heather
    Happ, Mary Beth
    NURSING RESEARCH, 2020, 69 (06) : 476 - 482
  • [44] A COMPUTERIZED REPOSITORY SYSTEM FOR COLLECTION AND RETRIEVAL OF ACCELERATION STRESS RESEARCH DATA
    WHINNERY, JE
    SLAUGHTER, JW
    METHODS OF INFORMATION IN MEDICINE, 1981, 20 (01) : 10 - 15
  • [45] Evaluating Efficient Data Collection Algorithms for Environmental Sensor Networks
    Evans, William C.
    Bahr, Alexander
    Martinoli, Alcherio
    DISTRIBUTED AUTONOMOUS ROBOTIC SYSTEMS, 2013, 83 : 77 - 89
  • [46] Evaluating passive physiological data collection during Spravato treatment
    Solomon, Todd M.
    Hajduk, Matus
    Majernik, Martin
    Jemison, Jamileh
    Deschamps, Alexander
    Scoggins, Jenna
    Kolar, Adam
    Pinheiro, Miguel Amavel
    Dubec, Peter
    Skala, Ondrej
    Muir, Owen
    Tinkelman, Amanda
    Karlin, Daniel R.
    Barrow, Robert
    FRONTIERS IN DIGITAL HEALTH, 2023, 5
  • [47] A data mining approach for creating a job position in the system for evaluating competencies
    Pektor, Ondrej
    Walek, Bogdan
    Martinik, Ivo
    2018 11TH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING, 2019, 1195
  • [48] CREATING SPEECH COPORA FOR SPEECH SCIENCE AND TECHNOLOGY
    ITAHASHI, S
    IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1906 - 1910
  • [49] Evaluating flight coordination approaches of UAV squads for WSN data collection enhancing the internet range on WSN data collection
    Olivieri de Souza, Bruno Jose
    Endler, Markus
    JOURNAL OF INTERNET SERVICES AND APPLICATIONS, 2020, 11 (01)
  • [50] Speech data retrieval system constructed on a universal phonetic code domain
    Tanaka, K
    Itoh, Y
    Kojima, H
    Fujimura, N
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 323 - 326