Information retrieval test collection for searching spontaneous Czech speech

被引:0
作者
Ircing, Pavel [1 ]
Pecina, Pavel [2 ]
Oard, Douglas W. [3 ]
Wang, Jianqiang [4 ]
White, Ryen W. [5 ]
Hoidekr, Jan [1 ]
机构
[1] Univ W Bohemia, Fac Sci Appl, Dept Cybernet, Univ 8, Plzen 30614, Czech Republic
[2] Charles Univ Prague, Inst Formal & Appl Linguist, Prague 11800, Czech Republic
[3] Univ Maryland, Coll Informat Studies, College Pk, MD 20742 USA
[4] SUNY Buffalo, Dept Informat & Lib Studies, Buffalo, NY 14260 USA
[5] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA
来源
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2007年 / 4629卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the design of the first large-scale IR test collection built for the Czech language. The creation of this collection also happens to be very challenging, as it is based on a continuous text stream from automatic transcription of spontaneous speech and thus lacks clearly defined document boundaries. All aspects of the collection building are presented, together with some general findings of initial experiments.
引用
收藏
页码:439 / +
页数:3
相关论文
共 50 条
  • [21] A system for speech driven information retrieval
    Gonzalez-Ferreras, Cesar
    Cardenoso-Payo, Valentin
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 624 - 628
  • [22] Speech Transcript Evaluation for Information Retrieval
    van der Werff, Laurens
    Kraaij, Wessel
    de Jong, Franciska
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1536 - +
  • [23] SPONTANEOUS SPEECH IN EPIC PROSE - CZECH - KOZEVNIKOVA,K
    SCHAARSC.G
    CANADIAN SLAVONIC PAPERS, 1974, 16 (03) : 511 - 513
  • [24] Speech interface and information retrieval for Medical Information system
    Hsu, CY
    Chen, B
    FASEB JOURNAL, 2001, 15 (04) : A485 - A485
  • [25] Collection profiling for collection fusion in distributed information retrieval systems
    Lu, Chengye
    Xu, Yue
    Geva, Shlomo
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2007, 4798 : 279 - 288
  • [26] Creating a Data Collection for Evaluating Rich Speech Retrieval
    Eskevich, Maria
    Jones, Gareth J. F.
    Larson, Martha
    Ordelman, Roeland
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1736 - 1743
  • [27] Mediated Web Information Retrieval for a Complex Searching Task
    Lee, Hyuk-Jin
    Muresan, Gheorghe
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (07): : 1372 - 1391
  • [28] WTR: A Test Collection for Web Table Retrieval
    Chen, Zhiyu
    Zhang, Shuo
    Davison, Brian D.
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2514 - 2520
  • [29] An Adaptive Information Retrieval System for Efficient Web Searching
    Hajeer, Safaa I.
    Ismail, Rasha M.
    Badr, Nagwa L.
    Tolba, Mohamed Fahmy
    ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, AMLTA 2014, 2014, 488 : 472 - 482
  • [30] Converted Lattice-based Chinese Spontaneous Speech Retrieval Based on Mutual Information Confidence Measure
    Huang, Xiangsong
    Zhao, Chunhui
    Pan, Dapeng
    Liu, Baisen
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2009, : 484 - +