Information retrieval test collection for searching spontaneous Czech speech

被引:0
|
作者
Ircing, Pavel [1 ]
Pecina, Pavel [2 ]
Oard, Douglas W. [3 ]
Wang, Jianqiang [4 ]
White, Ryen W. [5 ]
Hoidekr, Jan [1 ]
机构
[1] Univ W Bohemia, Fac Sci Appl, Dept Cybernet, Univ 8, Plzen 30614, Czech Republic
[2] Charles Univ Prague, Inst Formal & Appl Linguist, Prague 11800, Czech Republic
[3] Univ Maryland, Coll Informat Studies, College Pk, MD 20742 USA
[4] SUNY Buffalo, Dept Informat & Lib Studies, Buffalo, NY 14260 USA
[5] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA
来源
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2007年 / 4629卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the design of the first large-scale IR test collection built for the Czech language. The creation of this collection also happens to be very challenging, as it is based on a continuous text stream from automatic transcription of spontaneous speech and thus lacks clearly defined document boundaries. All aspects of the collection building are presented, together with some general findings of initial experiments.
引用
收藏
页码:439 / +
页数:3
相关论文
共 50 条
  • [11] A SEARCHING PROCEDURE FOR INFORMATION-RETRIEVAL
    GOFFMAN, W
    INFORMATION STORAGE AND RETRIEVAL, 1964, 2 (02): : 73 - 78
  • [12] Speech information retrieval: a review
    Ryan P. Hafen
    Michael J. Henry
    Multimedia Systems, 2012, 18 : 499 - 518
  • [13] Speech information retrieval: a review
    Hafen, Ryan P.
    Henry, Michael J.
    MULTIMEDIA SYSTEMS, 2012, 18 (06) : 499 - 518
  • [14] The Seventeen Theoretical Constructs of Information Searching and Information Retrieval
    Jansen, Bernard J.
    Rieh, Soo Young
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (08): : 1517 - 1534
  • [15] Advances in information retrieval collection on the European conference on information retrieval 2023
    Kamps, Jaap
    Goeuriot, Lorraine
    Crestani, Fabio
    DISCOVER COMPUTING, 2024, 27 (01)
  • [16] A Test Collection for Coreferent Mention Retrieval
    Sankepally, Rashmi
    Chen, Tongfei
    Van Durme, Benjamin
    Oard, Douglas W.
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1209 - 1212
  • [17] A Test Collection for Interactive Lifelog Retrieval
    Gurrin, Cathal
    Schoeffmann, Klaus
    Joho, Hideo
    Munzer, Bernd
    Albatal, Rami
    Hopfgartner, Frank
    Zhou, Liting
    Dang-Nguyen, Duc-Tien
    MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 312 - 324
  • [18] PROXIMITY SEARCHING AS AN INFORMATION-RETRIEVAL TOOL
    VASTA, BM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1973, (AUG26): : 7 - 7
  • [19] Partial collection replication for information retrieval
    Lu, ZH
    McKinley, KS
    INFORMATION RETRIEVAL, 2003, 6 (02): : 159 - 198
  • [20] Partial Collection Replication for Information Retrieval
    Zhihong Lu
    Kathryn S. McKinley
    Information Retrieval, 2003, 6 : 159 - 198