Information retrieval test collection for searching spontaneous Czech speech

被引:0
|
作者
Ircing, Pavel [1 ]
Pecina, Pavel [2 ]
Oard, Douglas W. [3 ]
Wang, Jianqiang [4 ]
White, Ryen W. [5 ]
Hoidekr, Jan [1 ]
机构
[1] Univ W Bohemia, Fac Sci Appl, Dept Cybernet, Univ 8, Plzen 30614, Czech Republic
[2] Charles Univ Prague, Inst Formal & Appl Linguist, Prague 11800, Czech Republic
[3] Univ Maryland, Coll Informat Studies, College Pk, MD 20742 USA
[4] SUNY Buffalo, Dept Informat & Lib Studies, Buffalo, NY 14260 USA
[5] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA
来源
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2007年 / 4629卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the design of the first large-scale IR test collection built for the Czech language. The creation of this collection also happens to be very challenging, as it is based on a continuous text stream from automatic transcription of spontaneous speech and thus lacks clearly defined document boundaries. All aspects of the collection building are presented, together with some general findings of initial experiments.
引用
收藏
页码:439 / +
页数:3
相关论文
共 50 条
  • [1] Czech Spontaneous Speech Collection and Annotation: The Database of Technical Lectures
    Rajnoha, Josef
    Pollak, Petr
    CROSS-MODAL ANALYSIS OF SPEECH, GESTURES, GAZE AND FACIAL EXPRESSIONS, 2009, 5641 : 377 - 385
  • [2] A Bayesian Framework for XML Information Retrieval: Searching and Learning with the INEX Collection
    Benjamin Piwowarski
    Patrick Gallinari
    Information Retrieval, 2005, 8 : 655 - 681
  • [3] A Bayesian framework for XML information retrieval: Searching and learning with the INEX collection
    Piwowarski, B
    Gallinari, P
    INFORMATION RETRIEVAL, 2005, 8 (04): : 655 - 681
  • [4] Test collection based evaluation of information retrieval systems
    Sanderson M.
    Foundations and Trends in Information Retrieval, 2010, 4 (04): : 247 - 375
  • [5] Comparison of Retrieval Approaches and Blind Relevance Feedback Methods Within the Czech Speech Information Retrieval
    Skorkovska, Lucie
    SPEECH AND COMPUTER, 2016, 9811 : 182 - 190
  • [6] Mahak: A test collection for evaluation of farsi information retrieval systems
    Esmaili, Kyumars Sheykh
    Abolhassani, Hassan
    Neshati, Mahmood
    Behrangi, Ehsan
    Rostami, Asreen
    Nasiri, Mojtaba Mohammadi
    2007 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2007, : 639 - +
  • [7] Information Retrieval Using a Macedonian Test Collection for Question Answering
    Armenska, Jasmina
    Tomovski, Aleksandar
    Zdravkova, Katerina
    Pehcevski, Jovan
    ICT INNOVATIONS 2010, 2011, 83 : 205 - +
  • [8] Web searching and information retrieval
    Pokorny, J
    COMPUTING IN SCIENCE & ENGINEERING, 2004, 6 (04) : 43 - 48
  • [9] Interaction in information searching and retrieval
    Beaulieu, M
    JOURNAL OF DOCUMENTATION, 2000, 56 (04) : 431 - 439
  • [10] ONLINE SEARCHING IN INFORMATION-RETRIEVAL
    BARRACLOUGH, ED
    JOURNAL OF DOCUMENTATION, 1977, 33 (03) : 220 - 238