Information retrieval test collection for searching spontaneous Czech speech

被引:0
作者
Ircing, Pavel [1 ]
Pecina, Pavel [2 ]
Oard, Douglas W. [3 ]
Wang, Jianqiang [4 ]
White, Ryen W. [5 ]
Hoidekr, Jan [1 ]
机构
[1] Univ W Bohemia, Fac Sci Appl, Dept Cybernet, Univ 8, Plzen 30614, Czech Republic
[2] Charles Univ Prague, Inst Formal & Appl Linguist, Prague 11800, Czech Republic
[3] Univ Maryland, Coll Informat Studies, College Pk, MD 20742 USA
[4] SUNY Buffalo, Dept Informat & Lib Studies, Buffalo, NY 14260 USA
[5] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA
来源
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2007年 / 4629卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the design of the first large-scale IR test collection built for the Czech language. The creation of this collection also happens to be very challenging, as it is based on a continuous text stream from automatic transcription of spontaneous speech and thus lacks clearly defined document boundaries. All aspects of the collection building are presented, together with some general findings of initial experiments.
引用
收藏
页码:439 / +
页数:3
相关论文
共 50 条
[41]   The collection and dissemination of social science information in the Czech Republic [J].
Mateju, P ;
Tucker, A .
INFORMATION DISSEMINATION AND ACCESS IN RUSSIA AND EASTERN EUROPE: PROBLEMS AND SOLUTIONS IN EAST AND WEST, 1998, 26 :158-163
[42]   Combining Multiple Models for Speech Information Retrieval [J].
Alzghool, Muath ;
Inkpen, Diana .
SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, :132-135
[43]   Video retrieval using speech and image information [J].
Hauptmann, AG ;
Jin, R ;
Ng, TD .
STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2003, 2003, 5021 :148-159
[44]   INFORMATION RETRIEVAL METHODS FOR AUTOMATIC SPEECH RECOGNITION [J].
Xiao, Xiaoqiang ;
Droppo, Jasha ;
Acero, Alex .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :5550-5553
[45]   VERBS OF SPEECH AND METALINGUISTIC INFORMATION - CZECH - SOLTYS,O [J].
MACUROVA, A .
CESKA LITERATURA, 1987, 35 (03) :282-285
[46]   From bibliography to test collection: Enhancing topical relevance assessment for bibliographic information retrieval system evaluation [J].
Bean, CA ;
Selden, CR ;
Rindflesch, TC ;
Aronson, AR .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1999, :1022-1022
[47]   Simulating information retrieval test collections [J].
Hawking D. ;
Billerbeck B. ;
Thomas P. ;
Craswell N. .
Synthesis Lectures on Information Concepts, Retrieval, and Services, 2020, 12 (02) :1-184
[48]   On test collections for adaptive information retrieval [J].
Voorhees, Ellen M. .
INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (06) :1879-1885
[49]   THE RETRIEVAL OF INFORMATION - A TEST FOR THE COMPATIBILITY PRINCIPLE [J].
KEKENBOSCH, C .
ANNEE PSYCHOLOGIQUE, 1983, 83 (01) :25-37
[50]   Practical Searching Over Encrypted Data By Private Information Retrieval [J].
Yoshida, Rei ;
Cui, Yang ;
Sekino, Tomohiro ;
Shigetomi, Rie ;
Otsuka, Akira ;
Imai, Hideki .
2010 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE GLOBECOM 2010, 2010,