Information retrieval test collection for searching spontaneous Czech speech

Cited by: 0
Authors
Ircing, Pavel [1]
Pecina, Pavel [2]
Oard, Douglas W. [3]
Wang, Jianqiang [4]
White, Ryen W. [5]
Hoidekr, Jan [1]
Affiliations
[1] Univ W Bohemia, Fac Sci Appl, Dept Cybernet, Univ 8, Plzen 30614, Czech Republic
[2] Charles Univ Prague, Inst Formal & Appl Linguist, Prague 11800, Czech Republic
[3] Univ Maryland, Coll Informat Studies, College Pk, MD 20742 USA
[4] SUNY Buffalo, Dept Informat & Lib Studies, Buffalo, NY 14260 USA
[5] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA
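Source
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2007 / Vol. 4629
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract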
This paper describes the design of the first large-scale IR test collection built for the Czech language. The creation of this collection also happens to be very challenging, as it is based on a continuous text stream from automatic transcription of spontaneous speech and thus lacks clearly defined document boundaries. All aspects of the collection building are presented, together with some general findings of initial experiments.
Pages: 439 / +
Page count: 3