The role of multi-word units in interactive information retrieval

被引:0
作者
Vechtomova, O [1 ]
机构
[1] Univ Waterloo, Dept Management Sci, Waterloo, ON N2L 3G1, Canada
来源
ADVANCES IN INFORMATION RETRIEVAL | 2005年 / 3408卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The paper presents several techniques for selecting noun phrases for interactive query expansion following pseudo-relevance feedback and a new phrase search method. A combined syntactico-statistical method was used for the selection of phrases. First, noun phrases were selected using a part-of-speech tagger and a noun-phrase chunker, and secondly, different statistical measures were applied to select phrases for query expansion. Experiments were also conducted studying the effectiveness of noun phrases in document ranking. We analyse the problems of phrase weighting and suggest new ways of addressing them. A new method of phrase matching and weighting was developed, which specifically addresses the problem of weighting overlapping and non-contiguous word sequences in documents.
引用
收藏
页码:403 / 420
页数:18
相关论文
共 30 条
[1]  
ALLAN J, 2004, P 12 TEXT RETR C NIS, P24
[2]  
[Anonymous], 2004, WORKSH DESCR
[3]  
[Anonymous], 1996, P 19 ANN INT ACM SIG, DOI DOI 10.1145/243199.243202
[4]  
[Anonymous], 1996, P COLING, DOI DOI 10.3115/992628.992639
[5]  
BANERJEE S, 2003, P 4 INT C INT TEXT P
[6]   Interactive searching and interface issues in the Okapi best match probabilistic retrieval system [J].
Beaulieu, M ;
Jones, S .
INTERACTING WITH COMPUTERS, 1998, 10 (03) :237-248
[7]  
BELY N, 1970, PROCEDURES ANAL SEMA
[8]  
Brill E, 1995, COMPUT LINGUIST, V21, P543
[9]  
CLARKE CLA, 1995, CS9507 U WAT
[10]  
Dunning T., 1993, Computational Linguistics, V19, P61