A method for supporting document selection in cross-language information retrieval and its evaluation

Cited by: 0
Authors
Suzuki, M [1]
Inoue, N [1]
Hashimoto, K [1]
Affiliations
[1] KDD Res & Dev Labs Inc, Kamifukuoka, Saitama 3568502, Japan
Source
COMPUTERS AND THE HUMANITIES | 2001 / Vol. 35 / No. 4
Keywords
browsing support; cross-language information retrieval; partial translation; term list
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
When a typically vague search request returns many results, users need useful clues for selecting the content they want. Such support is indispensable, and matters far more than in monolingual retrieval, when users must filter cross-language retrieval results. This paper describes an attempt to provide appropriate translations of the major keywords in each document of a cross-language information retrieval (CLIR) result, as browsing support for users. Appropriate translations are determined from the word co-occurrence distribution in the translation target language, because aligned parallel (multilingual) corpora are difficult to obtain for actual WWW content. The proposed method provides higher-quality keyword translation and thus more effective support for identifying the target documents in the retrieval result. We report the advantage of this browsing support technique through evaluation experiments, including a comparison with the condition of referring to a translated document summary, and discuss related issues to be examined towards more effective cross-language information extraction.
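The co-occurrence idea in the abstract can be illustrated with a minimal sketch. It assumes that each source keyword has a few candidate translations from a bilingual dictionary and that co-occurrence counts from a target-language corpus are available. The function names, the toy candidate table, and the counts are all hypothetical and are not taken from the paper; the exhaustive scoring is a simple stand-in, not the authors' actual procedure.

```python
from itertools import combinations, product


def cooccurrence(counts, a, b):
    """Look up a target-language co-occurrence count, ignoring word order."""
    return counts.get((a, b), counts.get((b, a), 0))


def select_translations(candidates, counts):
    """Pick one translation per keyword so that the chosen target-language
    words co-occur with each other as often as possible.

    candidates: dict mapping each source keyword to its candidate translations
    counts:     dict mapping (word, word) pairs to co-occurrence counts
    """
    keywords = list(candidates)
    best_score, best_choice = -1, None
    # Try every combination of one candidate per keyword (fine for short lists).
    for choice in product(*(candidates[k] for k in keywords)):
        score = sum(cooccurrence(counts, a, b) for a, b in combinations(choice, 2))
        if score > best_score:
            best_score, best_choice = score, choice
    return dict(zip(keywords, best_choice))


# Hypothetical toy data: two ambiguous source keywords and invented counts.
candidates = {
    "src_bank": ["bank", "shore"],
    "src_interest": ["interest", "curiosity"],
}
counts = {
    ("bank", "interest"): 12,
    ("bank", "curiosity"): 1,
    ("shore", "interest"): 1,
}

result = select_translations(candidates, counts)
print(result)
```

Because only target-language statistics are consulted, a sketch like this needs no aligned parallel corpus, which is the constraint the abstract highlights for WWW content.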
Pages: 421-438
Page count: 18