Disambiguation strategies for Cross-Language Information Retrieval

被引:0
作者
Hiemstra, D [1 ]
de Jong, F [1 ]
机构
[1] Univ Twente, Ctr Telemat & Informat Technol, NL-7500 AE Enschede, Netherlands
来源
RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS | 1999年 / 1696卷
关键词
Cross-Language Information Retrieval; statistical machine translation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper gives an overview of tools and methods for Cross-Language Information Retrieval (CLIR) that arp developed within the Twenty-One project. The tools and methods are evaluated with the TREC CLIR task document collection using Dutch queries on the English document base. The main issue addressed here is an evaluation of two approaches to disambiguation. The underlying question is whether a lot of effort should be: put in finding the correct translation for each query term before searching, or whether searching with more than one possible translation leads to better results? The experimental study suggests that the quality of search methods is more important than the quality of disambiguation methods. Good retrieval methods are able to disambiguate translated queries implicitly during searching.
引用
收藏
页码:274 / 293
页数:20
相关论文
共 50 条
[21]   Research on Chinese-English cross-language information retrieval [J].
Zhang, Tao ;
Zhang, Yue-Jie .
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, :2591-+
[22]   Dictionary-based techniques for cross-language information retrieval [J].
Levow, GA ;
Oard, DW ;
Resnik, P .
INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (03) :523-547
[23]   Creating and exploiting a comparable corpus in cross-language information retrieval [J].
Talvensaari, Tuomas ;
Laurikkala, Jorma ;
Jarvelin, Kalervo ;
Juhola, Martti ;
Keskustalo, Heikki .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2007, 25 (01)
[24]   Research on English-Chinese cross-language information retrieval [J].
Zhang, Yue-Jie ;
Zhang, Tao .
PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, :3448-+
[25]   The Dilution/Concentration conditions for cross-language information retrieval models [J].
Li, Bo ;
Gaussier, Eric ;
Yang, Dan .
INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (02) :291-302
[26]   Comparative Analysis of Information Retrieval Models on Quran Dataset in Cross-Language Information Retrieval Systems [J].
Taan, Ayman A. ;
Khan, Shafiq Ur Rehman ;
Raza, Ali ;
Hanif, Ayaz Muhammad ;
Anwar, Hira .
IEEE ACCESS, 2021, 9 :169056-169067
[27]   Term disambiguation techniques based on target document collection for cross-language information retrieval: An empirical comparison of performance between techniques [J].
Kishida, Kazuaki .
INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (01) :103-120
[28]   Improving Retrieval Performance Of English-Hindi Based Cross-Language Information Retrieval [J].
Varshney, Saurabh ;
Bajpai, Jyoti .
PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE IN MOOC, INNOVATION AND TECHNOLOGY IN EDUCATION (MITE), 2013, :300-305
[29]   An axiomatic approach to corpus-based cross-language information retrieval [J].
Rahimi, Razieh ;
Montazeralghaem, Ali ;
Shakery, Azadeh .
INFORMATION RETRIEVAL JOURNAL, 2020, 23 (03) :191-215
[30]   Using Comparable Corpora to Improve the Effectiveness of Cross-Language Information Retrieval [J].
Sadat, Fatiha .
ADVANCES IN NATURAL LANGUAGE PROCESSING, 2010, 6233 :320-331