How do search engines respond to some non-English queries?

Cited by: 29
Authors
Bar-Ilan, J
Gutman, T
Affiliations
[1] Hebrew Univ Jerusalem, IL-91905 Jerusalem, Israel
[2] Ex Libris Ltd, Jerusalem, Israel
Keywords
diacritics; inflections; morphological analysis; non-English languages; pre- and postfixes; search engines
DOI
10.1177/0165551505049255
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this article, we explored the capabilities of search engines for non-English languages. As a test case, we considered four languages: Russian, French, Hungarian and Hebrew. For each of these languages we tested three general search engines (AltaVista, FAST and Google) and some local search engines. Our results indicate that, in the examined cases, the general search engines ignore the special characteristics of non-English languages, and sometimes they do not even handle diacritics well. These findings are rather disturbing, since Google, for example, is very popular in non-English-speaking countries as well. Users either are not aware of what they miss when using search tools that do not take into account the structure and special characteristics of the specific language, or have no alternative but to use these search engines.
Pages: 13-28
Number of pages: 16