Processing queries with metrical constraints in XML-based IR systems

被引:1
作者
Klein, Shmuel T. [1 ]
机构
[1] Bar Ilan Univ, Dept Comp Sci, IL-52900 Ramat Gan, Israel
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2008年 / 59卷 / 01期
关键词
D O I
10.1002/asi.20734
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
XML documents combine features from classical IR systems allowing free text, with explicit structures as in databases. Many query languages have been specially designed for IR applications on XML documents. This work concentrates on a special type of language for which the problem of processing queries including metrical constraints is investigated. The main question is how to define the distance between terms in different locations of the XML tree in an intuitively justifiable way, without jeopardizing the ability to get good retrieval results in terms of recall and precision. A new definition is given and its usefulness is shown on several examples from the INEX collection.
引用
收藏
页码:86 / 97
页数:12
相关论文
共 22 条
  • [1] ANH VN, 2002, P 1 WORKSH INEX DAGS, P99
  • [2] BAEZAYATES R, 2002, J AM SOC INFORM SCI, V53
  • [3] Choueka Y., 1987, Proceedings of the Tenth Annual International ACMSIGIR Conference on Research and Development in Information Retrieval, P306, DOI 10.1145/42005.42039
  • [4] CLARK J, 1999, WORLD WIDE WEB CONSO
  • [5] EquiX - A search and query language for XML
    Cohen, S
    Kanza, Y
    Kogan, Y
    Sagiv, Y
    Nutt, W
    Serebrenik, A
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2002, 53 (06): : 454 - 466
  • [6] FERNANDEZ M, 2004, WORLD WIDE WEB CONSO
  • [7] Fraenkel A. S., 1976, Jurimetrics Journal, V16, P149
  • [8] Fraenkel AS, 1999, J AM SOC INFORM SCI, V50, P845, DOI 10.1002/(SICI)1097-4571(1999)50:10<845::AID-ASI2>3.0.CO
  • [9] 2-A
  • [10] THE USE OF SEMANTIC LINKS IN HYPERTEXT INFORMATION-RETRIEVAL
    FREI, HP
    STIEGER, D
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1995, 31 (01) : 1 - 13