Automatic query taxonomy generation for information retrieval applications

被引:13
作者
Chuang, SL [1 ]
Chien, LF [1 ]
机构
[1] Acad Sinica, Inst Sci Informat, Taipei 115, Taiwan
关键词
information retrieval; worldwide Web; search engines; query languages; taxonomy; cluster analysis;
D O I
10.1108/14684520310489032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is crucial for information retrieval systems to learn more about what users search for in order to fulfil the intent of searches. This paper introduces query taxonomy generation, which attempts to organise users' queries into a hierarchical structure of topic classes. Such a query taxonomy provides a basis for the in-depth analysis of users' queries on a larger scale and can benefit many information retrieval systems. The proposed approach to this problem consists of two computational processes: hierarchical query clustering to generate a query taxonomy from scratch, and query categorisation to place newly-arrived queries into the taxonomy. The results of the preliminary experiment have shown the potential of the proposed approach in generating taxonomies for queries, which may be useful in various Web information retrieval applications.
引用
收藏
页码:243 / 255
页数:13
相关论文
共 16 条
  • [11] Subject categorization of query terms for exploring Web users' search interests
    Pu, HT
    Chuang, SL
    Yang, C
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2002, 53 (08): : 617 - 630
  • [12] TERM-WEIGHTING APPROACHES IN AUTOMATIC TEXT RETRIEVAL
    SALTON, G
    BUCKLEY, C
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1988, 24 (05) : 513 - 523
  • [13] SULLIVAN D, 2002, DM REV
  • [14] Vaithyanathan S., 2000, Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence, P599
  • [15] Wen JR, 2001, P 10 INT C WORLD WID, P162, DOI DOI 10.1145/371920.371974
  • [16] Yang Y, 2001, 2001 SECOND INTERNATIONAL CONFERENCE ON ENGINEERING MATERIALS, VOL II, P133