Automatic query taxonomy generation for information retrieval applications

被引:13
作者
Chuang, SL [1 ]
Chien, LF [1 ]
机构
[1] Acad Sinica, Inst Sci Informat, Taipei 115, Taiwan
关键词
information retrieval; worldwide Web; search engines; query languages; taxonomy; cluster analysis;
D O I
10.1108/14684520310489032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is crucial for information retrieval systems to learn more about what users search for in order to fulfil the intent of searches. This paper introduces query taxonomy generation, which attempts to organise users' queries into a hierarchical structure of topic classes. Such a query taxonomy provides a basis for the in-depth analysis of users' queries on a larger scale and can benefit many information retrieval systems. The proposed approach to this problem consists of two computational processes: hierarchical query clustering to generate a query taxonomy from scratch, and query categorisation to place newly-arrived queries into the taxonomy. The results of the preliminary experiment have shown the potential of the proposed approach in generating taxonomies for queries, which may be useful in various Web information retrieval applications.
引用
收藏
页码:243 / 255
页数:13
相关论文
共 16 条
  • [1] Beeferman D., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P407, DOI 10.1145/347090.347176
  • [2] BELEW RK, 2001, FINDING OUT COGNITIV
  • [3] Buckland M., 1999, D-Lib Magazine
  • [4] Chuang SL, 2002, 2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P75, DOI 10.1109/ICDM.2002.1183888
  • [5] Enriching Web taxonomies through subject categorization of query terms from search engine logs
    Chuang, SL
    Chien, LF
    [J]. DECISION SUPPORT SYSTEMS, 2003, 35 (01) : 113 - 127
  • [6] Dasarathy B.V., 1991, Nearest Neighbour (NN) Norms: NN Pattern Classification Techniques., V17, P441
  • [7] Larsen B., 1999, P 5 ACM SIGKDD INT C, P16, DOI [10.1145/312129.312186, DOI 10.1145/312129.312186]
  • [8] Lau T, 1999, CISM COUR L, P119
  • [9] AN EXAMINATION OF PROCEDURES FOR DETERMINING THE NUMBER OF CLUSTERS IN A DATA SET
    MILLIGAN, GW
    COOPER, MC
    [J]. PSYCHOMETRIKA, 1985, 50 (02) : 159 - 179
  • [10] Mirkin B., 1996, Mathematical Classification and Clustering