Concept hierarchy based text database categorization in a metasearch engine environment

Cited by: 0
Authors
Wang, WX [1]
Meng, WY [1]
Yu, C [1]
Affiliation
[1] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
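Source
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL I | 2000
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract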
Document categorization as a technique to improve the retrieval of useful documents has been extensively investigated. One important issue in a large-scale metasearch engine is to select text databases that are likely to contain useful documents for a given query. We believe that database categorization can be a potentially effective technique for good database selection, especially in the Internet environment where short queries are usually submitted. In this paper, we propose and evaluate several database categorization algorithms. This study indicates that while some document categorization algorithms could be adopted for database categorization, algorithms that take into consideration the special characteristics of databases may be more effective. Preliminary experimental results are provided to compare the proposed database categorization algorithms.
Pages: 283 - 290
Page count: 4
Related papers
50 in total
  • [1] Concept Hierarchy-Based Text Database Categorization
    Weiyi Meng
    Wenxian Wang
    Hongyu Sun
    Clement Yu
    Knowledge and Information Systems, 2002, 4 (2) : 132 - 150
  • [2] Domain semantic mapping of database metasearch engine
    Miao, Guangxiang
    Chen, Xiangyang
    Journal of Southeast University (English Edition), 2007, 23 (03) : 357 - 360
  • [3] Exploiting hierarchy in text categorization
    Weigend A.S.
    Wiener E.D.
    Pedersen J.O.
    Information Retrieval, 1999, 1 (3) : 193 - 216
  • [4] A Concept-based Model for Enhancing Text Categorization
    Shehata, Shady
    Karray, Fakhri
    Kamel, Mohamed
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 629 - 637
  • [5] The frame of a metasearch engine based on agent
    Chen, JJ
    Xue, Y
    Song, HT
    ACTIVE MEDIA TECHNOLOGY, 2003, : 132 - 137
  • [6] Use of Ontology to Support Concept-Based Text Categorization
    Lee, Yen-Hsien
    Tsao, Wan-Jung
    Chu, Tsai-Hsin
    DESIGNING E-BUSINESS SYSTEMS, 2009, 22 : 201 - +
  • [7] Text categorization based on concept indexing and principal component analysis
    Huang, K
    Ma, SP
    2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 51 - 56
  • [8] Text categorization based on term co-occurrence concept
    Ni, Maoshu
    Lin, Hongfei
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 222 - 225
  • [9] Concept indexing for automated text categorization
    Gómez, JM
    Cortizo, JC
    Puertas, E
    Ruiz, M
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2004, 3136 : 195 - 206
  • [10] BUILDING THE TEXT ENGINE DATABASE
    STEVENS, A
    DR DOBBS JOURNAL, 1995, 20 (02) : 119 - 122