WebReader: a mechanism for automating the search and collecting information from the World Wide Web

Cited by: 0
Authors
Chan, JCY [1 ]
Li, Q [1 ]
Affiliations
[1] REUTERS Asia Pte Ltd, Tech Dev Dept, Hong Kong, Hong Kong, Peoples R China
Source
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL II | 2000
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Current Web search engines are based on keyword search, and the relevance of a web page depends on the hit count for the keywords. As keyword matching is not at the same level as semantic matching, the search scope is unnecessarily broad and the precision (and recall) can be rather low. These problems give rise to undesirable performance in Web information searching. In this paper we describe a mechanism called WebReader, which is middleware between the browser and the Web for automating the search and collection of information from the Web. By facilitating meta-data specification in XML and manipulation in XSL, WebReader provides users with a centralized, structured, and categorized means to specify and query Web information. An experimental prototype based on XML, XSL and Java has been developed to show the feasibility and practicality of our approach through a real-life application example.
Pages: 47-54
Number of pages: 8