From wrapping to knowledge

被引:12
作者
Arjona, Jose L. [1 ]
Corchuelo, Rafael [2 ]
Ruiz, David [2 ]
Toro, Miguel
机构
[1] Univ Huelva, Dept Elect Comp Sci Syst & Automat Engn, Huelva, Spain
[2] Univ Seville, Dept Comp Languages & Syst, Seville, Spain
关键词
Enterprise Information Integration; wrappers; semiautomatic annotation;
D O I
10.1109/TKDE.2007.31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One the most challenging problems for Enterprise Information Integration is to deal with heterogeneous information sources on the Web. The reason is that they usually provide information that is in human-readable form only, which makes it difficult for a software agent,to understand it. Current solutions build on the idea of annotating the information with semantics. If the information is unstructured, proposals such as S-CREAM, MnM, or Armadillo may be effective enough since they rely on using natural language processing techniques; furthermore, their accuracy can be improved by using redundant information on the Web, as C-PANKOW has proved recently. If the information is structured and closely related to a back-end database, Deep Annotation ranges among the most effective proposals, but it requires the information providers to modify their applications; if Deep Annotation is not applicable, the easiest solution consists of using a wrapper and transforming its output into annotations. In this paper, we prove that this transformation can be automated by means of an efficient, domain-independent algorithm. To the best of our knowledge, this is the first attempt to devise and formalize such a systematic, general solution.
引用
收藏
页码:310 / 323
页数:14
相关论文
共 35 条
  • [1] Adelberg Brad, 1998, SIGMOD, 1998, P283, DOI [10.1145/276304.276330, DOI 10.1145/276304.276330]
  • [2] Agarwal S, 2003, LECT NOTES COMPUT SC, V2870, P211
  • [3] Baader R, 2005, LECT NOTES ARTIF INT, V2605, P228
  • [4] A defeasible logic reasoner for the semantic web
    Bassiliades, Nick
    Antoniou, Grigoris
    Vlahavas, Ioannis
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2006, 2 (01) : 1 - 41
  • [5] Bry F., 2005, International Journal on Semantic Web and Information Systems, V1, P1, DOI 10.4018/jswis.2005040101
  • [6] Reconfigurable Web wrapper agents
    Chang, CH
    Siek, H
    Lu, JJ
    Hsu, CN
    Chiou, JJ
    [J]. IEEE INTELLIGENT SYSTEMS, 2003, 18 (05) : 34 - 40
  • [7] Cimiano P., 2005, P 14 INT C WORLD WID, P332
  • [8] CIMIANO P, 2004, P 13 INT C WORLD WID, P462, DOI DOI 10.1145/988672.988735
  • [9] Ciravegna F., 2002, Proceedings of SIGIR 2002. Twenty-Fifth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P367
  • [10] Ciravegna F, 2004, LECT NOTES COMPUT SC, V3053, P312