Identification and classification of Deep Web query interfaces via ontology

被引:0
作者
Qiang B. [1 ,2 ]
Cai G. [1 ]
Wen Y. [1 ]
Wu C. [2 ]
Tang C. [1 ]
机构
[1] School of Computer Science and Engineering, Guilin University of Electronic Technology
[2] College of Computer and Information Science, Southwest University
关键词
Deep Web; Ontology; Query interface classification; Query interface identification; Schema extraction;
D O I
10.4156/ijact.vol3.issue9.5
中图分类号
学科分类号
摘要
In order to obtain the large quantities of valuable information on Deep Web, it is required to discover the related individual query interface and design the integrated query interface on which user query request can be submitted. The key challenges are to identify and classify the Deep Web query interface accurately. In view of the regular data of Deep Web, we consider to construct the Deep Web domain ontology to help identify and classify the Deep Web query interfaces. Correspondingly, one domain ontology construction approach by referring to the hierarchal schema of query interface is proposed. Based on the constructed domain ontology, the modified interface expression algorithm and the identification and classification algorithm for Deep Web query interfaces are also presented, respectively. Experimental results show the effectiveness of our proposed algorithms.
引用
收藏
页码:33 / 40
页数:7
相关论文
共 19 条
  • [1] Michael K.B., The Deep Web: Surfacing the Hidden Value (White Paper), (2000)
  • [2] Chang K., He B., Li C., Patel M., Zhang Z., Structured databases on the Web: Observations and implications, Proceedings of the ACM SIGMOD Record, pp. 61-70, (2004)
  • [3] Liu W., Meng X., Meng W., ViDE: A vision-based approach for Deep Web data extraction, IEEE Transactions On Knowledge and Data Engineering, 22, 3, pp. 447-460, (2010)
  • [4] Hatem A.M., Aboulnaga A., Schema clustering and retrieval for multi-domain pay-as-you-go data integration systems, Proceedings of the 2010 International Conference On Management of Data, pp. 411-422, (2010)
  • [5] Cope J., Craswell N., Hawking D., Automated discovery of search interfaces on the Web, Proceedings of the 14th Australasian Database Conference On Research and Practice of InformationTechnology, pp. 181-189, (2003)
  • [6] Lage J.P., da Silva A.S., Golgher P.B., Et al., Automatic generation of agents for collecting hidden Web pages for data extraction, Data & Knowledge Engineering, 49, 2, pp. 177-196, (2004)
  • [7] Ipeirotis P.G., Gravano L., Sahami M., QProber: A system for automatic classification of hidden-Web databases, ACM TOIS, 21, 1, pp. 1-41, (2003)
  • [8] Gong Z., Zhang J., Liu Q., Hidden-web database exploration, Proceedings of the 6th International Conference On Intelligent Systems Design and Applications, pp. 838-843, (2006)
  • [9] Lin P., Zhao L., Research on the expression and extraction of WDB, Journal of Convergence Information Technology, 5, 3, pp. 103-112, (2010)
  • [10] Zhao H., Study of Deep Web sources classification technology, Proceedings of 2nd International Conference On Future Computer and Communication, pp. 324-326, (2010)