Ontology-Based Web Information Extraction

被引:0
作者
Mo, Qian [1 ]
Chen, Yi-hong [1 ]
机构
[1] Beijing Technol & Business Univ, Comp & Informat Engn Coll, Beijing 100048, Peoples R China
来源
COMMUNICATIONS AND INFORMATION PROCESSING, PT 1 | 2012年 / 288卷
关键词
Web information extraction; Ontology; DOM (Document Object Model);
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article aims to give introduce to an Ontology-based Web information extraction system and the related content about Web information extraction and Ontology. This paper introduces the modules of the Web information extraction system, including: Web page preprocessing, DOM tree formation, Positioning information domain, Lexical analysis, Ontology construction, Ontology analysis, Keyword management, Rule generation, Information extraction and Information storage. And then, it also describes the experimental results. Finally, it describes the development trends and challenges of Ontology-based Web information extraction.
引用
收藏
页码:118 / 126
页数:9
相关论文
共 8 条
[1]  
Chen J, 2007, OVERVIEW ONTOLOGY BA
[2]  
DENG ZH, 2002, OVERVIEW ONTOLOGY
[3]  
Eikvil L., 1999, 945 NORW COMP CTR
[4]  
Gomez-Perez A., 1999, P IJCAI 99 WORKSH ON, P1, DOI DOI 10.1006/IJHC.2000.0415
[5]  
Liu B., 2003, KDD '03: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, P601
[6]  
LIU J, 2007, FUJIAN COMPUTER
[7]  
LIU Jian-wei, 2010, COMPUTER ENG
[8]  
Yang X.-Q, 2009, INFORM TECHNOLOGY