Ontology-Based Web Information Extraction

被引：0

作者：

Mo, Qian ^{[1
]}

Chen, Yi-hong ^{[1
]}

机构：

[1] Beijing Technol & Business Univ, Comp & Informat Engn Coll, Beijing 100048, Peoples R China

来源：

COMMUNICATIONS AND INFORMATION PROCESSING, PT 1 | 2012年 / 288卷

关键词：

Web information extraction; Ontology; DOM (Document Object Model);

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article aims to give introduce to an Ontology-based Web information extraction system and the related content about Web information extraction and Ontology. This paper introduces the modules of the Web information extraction system, including: Web page preprocessing, DOM tree formation, Positioning information domain, Lexical analysis, Ontology construction, Ontology analysis, Keyword management, Rule generation, Information extraction and Information storage. And then, it also describes the experimental results. Finally, it describes the development trends and challenges of Ontology-based Web information extraction.

引用

页码：118 / 126

页数：9