Study of Extraction for Web Pages Information Based on XML

Cited by: 0

Authors:

Li, Suming [1]

Affiliations:

[1] Nanchang Inst Sci & Technol, Sch Natl Educ, Nanchang 330108, Peoples R China

Source:

PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS | 2016 / Vol. 81

Keywords:

XML; web pages; information extraction; knowledge base;

DOI:

Not available

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08;

Abstract:

This paper proposes an XML-based web information platform that combines the strengths of existing extraction techniques. First, the platform automatically extracts key information from web pages using XML technology. Next, it converts this key information into structured, extensible XML documents. Finally, it derives extraction rules from a group of similar pages and applies those rules to complete the extraction of web page information.
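
The pipeline described in the abstract (derive rules from similar pages, extract key information, emit XML) can be illustrated with a minimal sketch. The abstract does not specify the paper's actual algorithm, so everything here is an assumption made for illustration: the function names `path_texts`, `induce_rule`, and `extract_to_xml` are hypothetical, the "rule" is simply the set of tag paths whose text differs across similar pages, and the `<record>`/`<field>` output format is invented. Only the Python standard library is used.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# Tags that never have a closing tag, so they are not pushed on the path stack.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class PathTextParser(HTMLParser):
    """Collect (tag path, text) pairs such as ('html/body/h1', 'Title')."""

    def __init__(self):
        super().__init__()
        self.stack = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed inner tags.
        if tag in self.stack:
            while self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.items.append(("/".join(self.stack), text))


def path_texts(html):
    parser = PathTextParser()
    parser.feed(html)
    parser.close()
    return parser.items


def induce_rule(pages):
    """Assumed rule induction: keep paths present on every page whose text varies.

    Simplification: if a path repeats within one page, only its last text is used.
    """
    per_page = [dict(path_texts(page)) for page in pages]
    shared = set(per_page[0]).intersection(*per_page[1:])
    return sorted(p for p in shared if len({d[p] for d in per_page}) > 1)


def extract_to_xml(html, rule):
    """Apply the rule to one page and serialize the extracted fields as XML."""
    fields = dict(path_texts(html))
    record = ET.Element("record")
    for path in rule:
        if path in fields:
            field = ET.SubElement(record, "field", {"path": path})
            field.text = fields[path]
    return ET.tostring(record, encoding="unicode")


if __name__ == "__main__":
    similar_pages = [
        "<html><body><h1>Shop</h1><div><span>Pen</span><b>$2</b></div></body></html>",
        "<html><body><h1>Shop</h1><div><span>Ink</span><b>$5</b></div></body></html>",
    ]
    rule = induce_rule(similar_pages)
    new_page = "<html><body><h1>Shop</h1><div><span>Cap</span><b>$9</b></div></body></html>"
    print(extract_to_xml(new_page, rule))
```

In this sketch, text that is identical across the sample pages (the shared `h1` heading) is treated as template boilerplate and dropped, while text that varies is treated as key information. A real system would likely need a more robust rule language, such as XPath expressions with positional predicates, to handle repeated elements.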

Citation

Pages: 829-832

Page count: 4