Study of Extraction for Web Pages Information Based on XML

被引:0
作者
Li, Suming [1 ]
机构
[1] Nanchang Inst Sci & Technol, Sch Natl Educ, Nanchang 330108, Peoples R China
来源
PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS | 2016年 / 81卷
关键词
XML; web pages; information extraction; knowledge base;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper proposes a web information platform based on XML. First, the information platform combines the advantages of existing different extraction technology, automatically extracts the key information in accordance with XML technology, next translates key information into structural and extensible XML documents, finally, concludes corresponding extraction rules by a group of similar pages, and then finishes the extraction for web pages information by these extraction rule.
引用
收藏
页码:829 / 832
页数:4
相关论文
共 6 条
[1]  
HU D D, 2010, J COMPUTER RES DEV, V9, P77
[2]  
LIANG H, 2009, J CHINESE COMPUTER S, V30, P8884
[3]   Solution for automatic Web review extraction [J].
Liu W. ;
Yan H.-L. ;
Xiao J.-G. ;
Zeng J.-X. .
Ruan Jian Xue Bao/Journal of Software, 2010, 21 (12) :3220-3236
[4]  
PENG H, 2012, J COMPUTER APPL, V32, P2361
[5]  
ZHANG S H, 2012, J HENAN U NATURAL SC, V11, P17
[6]  
ZHOU J, 2010, COMPUTER APPL, V11, P21