Research and Implementation of Real-time Automatic Web Page Classification System

被引:0
作者
Han, Weihong [1 ]
Zhu, Weihui [1 ]
Jia, Yan [1 ]
机构
[1] Natl Univ Def Technol, Comp Sch, Changsha, Hunan, Peoples R China
来源
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MATERIAL, MECHANICAL AND MANUFACTURING ENGINEERING | 2015年 / 27卷
关键词
Web Page Classification; security filtering; service discovery; service collection;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the development of Internet and communication technology, the Internet data growth rapidly, and the type of network services varied. According to the different properties of the network services, network services classification is the foundation of many network applications, including network service management, green Internet, network bandwidth usage category management, network reputation management, security filtering and so on. Due to the variety of web content and text length, the traditional classification methods can't effectively solve the problem of large-scale web page classification. In this paper, we design and implement a real-time automatic Web page classification system AWCS, including self-feedback system architecture, multi-dimensional network services classification standard, active and passive combining network service discovery and collection technology, automatic self-correction network service classification techniques. Performance tests show that the classification accuracy of AWCS is significantly higher than the traditional algorithms. This framework offers a promising approach for large-scale real-time network data classification system.
引用
收藏
页码:977 / 982
页数:6
相关论文
共 7 条
[1]  
Dai WY, 2006, LECT NOTES COMPUT SC, V4016, P435
[2]  
Godoy D, 2010, EXPLOITING SOCIAL CA
[3]  
Gowri Shanthi S., 2012, INT J ADV RES COMPUT, V1
[4]  
Kou G, 2012, ANN OPERATIONS RES
[5]  
Lai W, 2011, US Patent, Patent No. [8,051,083, 8051083]
[6]  
Lindemann C., 2006, P 8 INT WORKSH WEB I
[7]  
Zhang Tong, 2006, KDD 06