Research and Design of Internet Public Opinion Analysis System

被引:17
作者
Guan, Quanlong [1 ]
Ye, Saizhi [2 ]
Yao, Guoxiang [2 ]
Zhang, Huanming [1 ]
Wei, Linfeng [2 ]
Song, Gazi [2 ]
He, Kejing [3 ]
机构
[1] Jinan Univ, Network & Educ Technol Ctr, Guangzhou 510630, Guangdong, Peoples R China
[2] Jinan Univ, Coll Informat Sci Technol, Guangzhou 510630, Guangdong, Peoples R China
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
来源
2009 IITA INTERNATIONAL CONFERENCE ON SERVICES SCIENCE, MANAGEMENT AND ENGINEERING, PROCEEDINGS | 2009年
关键词
Internet public opinion; Web page summarization; test classification; vector space model;
D O I
10.1109/SSME.2009.62
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Internet is becoming a spreading platform for the public opinion. It is important to grasp the internet public opinion in time and understand the trends of their opinion correctly. Text classification plays a fundamental role in a number of information management and retrieval tasks. But web-page classification is much more difficult than pure-text classification due to a large variety of noisy information embedded in Web pages. In this paper, we propose a system scheme for the analysis of the Internet public opinion (IPO).We apply web-page classification through summarization to extract the most relevant content from the Web pages and then pass them to standard text classification algorithms(NB or SVM). We comprehensive use text classification and text clustering algorithms, which have been shown to be efficient and effective for singly using. Through the result of the experiment,we have proved the superiority of the system system's architecture in the system design.
引用
收藏
页码:173 / +
页数:2
相关论文
共 12 条
[1]  
[Anonymous], 2004, P 27 ANN INT ACM SIG, DOI DOI 10.1145/1008992.1009035
[2]  
[Anonymous], P 10 INT WORLD WID W
[3]  
DAVID MW, 2003, INT C ADV INF NETW A, P716
[4]  
JORGENSEN P, 2005, INCORPORATING CONTEX, P1081
[5]  
LI PY, 2008, TEXT DOCUMENT CLUSTE, P381
[6]  
SHEN PD, P INFORM PROCESSING, V43
[7]  
SHEN Y, 2003, IMPROVING PERFORMANC
[8]   Security, Internet connectivity and aircraft data networks [J].
Thanthry, N ;
Ali, MS ;
Pendse, R .
39TH ANNUAL 2005 INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY, PROCEEDINGS, 2005, :251-255
[9]  
Turney P. D., 2002, ARXIVCS0212012
[10]  
YAO GX, 2008, P ISECS CCCM 2008, V2, P353