A novel approach for clustering sentiments in Chinese blogs based on graph similarity

被引:8
作者
Feng, Shi [1 ]
Pang, Jun [2 ]
Wang, Daling [1 ]
Yu, Ge [1 ]
Yang, Feng [1 ]
Xu, Dongping [2 ]
机构
[1] Northeastern Univ, Shenyang 110819, Peoples R China
[2] Wuhan Univ Technol, Wuhan 430070, Peoples R China
关键词
Blog mining; Sentiment analysis; Blog clustering; Graph-based representation; WEB;
D O I
10.1016/j.camwa.2011.07.043
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Blog clustering is an important approach for online public opinion analysis. The traditional clustering methods, usually group blogs by keywords, stories and timeline, which usually ignore opinions and emotions expressed in the blog articles. In this paper, an integrated graph-based model for clustering Chinese blogs by embedded sentiments is proposed. A novel graph-based representation and the corresponding clustering algorithm are applied on the Chinese blog search results. The proposed model SoB-graph considers not only sentiment words but also structural information in blogs. Experimental results show that comparing with the traditional graph-based document representation model and vector space document representation model, the proposed SOB-graph model has achieved better performance in clustering sentiments in Chinese blog documents. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2770 / 2778
页数:9
相关论文
共 27 条
  • [1] Agarwal Nitin, 2008, 2008 8th International Conference on Web Engineering (ICWE), P336, DOI 10.1109/ICWE.2008.9
  • [2] [Anonymous], VLDB
  • [3] [Anonymous], 2004, WWW 2004 WORKSH WEBL
  • [4] Bar-llan J., 2004, P WWW ALT PAP TRACK
  • [5] Bekkerman R, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P684
  • [6] Using cocitation information to estimate political orientation in web documents
    Efron, M
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 9 (04) : 492 - 511
  • [7] Fan T., 2009, SENTIMENT ORIENTED C
  • [8] Feng S, 2009, LECT NOTES COMPUT SC, V5678, P140, DOI 10.1007/978-3-642-03348-3_16
  • [9] Feng S, 2009, LECT NOTES COMPUT SC, V5446, P332, DOI 10.1007/978-3-642-00672-2_30
  • [10] Hossain M.S. Angryk., 2007, IEEE ICDM Workshop on Mining Graphs and Complex Structures, USA, P417, DOI DOI 10.1109/ICDMW.2007.104