Clustering to Improve Microblog Stream Summarization

被引:2
|
作者
Olariu, Andrei [1 ]
机构
[1] Univ Bucharest, Fac Math & Comp Sci, Bucharest, Romania
来源
14TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2012) | 2012年
关键词
microblog; summarization; text clustering;
D O I
10.1109/SYNASC.2012.10
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Microblogging has shown a massive increase in use over the past couple of years. According to recent statistics, Twitter (the most popular microblogging platform) has over 340 million posts per day coming from its 140 million active users. In order to help users manage this information overload or to assess the full information potential of such microblogging streams (sequences of posts), a few summarization algorithms have been proposed. However, they are designed to work on a stream of posts filtered on a particular keyword, whereas most streams suffer from noise or have posts referring to more than one topic. Because of this, the generated summary is incomplete and even meaningless. We approach the problem of summarizing a stream and propose adding a layer of text clustering as a preprocessing step. We show how, by clustering posts into related groups and then applying a summarization algorithm, the quality of the summary improves.
引用
收藏
页码:220 / 226
页数:7
相关论文
共 50 条
  • [1] Multimedia Summarization for Social Events in Microblog Stream
    Bian, Jingwen
    Yang, Yang
    Zhang, Hanwang
    Chua, Tat-Seng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (02) : 216 - 228
  • [2] Continuous Summarization for Microblog Streams Based on Clustering
    Wu, Qunhui
    Lv, Jianghua
    Ma, Shilong
    NEURAL INFORMATION PROCESSING, PT II, 2015, 9490 : 371 - 379
  • [3] On Multimodal Microblog Summarization
    Saini, Naveen
    Saha, Sriparna
    Bhattacharyya, Pushpak
    Mrinal, Shubhankar
    Mishra, Santosh Kumar
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 9 (05) : 1317 - 1329
  • [4] Hierarchical Stream Clustering Based NEWS Summarization System
    Raja, M. Arun Manicka
    Swamynathan, S.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (01): : 1263 - 1280
  • [5] Ensemble Algorithms for Microblog Summarization
    Dutta, Soumi
    Chandra, Vibhash
    Mehra, Kanav
    Das, Asit Kumar
    Chakraborty, Tanmoy
    Ghosh, Saptarshi
    IEEE INTELLIGENT SYSTEMS, 2018, 33 (03) : 4 - 14
  • [6] Continuous Summarization over Microblog Threads
    Song, Liangjun
    Zhang, Ping
    Bao, Zhifeng
    Sellis, Timos
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT II, 2017, 10178 : 511 - 526
  • [7] Combining Graph Connectivity and Genetic Clustering to Improve Biomedical Summarization
    Menendez, Hector D.
    Plaza, Laura
    Camacho, David
    2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 2740 - 2747
  • [8] Topic Detection from Large Scale of Microblog Stream with High Utility Pattern Clustering
    Huang, Jiajia
    Peng, Min
    Wang, Hua
    PIKM'15: PROCEEDINGS OF THE 8TH PH.D. WORKSHOP IN INFORMATION AND KNOWLEDGE MANAGEMENT, 2015, : 3 - 10
  • [9] Social Stream Clustering to Improve Events Extraction
    Jenhani, Ferdaous
    Gouider, Mohamed Salah
    Ben Said, Lamjed
    INTELLIGENT DECISION TECHNOLOGIES 2017, KES-IDT 2017, PT II, 2018, 73 : 319 - 329
  • [10] Topic and sentiment aware microblog summarization for twitter
    Syed Muhammad Ali
    Zeinab Noorian
    Ebrahim Bagheri
    Chen Ding
    Feras Al-Obeidat
    Journal of Intelligent Information Systems, 2020, 54 : 129 - 156