Clustering to Improve Microblog Stream Summarization

被引:2
作者
Olariu, Andrei [1 ]
机构
[1] Univ Bucharest, Fac Math & Comp Sci, Bucharest, Romania
来源
14TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2012) | 2012年
关键词
microblog; summarization; text clustering;
D O I
10.1109/SYNASC.2012.10
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Microblogging has shown a massive increase in use over the past couple of years. According to recent statistics, Twitter (the most popular microblogging platform) has over 340 million posts per day coming from its 140 million active users. In order to help users manage this information overload or to assess the full information potential of such microblogging streams (sequences of posts), a few summarization algorithms have been proposed. However, they are designed to work on a stream of posts filtered on a particular keyword, whereas most streams suffer from noise or have posts referring to more than one topic. Because of this, the generated summary is incomplete and even meaningless. We approach the problem of summarizing a stream and propose adding a layer of text clustering as a preprocessing step. We show how, by clustering posts into related groups and then applying a summarization algorithm, the quality of the summary improves.
引用
收藏
页码:220 / 226
页数:7
相关论文
共 44 条
[31]   Query-Relevant Sentiment Summarization Based on Facet Identification and Sentence Clustering [J].
Moulton, Matthew ;
Gao, Sophie ;
Ng, Yiu-Kai .
37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, :704-712
[32]   Stream Clustering of Chat Messages with Applications to Twitch Streams [J].
Carnein, Matthias ;
Assenmacher, Dennis ;
Trautmann, Heike .
ADVANCES IN CONCEPTUAL MODELING, ER 2017, 2017, 10651 :79-88
[33]   High-performance social networking: microblog community detection based on efficient interactive characteristic clustering [J].
Wang, Ru ;
Rho, Seungmin ;
Cai, Wandong .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (02) :1209-1221
[34]   High-performance social networking: microblog community detection based on efficient interactive characteristic clustering [J].
Ru Wang ;
Seungmin Rho ;
Wandong Cai .
Cluster Computing, 2017, 20 :1209-1221
[35]   Study on microblog public opinion data mining algorithm based on multi-visual clustering model [J].
Li, Lin-lin ;
Hou, Wei-zhen ;
Liu, Jing .
INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2020, 13 (02) :151-165
[36]   Disastrous Event and Sub-Event Detection From Microblog Posts Using Bi-Clustering Method [J].
Roy Chowdhury, Shatadru ;
Basu, Srinka ;
Maulik, Ujjwal .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) :161-170
[37]   Lifelong Learning Augmented Short Text Stream Clustering Method [J].
Qiang, Jipeng ;
Xu, Wanyin ;
Li, Yun ;
Yuan, Yunhao ;
Zhu, Yi .
IEEE ACCESS, 2021, 9 (09) :70493-70501
[38]   An Improved Web Information Summarization Method Using Sentence Similarity-Based Soft Clustering [J].
Tang, Jun ;
Zhao, Xiaojuan .
2009 INTERNATIONAL CONFERENCE ON FUTURE BIOMEDICAL INFORMATION ENGINEERING (FBIE 2009), 2009, :292-295
[39]   Video Summarization through Total Variation, Deep Semi-supervised Autoencoder and Clustering Algorithms [J].
da Silva, Eden Pereira ;
Ramos, Eliaquim Monteiro ;
da Silva, Leandro Tavares ;
Cardoso, Jaime S. ;
Giraldi, Gilson A. .
VISAPP: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4: VISAPP, 2020, :315-322
[40]   Multidocument Arabic Text Summarization Based on Clustering and Word2Vec to Reduce Redundancy [J].
Abdulateef, Samer ;
Khan, Naseer Ahmed ;
Chen, Bolin ;
Shang, Xuequn .
INFORMATION, 2020, 11 (02)