Detecting prominent microblog users over crisis events phases

被引:8
作者
Bizid, Imen [1 ]
Nayef, Nibal [1 ]
Boursier, Patrice [1 ]
Doucet, Antoine [1 ]
机构
[1] Univ La Rochelle, L3i, La Rochelle, France
关键词
Information retrieval from microblogs; Prominent users prediction; User behavior modeling; Phase-temporal representation;
D O I
10.1016/j.is.2017.12.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
During crisis events such as disasters, the need for real-time information retrieval (IR) from microblogs becomes essential. However, the huge amount and the variety of the shared information in real time during such events over-complicates this task. Unlike existing IR approaches based on content analysis, we propose to tackle this problem by using user-centric IR approaches with identifying and tracking prominent microblog users who are susceptible to share relevant and exclusive information at an early stage of each analyzed event phase. This approach ensures real-time access to the valuable microblogs information required by the emergency teams. In this approach, we propose a phase-aware probabilistic model for predicting and ranking prominent microblog users over time according to their behavior using Mixture of Gaussians Hidden Markov Models (MoG-HMM). The model utilizes a new user representation which takes into account both the user and the event specificities over time. This user representation comprises the following new aspects (1) Modeling microblog users behavior evolution by considering the different event phases (2) Characterizing users activity over time through a temporal sequence representation (3) Time-series-based selection of the most discriminative features (4) prominent users prediction using probabilistic phase-aware models learned a priori. We have conducted experiments during flooding events: we trained our identification models using a dataset relative to the "Alpes-Maritimes floods" and we tested its identification performance using a new dataset relative to another flooding disaster "Herault floods". The achieved results show that our model significantly outperforms phase-unaware models and identifies most of the prominent users at an early stage of each event phase. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:173 / 188
页数:16
相关论文
共 26 条
[1]  
[Anonymous], 2011, P 4 ACM INT C WEB SE, DOI [DOI 10.1145/1935826.1935843, 10.1145/1935826.1935843]
[2]  
[Anonymous], 2011, P 20 INT C WORLD WID
[3]  
[Anonymous], 2010, P 3 ACM INT C WEB SE, DOI DOI 10.1145/1718487.1718520
[4]  
[Anonymous], 2010, P 2010 43 HAWAII INT, DOI DOI 10.1353/JSM.2016.0009
[5]   AN INEQUALITY WITH APPLICATIONS TO STATISTICAL ESTIMATION FOR PROBABILISTIC FUNCTIONS OF MARKOV PROCESSES AND TO A MODEL FOR ECOLOGY [J].
BAUM, LE ;
EAGON, JA .
BULLETIN OF THE AMERICAN MATHEMATICAL SOCIETY, 1967, 73 (03) :360-&
[6]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[7]  
Bizid I., 2015, KES AMSTA, V15, P1
[8]  
Bizid I, 2015, CIKM, P1715
[9]   Prominent Users Detection during Specific Events by Learning On-and Off-topic Features of User Activities [J].
Bizid, Imen ;
Nayef, Nibal ;
Boursier, Patrice ;
Faiz, Sami ;
Morcos, Jacques .
PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, :500-503
[10]   Efficient Influence Maximization in Social Networks [J].
Chen, Wei ;
Wang, Yajun ;
Yang, Siyu .
KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, :199-207