Modeling E-mail Networks and Inferring Leadership Using Self-Exciting Point Processes

被引:56
作者
Fox, Eric W. [1 ]
Short, Martin B. [1 ]
Schoenberg, Frederic P. [1 ]
Coronges, Kathryn D. [1 ]
Bertozzi, Andrea L. [1 ]
机构
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
关键词
Conditional intensity; Enron E-mail dataset; Hawkes process; IkeNet dataset; Social networks; HEAVY TAILS;
D O I
10.1080/01621459.2015.1135802
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose various self-exciting point process models for the times when e-mails are sent between individuals in a social network. Using an expectation maximization (EM)-type approach, we fit these models to an e-mail network dataset from West Point Military Academy and the Enron e-mail dataset. We argue that the self-exciting models adequately capture major temporal clustering features in the data and perform better than traditional stationary Poisson models. We also investigate how accounting for diurnal and weekly trends in e-mail activity improves the overall fit to the observed network data. A motivation and application forfitting these self-exciting models is to use parameter estimates to characterize important e-mail communication behaviors such as the baseline sending rates, average reply rates, and average response times. A primary goal is to use these features, estimated from the self-exciting models, to infer the underlying leadership status of users in the West Point and Enron networks. Supplementary materials for this article are available online.
引用
收藏
页码:564 / 584
页数:21
相关论文
共 31 条
  • [1] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [2] [Anonymous], 2004, Information Sciences Institute Technical Report
  • [3] [Anonymous], 2013, Temporal Networks
  • [4] Application of Branching Models in the Study of Invasive Species
    Balderama, Earvin
    Schoenberg, Frederic Paik
    Murray, Erin
    Rundel, Philip W.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (498) : 467 - 476
  • [5] The origin of bursts and heavy tails in human dynamics
    Barabási, AL
    [J]. NATURE, 2005, 435 (7039) : 207 - 211
  • [6] Cohen W.W, 2015, ENRON EMAIL DATASET
  • [7] Congress, 2003, REPORT INVESTIGATION
  • [8] Segmentation and Automated Social Hierarchy Detection through Email Network Analysis
    Creamer, German
    Rowe, Ryan
    Hershkop, Shlomo
    Stolfo, Salvatore J.
    [J]. ADVANCES IN WEB MINING AND WEB USAGE ANALYSIS, 2009, 5439 : 40 - +
  • [9] Daley D. J., 2003, An Introduction to the Theory of Point Processes, V2, DOI [10.1007/b97277, DOI 10.1007/B97277]
  • [10] Modelling Dyadic Interaction with Hawkes Processes
    Halpin, Peter F.
    De Boeck, Paul
    [J]. PSYCHOMETRIKA, 2013, 78 (04) : 793 - 814