Large-Scale Social-Media Analytics on Stratosphere

Cited by: 0
Authors
Boden, Christoph [1]
Markl, Volker [1]
Karnstedt, Marcel [2]
Fernandez, Miriam [3]
Affiliations
[1] TU Berlin, Berlin, Germany
[2] NUI Galway, DERI, Galway, Ireland
[3] Knowledge Media Inst, Milton Keynes, Bucks, England
Source
PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION) | 2013
Keywords
Role Analysis; Behaviour Analysis; Online Communities; Scalability; Stratosphere; Community Analysis; Boards.ie
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The importance of social-media platforms and online communities, in business as well as public contexts, is increasingly acknowledged by industry and researchers alike. Consequently, a wide range of analytics has been proposed to understand, steer, and exploit the mechanics and laws driving their functionality and creating the resulting benefits. However, analysts usually face significant problems in scaling existing and novel approaches to match the data volume and size of modern online communities. In this work, we propose and demonstrate the use of the massively parallel data processing system Stratosphere, which is based on second-order functions as an extended notion of the MapReduce paradigm, to provide a new level of scalability for such social-media analytics. Using the popular example of role analysis, we illustrate how this massively parallel approach can be leveraged to scale out complex data-mining tasks while providing a programming approach that eases the formulation of complete analytical workflows.
Pages: 257-260
Page count: 4