Developing a Real-time Data Analytics Framework For Twitter Streaming Data

Cited by: 20
Authors
Yadranjiaghdam, Babak [1]
Yasrobi, Seyedfaraz [1]
Tabrizi, Nasseh [1]
Affiliations
[1] East Carolina Univ, Dept Comp Sci, Greenville, NC 27858 USA
Keywords
Streaming processing; Big Data; Kafka; Spark; Twitter; Real-time
DOI
10.1109/BigDataCongress.2017.49
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Twitter is an online social networking service with more than 300 million users, generating a huge amount of information every day. Twitter's most important characteristic is that it lets users tweet in real time about events, situations, feelings, opinions, or even something totally new. Several existing workflows offer real-time data analysis for Twitter, but they provide only general processing over streaming data. This study attempts to develop an analytical framework capable of in-memory processing to extract and analyze structured and unstructured Twitter data. The proposed framework includes data ingestion, stream processing, and data visualization components, with the Apache Kafka messaging system performing the data ingestion task. Furthermore, Spark makes it possible to run sophisticated data processing and machine learning algorithms in real time. We have conducted a case study on tweets about the earthquake in Japan and the reactions of people around the world, analyzing the time and origin of the tweets.
Pages: 329-336
Number of pages: 8
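The abstract describes a pipeline of ingestion, stream processing, and analysis by time and origin. The following is only a minimal sketch of that flow, not the authors' implementation: a standard-library `Queue` stands in for a Kafka topic, a plain Python loop stands in for Spark, and the function names (`ingest`, `process`, `run`), the record fields (`created_at`, `country`), and the sample records are all hypothetical.

```python
from collections import Counter
from datetime import datetime
from queue import Queue

# Made-up sample records for illustration only; not data from the paper.
SAMPLE_TWEETS = [
    {"text": "sample a", "created_at": "2017-01-01T09:15:00", "country": "JP"},
    {"text": "sample b", "created_at": "2017-01-01T09:40:00", "country": "US"},
    {"text": "sample c", "created_at": "2017-01-01T10:05:00", "country": "JP"},
    {"text": "sample d", "created_at": "2017-01-01T10:30:00"},  # origin missing
]


def ingest(tweets, topic):
    """Ingestion stage: push raw records onto a queue (stand-in for a Kafka topic)."""
    for tweet in tweets:
        topic.put(tweet)
    topic.put(None)  # sentinel marking the end of this sample stream


def process(topic):
    """Processing stage: consume records and count them by hour and by origin."""
    by_hour, by_origin = Counter(), Counter()
    while True:
        tweet = topic.get()
        if tweet is None:
            break
        created = datetime.fromisoformat(tweet["created_at"])
        by_hour[created.strftime("%Y-%m-%d %H:00")] += 1
        by_origin[tweet.get("country", "unknown")] += 1
    return by_hour, by_origin


def run(tweets):
    """Wire the two stages together in memory."""
    topic = Queue()
    ingest(tweets, topic)
    return process(topic)


if __name__ == "__main__":
    hours, origins = run(SAMPLE_TWEETS)
    print("Tweets per hour:", dict(hours))
    print("Tweets per origin:", dict(origins))
```

In a deployment along the lines the abstract describes, the queue would presumably be replaced by a Kafka topic and the counting loop by a Spark streaming job; the sketch only shows how records move from ingestion to time and origin aggregates.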