Developing a Real-time Data Analytics Framework For Twitter Streaming Data

Cited by: 20
Authors
Yadranjiaghdam, Babak [1]
Yasrobi, Seyedfaraz [1]
Tabrizi, Nasseh [1]
Affiliations
[1] East Carolina Univ, Dept Comp Sci, Greenville, NC 27858 USA
Source
2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017) | 2017
Keywords
Streaming processing; Big Data; Kafka; Spark; Twitter; Real-time
DOI
10.1109/BigDataCongress.2017.49
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Twitter is an online social networking service with more than 300 million users, generating a huge amount of information every day. Twitter's most important characteristic is that it lets users tweet about events, situations, feelings, opinions, or even something totally new, in real time. Several workflows currently offer real-time data analysis for Twitter, presenting general processing over streaming data. This study attempts to develop an analytical framework with in-memory processing capability to extract and analyze structured and unstructured Twitter data. The proposed framework includes data ingestion, stream processing, and data visualization components, with the Apache Kafka messaging system performing the data ingestion task. Spark, in turn, makes it possible to run sophisticated data processing and machine learning algorithms in real time. We conducted a case study on tweets about the earthquake in Japan and the reactions of people around the world, analyzing the time and origin of the tweets.
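The three stages named in the abstract (ingestion, stream processing, visualization) can be sketched in a few lines. This is only an illustrative stand-in, not the authors' implementation: a standard-library `Queue` takes the place of a Kafka topic, plain Python counting takes the place of Spark, and the sample tweets, field names (`minute`, `country`, `text`), and function names are all hypothetical.

```python
from collections import Counter
from queue import Queue

# Hypothetical sample tweets; fields and values are invented for illustration.
SAMPLE_TWEETS = [
    {"minute": 0, "country": "JP", "text": "Strong earthquake just now"},
    {"minute": 0, "country": "JP", "text": "Everything is shaking #earthquake"},
    {"minute": 1, "country": "US", "text": "Thoughts with Japan after the earthquake"},
    {"minute": 1, "country": "JP", "text": "Aftershock felt here"},
    {"minute": 2, "country": "BR", "text": "Nice weather today"},
]


def ingest(tweets, topic):
    """Ingestion stage: push raw records onto a queue (stand-in for a Kafka topic)."""
    for tweet in tweets:
        topic.put(tweet)
    topic.put(None)  # sentinel marking the end of the sample stream


def process(topic, keyword):
    """Processing stage: keep tweets containing the keyword, count by minute and country."""
    by_minute, by_country = Counter(), Counter()
    while True:
        tweet = topic.get()
        if tweet is None:
            break
        if keyword in tweet["text"].lower():
            by_minute[tweet["minute"]] += 1
            by_country[tweet["country"]] += 1
    return by_minute, by_country


def render(counts):
    """Visualization stage: one text bar per key, sorted by key."""
    return [f"{key}: {'#' * n}" for key, n in sorted(counts.items())]


topic = Queue()
ingest(SAMPLE_TWEETS, topic)
by_minute, by_country = process(topic, "earthquake")
print("\n".join(render(by_minute)))
print("\n".join(render(by_country)))
```

In a real deployment the queue would be replaced by a message broker and the counting loop by a distributed stream processor; the sketch only shows how the time and origin of matching tweets flow through the three stages.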
Pages: 329-336
Page count: 8