Developing a Real-time Data Analytics Framework For Twitter Streaming Data

Cited by: 20
Authors
Yadranjiaghdam, Babak [1]
Yasrobi, Seyedfaraz [1]
Tabrizi, Nasseh [1]
Affiliation
[1] East Carolina Univ, Dept Comp Sci, Greenville, NC 27858 USA
Source
2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017) | 2017
Keywords
Streaming processing; Big Data; Kafka; Spark; Twitter; Real-time
DOI
10.1109/BigDataCongress.2017.49
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Twitter is an online social networking service with more than 300 million users, generating a huge amount of information every day. Twitter's most important characteristic is that it lets users tweet about events, situations, feelings, opinions, or even something totally new, in real time. Several existing workflows offer real-time data analysis for Twitter, providing general processing over streaming data. This study attempts to develop an analytical framework capable of in-memory processing to extract and analyze structured and unstructured Twitter data. The proposed framework includes data ingestion, stream processing, and data visualization components, with the Apache Kafka messaging system performing the data ingestion task. Furthermore, Spark makes it possible to run sophisticated data processing and machine learning algorithms in real time. We conducted a case study on tweets about the earthquake in Japan and the reactions of people around the world, analyzing the time and origin of the tweets.
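The three stages named in the abstract (ingestion, stream processing, visualization) can be illustrated with a small in-memory sketch. This is not the authors' implementation and uses neither Kafka nor Spark: a standard-library queue stands in for a Kafka topic, and a micro-batch loop loosely mimics a streaming job. The Tweet fields, the function names, the keyword filter, and the sample tweets with their timestamps are all hypothetical, chosen only to show counting by origin and by hour.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timezone
from queue import Empty, Queue


@dataclass
class Tweet:
    # Hypothetical record layout; real tweet payloads carry many more fields.
    text: str
    country: str
    created_at: datetime


def ingest(tweets, topic):
    """Ingestion stage: push raw tweets onto a queue (stand-in for a Kafka topic)."""
    for tweet in tweets:
        topic.put(tweet)


def drain_batch(topic, max_size):
    """Pull up to max_size tweets without blocking, forming one micro-batch."""
    batch = []
    while len(batch) < max_size:
        try:
            batch.append(topic.get_nowait())
        except Empty:
            break
    return batch


def process(batch, keyword, by_country, by_hour):
    """Processing stage: keep tweets mentioning the keyword, count by origin and hour."""
    for tweet in batch:
        if keyword in tweet.text.lower():
            by_country[tweet.country] += 1
            by_hour[tweet.created_at.strftime("%Y-%m-%d %H:00")] += 1


def render(counter):
    """Visualization stage: a plain-text bar chart, largest count first."""
    return "\n".join(f"{key:<18}{'#' * n} {n}" for key, n in counter.most_common())


def run_pipeline(tweets, keyword="earthquake", batch_size=2):
    """Wire the stages together and return the two aggregates."""
    topic = Queue()
    ingest(tweets, topic)
    by_country, by_hour = Counter(), Counter()
    while True:
        batch = drain_batch(topic, batch_size)
        if not batch:
            break
        process(batch, keyword, by_country, by_hour)
    return by_country, by_hour


# Made-up sample data with synthetic timestamps, for illustration only.
SAMPLE = [
    Tweet("Strong earthquake felt in Tokyo", "JP",
          datetime(2000, 1, 1, 9, 5, tzinfo=timezone.utc)),
    Tweet("Praying for Japan after the Earthquake", "US",
          datetime(2000, 1, 1, 9, 40, tzinfo=timezone.utc)),
    Tweet("Lunch was great today", "US",
          datetime(2000, 1, 1, 9, 45, tzinfo=timezone.utc)),
    Tweet("earthquake updates from Osaka", "JP",
          datetime(2000, 1, 1, 10, 10, tzinfo=timezone.utc)),
]

if __name__ == "__main__":
    countries, hours = run_pipeline(SAMPLE)
    print(render(countries))
    print(render(hours))
```

In a real deployment the queue would presumably be replaced by a Kafka consumer and the loop by a Spark streaming job; the sketch only shows how the stages hand data to one another.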
Pages: 329-336
Page count: 8
Related papers
50 in total
  • [1] Big Data Streaming Platforms to Support Real-time Analytics
    Fernandes, Eliana
    Salgado, Ana Carolina
    Bernardino, Jorge
    ICSOFT: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2020, : 426 - 433
  • [2] Using a Rich Context Model for Real-Time Big Data Analytics in Twitter
    Sotsenko, Alisa
    Jansen, Marc
    Milrad, Marcelo
    Rana, Juwel
    2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW), 2016, : 228 - 233
  • [3] Twitter Streaming Data Analytics for Disaster Alerts
    Shah, Syed Attique
    Ben Yahia, Sadok
    McBride, Keegan
    Jamil, Akhtar
    Draheim, Dirk
    2ND INTERNATIONAL INFORMATICS AND SOFTWARE ENGINEERING CONFERENCE (IISEC), 2021,
  • [4] Big Data Stream Computing in Healthcare Real-Time Analytics
    Ta, Van-Dai
    Liu, Chuan-Ming
    Nkabinde, Goodwill Wandile
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2016), 2016, : 37 - 42
  • [5] Real-time streaming mobility analytics
    Garzo, Andras
    Benczur, Andras A.
    Sidlo, Csaba Istvan
    Tahara, Daniel
    Wyatt, Erik Francis
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [6] Real-Time Data Analytics: An Algorithmic Perspective
    Morshed, Sarwar Jahan
    Rana, Juwel
    Milrad, Marcelo
    DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 311 - 320
  • [7] Real-Time Clickstream Data Analytics and Visualization
    Hanamanthrao, Ramanna
    Thejaswini, S.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 2139 - 2144
  • [8] An incremental approach for real-time Big Data visual analytics
    Garcia, Ignacio
    Casado, Ruben
    Bouchachia, Abdelhamid
    2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW), 2016, : 177 - 182
  • [9] Real-time Big Data Analytics for Multimedia Transmission and Storage
    Wang, Kun
    Mi, Jun
    Xu, Chenhan
    Shu, Lei
    Deng, Der-Jiunn
    2016 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2016,
  • [10] Big Data Analytics Architecture for Real-Time Traffic Control
    Amini, Sasan
    Gerostathopoulos, Ilias
    Prehofer, Christian
    2017 5TH IEEE INTERNATIONAL CONFERENCE ON MODELS AND TECHNOLOGIES FOR INTELLIGENT TRANSPORTATION SYSTEMS (MT-ITS), 2017, : 710 - 715