Real-Time Processing of Big Data Streams: Lifecycle, Tools, Tasks, and Challenges<bold> </bold>

被引:0
作者
Gurcan, Fatih [1 ]
Berigel, Muhammet [2 ]
机构
[1] Karadeniz Tech Univ, Dept Comp Engn, Trabzon, Turkey
[2] Karadeniz Tech Univ, Dept Management Informat Syst, Trabzon, Turkey
来源
2018 2ND INTERNATIONAL SYMPOSIUM ON MULTIDISCIPLINARY STUDIES AND INNOVATIVE TECHNOLOGIES (ISMSIT) | 2018年
关键词
Big data streams; real-time big data processing; lifecycle; tools; tasks; challenges <bold>; </bold>;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In today's technological environments, the vast majority of big data-driven applications and solutions are based on real-time processing of streaming data. The real-time processing and analytics of big data streams play a crucial role in the development of big-data driven applications and solutions. From this perspective, this paper defines a lifecycle for the real-time big data processing. It describes existing tools, tasks, and frameworks by associating them with the phases of the lifecycle, which include data ingestion, data storage, stream processing, analytical data store, and analysis and reporting. The paper also investigates the real-time big data processing tools consisting of Flume, Kafka, Nifi, Storm, Spark Streaming, S4, Flink, Samza, Hbase, Hive, Cassandra, Splunk, and Sap Hana. As well as, it discusses the up-to-date challenges of the real-time big data processing such as "volume, variety and heterogeneity", "data capture and storage", "inconsistency and incompleteness", "scalability", "real-time processing", "data visualization", "skill requirements", and "privacy and security". This paper may provide valuable insights into the understanding of the lifecycle, related tools and tasks, and challenges of real-time big data processing.<bold> </bold>
引用
收藏
页码:284 / 289
页数:6
相关论文
共 13 条
[1]  
[Anonymous], 2011, BIG DATA NEXT FRONTI
[2]  
[Anonymous], 2001, 3D DATA MANAGEMENT C
[3]  
Carbone P, 2015, IEEE DATA ENG B, V36
[4]   Emerging trends and technologies in big data processing [J].
Casado, Ruben ;
Younas, Muhammad .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (08) :2078-2091
[5]   Data-intensive applications, challenges, techniques and technologies: A survey on Big Data [J].
Chen, C. L. Philip ;
Zhang, Chun-Yang .
INFORMATION SCIENCES, 2014, 275 :314-347
[6]  
Katal A, 2013, INT CONF CONTEMP, P404, DOI 10.1109/IC3.2013.6612229
[7]   Big Data: Survey, Technologies, Opportunities, and Challenges [J].
Khan, Nawsher ;
Yaqoob, Ibrar ;
Hashem, Ibrahim Abaker Targio ;
Inayat, Zakira ;
Ali, Waleed KamaleldinMahmoud ;
Alam, Muhammad ;
Shiraz, Muhammad ;
Gani, Abdullah .
SCIENTIFIC WORLD JOURNAL, 2014,
[8]   Real-time processing of streaming big data [J].
Safaei, Ali A. .
REAL-TIME SYSTEMS, 2017, 53 (01) :1-44
[9]  
Samal B., 2017, Int. J. Eng. Comput. Sci, V6, P22551, DOI [10.18535/ijecs/v6i10.04, DOI 10.18535/IJECS/V6I10.04]
[10]   Beyond Batch Processing: Towards Real-Time and Streaming Big Data [J].
Shahrivari, Saeed .
COMPUTERS, 2014, 3 (04) :117-129