An efficient architecture for processing real-time traffic data streams using apache flink

被引:0
作者
B. Gnana Deepthi
K. Sandhya Rani
P. Venkata Krishna
V. Saritha
机构
[1] Sri Padmavati Mahila Visvavidyalayam,Department of Computer Science and Engineering
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Big Data; Big Data Processing; Stream Computing; Apache Flink;
D O I
暂无
中图分类号
学科分类号
摘要
Big Data technologies emerging day by day and are making drastic changes in various real-world applications. Traditional data mining tools adequate to process volumes of data but from past decades the rapid growth in data becomes difficult for processing. Due to continuous flow of data, data streams require additional computational processing than the traditional one. Big data stream processing considers different features of the data streams heterogeneity, scalability, fault tolerance and query optimization. Efficient implementation of these features in real-world applications using big data analytics is a challenging job during data storage, processing, and analysis phases. Therefore, the proposed model FRTSPS is a generic architecture which is influenced by popular big data processing Lambda architecture, based on distributed computing platform. The architecture using open-source platform Apache Flink for doing data processing. Flink is a popular platform for processing historical and stream data flows at once parallelly. Its stateful streaming can obtain more scalability and flexibility along with high throughput and low latency than the remaining stream processing programming models.
引用
收藏
页码:37369 / 37385
页数:16
相关论文
共 47 条
[1]  
Isah H(2019)A survey of distributed data stream processing frameworks IEEE Access 7 154300-154316
[2]  
Abughofa T(2020)A review on big data real-time stream processing and its scheduling techniques Int J Parallel Emergent Distrib Syst 35 571-601
[3]  
Mahfuz S(2016)Apache Flink in current research It-Inform Technol 58 157-165
[4]  
Ajerla D(2016)A formal definition of Big Data based on its essential features Libr Rev 65 122-135
[5]  
Zulkernine F(2022)Big data for traffic estimation and prediction: a survey of data and tools Appl Syst Innov 5 23-28
[6]  
Khan S(2019)BigData analysis in healthcare: apache hadoop, apache spark and apache flink Front Health Inform 8 14-109431
[7]  
Tantalaki N(2020)Composing high-level stream processing pipelines J Big Data 7 1-28
[8]  
Souravlas S(2021)Influencing factors in the scalability of distributed stream processing jobs IEEE Access 9 109413-93763
[9]  
Roumeliotis M(2019)An efficient and unique TF/IDF algorithmic model-based data analysis for handling applications with big data streaming Electronics 8 1331-116
[10]  
Rabl T(2020)A distributed stream processing middleware framework for real-time analysis of heterogeneous data on big data platform: case of environmental monitoring Sensors 20 3166-1858