An event-based data processing system using Kafka container cluster on Kubernetes environment

被引:4
作者
Liu, Jung-Chun [1 ]
Hsu, Ching-Hsien [2 ]
Zhang, Jia-Hao [1 ]
Kristiani, Endah [1 ,3 ]
Yang, Chao-Tung [1 ,4 ]
机构
[1] Tunghai Univ, Dept Comp Sci, 1727, Sec 4, Taiwan Blvd, Taichung 407224, Taiwan
[2] Asia Univ, Coll Informat & Elect Engn, 500 Lioufeng Rd, Taichung 41354, Taiwan
[3] Krida Wacana Christian Univ, Dept Informat, Tanjung Duren Raya 4, Jakarta Barat 11470, Indonesia
[4] Tunghai Univ, Res Ctr Smart Sustainable Circular Econ, 1727, Sec 4, Taiwan Blvd, Taichung 407224, Taiwan
关键词
Smart manufacturing; Container; Kubernetes; Kafka cluster; Big data; ENERGY MANAGEMENT; BIG DATA; CONSUMPTION; INDUSTRIAL; FRAMEWORK;
D O I
10.1007/s00521-023-08326-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Smart manufacturing has become a big trend of a new industrial revolution in the manufacturing industry. The advancement of the Internet of Things has made production more efficient and effective through the automated collecting data system and Big Data technology. Dealing with a large amount of real-time production data will be a significant issue for intelligent manufacturing. This paper uses Apache Kafka's high-performance, low-latency data stream processing platform to process data collection and store it in the Big Data System. Kafka was deployed through Kubernetes, where it has improved on the architecture's scalability and applies this architecture to the aerospace manufacturing autoclave. These data are then used to analyze the autoclave equipment anomaly. Testing performed on the Kafka Producer Throughput demonstrates that in the event that all other parameters remain unchanged, the real throughput will increase along with the increase in the throughput limit that is being used. For instance, when the throughput limit is 1.2 million, the maximum throughput of this experiment is reached at 1.13 million transactions per second, while the transfer rate is 552.88 megabytes per second (MB/s). The value of the fetch size parameter is set to 10,48,576 by default (1 M). It takes half a time and a quarter of a time down, and it takes up to 2.5 times the value that was preset before you can witness the change in the parameters that affect the performance. The performance achieves its peak of 1.43 million data transferred per second at a speed of 347.93 megabytes per second, and the performance after that has a tendency to remain consistent.
引用
收藏
页码:8095 / 8112
页数:18
相关论文
共 50 条
[41]   Event-Based Runtime Verification of Temporal Properties Using Time Basic Petri Nets [J].
Camilli, Matteo ;
Gargantini, Angelo ;
Scandurra, Patrizia ;
Bellettini, Carlo .
NASA FORMAL METHODS (NFM 2017), 2017, 10227 :115-130
[42]   Model of Knowledge-Based Process Management System Using Big Data in the Wireless Communication Environment [J].
Kyoo-Sung Noh .
Wireless Personal Communications, 2018, 98 :3147-3162
[43]   Model of Knowledge-Based Process Management System Using Big Data in the Wireless Communication Environment [J].
Noh, Kyoo-Sung .
WIRELESS PERSONAL COMMUNICATIONS, 2018, 98 (04) :3147-3162
[44]   Proposition of an employability prediction system using data mining techniques in a big data environment [J].
Saouabi, Mohamed ;
Ezzati, Abdellah .
INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2019, 14 (02) :411-424
[45]   Flow data processing paradigm and its application in smart city using a cluster analysis approach [J].
Xiang Zou ;
Jinghua Cao ;
Wei Sun ;
Quan Guo ;
Tao Wen .
Cluster Computing, 2019, 22 :435-444
[46]   Flow data processing paradigm and its application in smart city using a cluster analysis approach [J].
Zou, Xiang ;
Cao, Jinghua ;
Sun, Wei ;
Guo, Quan ;
Wen, Tao .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (02) :435-444
[47]   Communication-Efficient Cluster Scalable Genomics Data Processing Using Apache Arrow Flight [J].
Ahmad, Tanveer ;
Ma, Chengxin ;
Al-Ars, Zaid ;
Hofstee, H. Peter .
2022 21ST INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC 2022), 2022, :138-146
[48]   Design of computer big data processing system based on genetic algorithm [J].
Chen, Song .
SOFT COMPUTING, 2023, 27 (11) :7667-7678
[49]   Design of computer big data processing system based on genetic algorithm [J].
Song Chen .
Soft Computing, 2023, 27 :7667-7678
[50]   VISUAL ENVIRONMENT MONITORING SYSTEM BASED ON CLOUD COMPUTING AND BIG DATA [J].
Fang, Li ;
Zhi, Zhang .
JOURNAL OF ENVIRONMENTAL PROTECTION AND ECOLOGY, 2022, 23 (05) :2150-2157