Machine Learning with Distributed Data Management and Process Architecture

Cited by: 0
Authors
Baysal, Engin [1]
Bayilmis, Cuneyt [2]
Affiliations
[1] Istanbul Gedik Univ, Gedik Vocat Sch, Istanbul, Turkey
[2] Sakarya Univ, Comp & Informat Engn, Sakarya, Turkey
Source
2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK) | 2019
Keywords
big data; big data analytics; machine learning; apache spark; pyspark; logistic regression; yarn
DOI
10.1109/ubmk.2019.8907073
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
As technology becomes more pervasive in daily life, the volume of data produced has become almost impossible to manage and process by conventional means, which creates a need for new approaches to storage and analysis. Growth in both the size and the variety of data has required new methods to be developed. This study uses distributed data management and analysis tools developed for data that cannot be processed with traditional methods. A machine learning application was developed using the Logistic Regression classification algorithm. The application was implemented on a data set obtained from sensors, using pyspark libraries on a Spark cluster created with the Google Cloud service. The working environment, managed by YARN, was observed while the application ran.
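The classification step named in the abstract can be illustrated with a minimal single-machine sketch. This is not the authors' pyspark code and does not run on Spark or YARN; it only shows how a Logistic Regression classifier can be fitted by batch gradient descent. The two sensor features (scaled temperature and humidity), the labelling rule, the function names, and the learning rate are all hypothetical, chosen for illustration.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def train_logistic_regression(rows, labels, lr=1.0, epochs=1000):
    """Fit weights and bias by batch gradient descent on the mean log loss."""
    n = len(rows)
    n_features = len(rows[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(rows, labels):
            p = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
            err = p - y  # derivative of the log loss w.r.t. the linear score
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def predict(weights, bias, x):
    """Return class 1 when the estimated probability is at least 0.5."""
    score = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if sigmoid(score) >= 0.5 else 0


# Hypothetical sensor data: two readings scaled to [0, 1];
# label 1 when their sum exceeds 1.0 (an assumed rule, not from the paper).
random.seed(0)
rows, labels = [], []
for _ in range(200):
    t, h = random.random(), random.random()
    rows.append([t, h])
    labels.append(1 if t + h > 1.0 else 0)

weights, bias = train_logistic_regression(rows, labels)
accuracy = sum(predict(weights, bias, x) == y
               for x, y in zip(rows, labels)) / len(rows)
print(f"training accuracy: {accuracy:.2f}")
```

In a distributed setting such as the one the abstract describes, the per-row gradient sum is the part that would be computed in parallel across the cluster, with the weight update applied once per pass.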
Pages: 53-57
Number of pages: 5