Integrated framework to integrate Spark-based big data analytics and for health monitoring and recommendation in sports using XGBoost algorithm

Cited by: 1

Authors:

Zhao, Yin [1]

Ramos, Ma. Finipina [2]

Li, Bin [3]

Affiliations:

[1] Southwest Med Univ, Sch Phys Educ, Studies Sect, Luzhou 646000, Sichuan, Peoples R China

[2] Jose Rizal Univ, Grad Sch, Mandaluyong 1552, Philippines

[3] Southwest Med Univ, Sch Phys Educ, Luzhou 646000, Sichuan, Peoples R China

Source:

SOFT COMPUTING | 2023 / Vol. 28 / Issue 2

Keywords:

Big data; Spark; Data mining; XGBoost algorithm; Sports medical integration; Service system construction;

DOI:

10.1007/s00500-023-09450-9

CLC number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, technological advancements have spread across various industries, including sports medicine. Developments such as big data analytics and data mining have reshaped medical services in sports. This shift is motivated by the need to enhance athletic performance, prevent injuries, and offer individualized health advice. Modern lifestyles have simultaneously increased people's attention to their health, creating a demand for better medical services. However, China's ability to provide superior medical care is limited by a lack of medical resources and an ever-increasing patient population. To address these challenges, this research paper presents an integrated framework that leverages Spark-based big data analytics and the XGBoost algorithm. The framework aims to provide a robust sports medical service encompassing real-time health monitoring and data-driven insights. Built on the distributed computing platform Spark, it manages the extensive sports data generated during training and events and supports near-instant health evaluations. Incorporating the XGBoost algorithm for data mining strengthens health prediction and recommendation capabilities. Known for its predictive performance, XGBoost is well suited to discerning intricate patterns and trends in sports data, and its handling of intricate feature selection and modeling tasks supports precise, actionable insights. Empirical findings indicate substantial improvements in sports medical services. When applied to chronic disease datasets, the XGBoost algorithm achieved a 93% trust rate. Compared with conventional methods such as K-Nearest Neighbors (KNN), Random Forest (RF), Decision Trees (DT), Support Vector Machines (SVM), Naive Bayes (NB), and Logistic Regression (LR), the proposed framework consistently outperforms these established techniques. This performance underscores the potential of the integrated framework to transform sports medical services.

Pages: 1585-1608

Page count: 24
